Here is the problem with our current system. We have stations distributed all around the country gathering information and sending it to our database. There are about 300 stations, and our database server is PostgreSQL 8.4 running on Debian 6 (two quad-core Xeons with a single 1 TB hard disk). Because this setup does not scale well, we have decided to change our architecture, but with so many relational and non-relational systems out there, I am having trouble choosing one for our needs. Here are our challenges:
- The current system generates 4 million records per day (roughly 46 rows per second on average), and we are planning to go beyond 1,000 stations. We need a product that can handle 2,000+ connections per second on our current server over the next 5 years.
- We do not need much validation of the incoming data (I think we need NoSQL here). The stations do not even check whether the data was delivered; they just send it and move on.
- Currently we use triggers to update our report tables based on the incoming data (a simplified sketch of such a trigger follows this list). The problem is that after a month the indexes become bloated and require a lot of maintenance. Is there any trigger-like mechanism that does not cause index bloat, or should we forget about real-time report generation and use an OLAP product?
- The DBMS should handle at most 5 TB of data in a single table. On our current system, because of the high load, we delete old data (older than two or three months), so we cannot generate reports on data from last year.
- We do not need many ACID features; we need simple inserts and selects. We do not use two-phase commit anywhere in our systems. What we need is extremely fast inserts (see the batching sketch after this list).
- Is table partitioning a good fit for our problem? We categorize our data based on date (see the partitioning sketch after this list).
- As I said, we need real-time reports (when I say real time, I mean that a report covering data up to yesterday is good enough). Are there any open-source OLAP products that can be fed directly from the incoming data?
- We do not need high availability to the point of being forced to run a cluster.
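
For context, here is a simplified sketch of the kind of trigger we use to maintain a report table. The table and column names (`measurements`, `daily_report`, `station_id`, `recorded_at`, `value`) are made up for illustration, not our real schema:

```sql
-- Simplified, hypothetical version of our report trigger: each insert into the
-- raw measurements table bumps a per-station, per-day aggregate row.
CREATE OR REPLACE FUNCTION update_daily_report() RETURNS trigger AS $$
BEGIN
    UPDATE daily_report
       SET sample_count = sample_count + 1,
           value_sum    = value_sum + NEW.value
     WHERE station_id = NEW.station_id
       AND report_day = NEW.recorded_at::date;

    IF NOT FOUND THEN
        INSERT INTO daily_report (station_id, report_day, sample_count, value_sum)
        VALUES (NEW.station_id, NEW.recorded_at::date, 1, NEW.value);
    END IF;

    RETURN NEW;
END;
$$ LANGUAGE plpgsql;

CREATE TRIGGER measurements_report_trg
AFTER INSERT ON measurements
FOR EACH ROW EXECUTE PROCEDURE update_daily_report();
```

The constant UPDATEs on the report table are what leave behind the dead tuples and bloated indexes that we keep having to VACUUM and REINDEX.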
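
To give a sense of what I mean by "extremely fast inserts": today each station sends rows one at a time, and what we are after is sustained ingest, for example batched multi-row inserts or bulk loads with COPY. Again a sketch with the same hypothetical schema and made-up sample values:

```sql
-- One round trip per batch instead of one per row (hypothetical schema).
INSERT INTO measurements (station_id, recorded_at, value)
VALUES (17, '2013-05-01 10:00:00', 21.4),
       (17, '2013-05-01 10:00:10', 21.5),
       (17, '2013-05-01 10:00:20', 21.6);

-- Or bulk-load a file of accumulated rows in one statement:
COPY measurements (station_id, recorded_at, value)
FROM '/var/spool/station17/batch-0001.csv' WITH CSV;
```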
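
And this is roughly what I have in mind for the partitioning question. PostgreSQL 8.4 has no declarative partitioning, so it would be inheritance-based partitioning by date, sketched here with the same hypothetical table names:

```sql
-- Parent table holds no data; one child table per month, constrained by a
-- CHECK so that constraint_exclusion can skip irrelevant months in queries.
CREATE TABLE measurements (
    station_id  integer    NOT NULL,
    recorded_at timestamp  NOT NULL,
    value       numeric
);

CREATE TABLE measurements_2013_05 (
    CHECK (recorded_at >= DATE '2013-05-01' AND recorded_at < DATE '2013-06-01')
) INHERITS (measurements);

CREATE INDEX measurements_2013_05_recorded_at_idx
    ON measurements_2013_05 (recorded_at);

-- Old months could then be dropped instantly instead of DELETEd row by row:
-- DROP TABLE measurements_2013_02;
```

The appeal for us is that dropping a whole month's child table avoids the mass DELETEs (and the resulting bloat) we do today, but I am not sure whether the extra routing logic for inserts is worth it at our insert rates.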