109 | Amazon elastically goes in and out; it's time to decide already, in or out

As I've already written, the distributed storage space is very hot now. Everyone who can be bothered is piling into it, which often does more harm than good, because the level of discussion drops and the noise level rises. For example, some lamers write: "Today, Amazon announced its second entry in to the world of cloud databases. Called Amazon Elastic MapReduce, this appears to be a hosted implementation of the Hadoop framework."

What specialist in their right mind would call Hadoop or MapReduce a database? They can't even be called by the more general word "storage".
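To make the distinction concrete, here is a minimal in-memory sketch of the MapReduce programming model (a word count). It is not Hadoop's API; the function names `map_phase`, `shuffle`, `reduce_phase`, and `map_reduce` are hypothetical and chosen for illustration only. The point it tries to show is that the model describes a computation over records handed to it and says nothing about where those records are stored.

```python
from collections import defaultdict
from itertools import chain


def map_phase(record):
    # Emit (key, value) pairs for one input record.
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs):
    # Group values by key, as a framework would between map and reduce.
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    # Collapse all values for one key into a single result.
    return key, sum(values)


def map_reduce(records):
    # The input is whatever the caller passes in; nothing is persisted here.
    pairs = chain.from_iterable(map_phase(r) for r in records)
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


counts = map_reduce(["Hadoop is not a database", "a database stores data"])
```

In this sketch the input list and the resulting dictionary live only in the caller's memory; any durability would have to come from a separate storage layer.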

From:

jdevelop.livejournal.com

There's BigTable in there, and that's already somewhere close.

From:

109.livejournal.com

Thanks! This comment is an excellent confirmation of my thesis.

From:

jdevelop.livejournal.com

You're welcome.

From:

109.livejournal.com

So HBase works on top of the Hadoop file system. Does that make Hadoop a database? Is Windows a database because SQL Server works on top of NTFS?

From:

katsnelson.livejournal.com

I will be the first to say that Hadoop is not a database, at least not the way we DBMS people (I spent the last 16 years working on DB2) think of databases. However, when I talk to our customers, they DO consider Hadoop to be a solution to the same set of problems they use DB2 for. So, in their minds, it is a database management system, or maybe a data processing system.
In DB2 we have a feature called the Data Partitioning Feature, which lets one distribute data across a cluster of independent database nodes. This is a shared-nothing approach, i.e. each node is responsible for its own portion of the data. When a query comes in, it is split up and executed on multiple nodes.
It is not MapReduce, but the point is that the use case is the same, i.e. running complex data processing tasks against very large data sets.
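As a rough illustration of the shared-nothing idea described above, here is a minimal in-memory sketch. It is not how DB2 implements partitioning; the `Node` and `Cluster` classes, the `customer_id` partitioning key, and the use of a simple hash modulo the node count are all assumptions made for the example.

```python
from collections import defaultdict


class Node:
    """One independent partition that owns only its own rows."""

    def __init__(self):
        self.rows = []

    def insert(self, row):
        self.rows.append(row)

    def local_sum_by_region(self):
        # Each node aggregates only the rows it owns.
        totals = defaultdict(int)
        for row in self.rows:
            totals[row["region"]] += row["amount"]
        return totals


class Cluster:
    def __init__(self, node_count):
        self.nodes = [Node() for _ in range(node_count)]

    def insert(self, row):
        # Hash the (assumed) partitioning key to pick the owning node.
        index = hash(row["customer_id"]) % len(self.nodes)
        self.nodes[index].insert(row)

    def sum_by_region(self):
        # Send the query to every node, then merge the partial results.
        merged = defaultdict(int)
        for node in self.nodes:
            for region, total in node.local_sum_by_region().items():
                merged[region] += total
        return dict(merged)


cluster = Cluster(node_count=3)
for row in [
    {"customer_id": 1, "region": "east", "amount": 10},
    {"customer_id": 2, "region": "west", "amount": 5},
    {"customer_id": 3, "region": "east", "amount": 15},
    {"customer_id": 4, "region": "east", "amount": 5},
]:
    cluster.insert(row)

result = cluster.sum_by_region()
```

Each row lives on exactly one node, and the aggregate is computed per node before the partial results are merged, which is the same split-then-combine shape as the query execution described in the comment.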

From:

109.livejournal.com

Well, neither Hadoop nor MapReduce offer any persistence by themselves, wouldn't you agree?

Anyway, I am very interested in the Data Partitioning Feature you described. Where can I read more about it?

From:

katsnelson.livejournal.com

No argument about Hadoop/MapReduce not being a persistent data store on their own, but they are typically used with HDFS/GFS.
You can find more info on the DB2 Database Partitioning Feature in this free IBM Redbook: http://www.redbooks.ibm.com/abstracts/sg246917.html. It is a bit dated and covers several partitioning options, but if you ignore table partitioning and multi-dimensional clustering, you will get a good idea of the database partitioning that DB2 does. Or you can read this: http://www.ibmpressbooks.com/articles/article.asp?p=375537&seqNum=6
