<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Dive Into A Data Deluge &#187; vldb</title>
	<atom:link href="http://diveintodata.org/tag/vldb/feed/" rel="self" type="application/rss+xml" />
	<link>http://diveintodata.org</link>
	<description>Discussion about Newly Emerging Issues on Database</description>
	<lastBuildDate>Thu, 29 Mar 2012 09:43:57 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='diveintodata.org' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Dive Into A Data Deluge &#187; vldb</title>
		<link>http://diveintodata.org</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://diveintodata.org/osd.xml" title="Dive Into A Data Deluge" />
	<atom:link rel='hub' href='http://diveintodata.org/?pushpress=hub'/>
		<item>
		<title>HadoopDB: An Open Source Parallel Database for Analytical Workloads</title>
		<link>http://diveintodata.org/2009/07/31/hadoopdb-releases/</link>
		<comments>http://diveintodata.org/2009/07/31/hadoopdb-releases/#comments</comments>
		<pubDate>Thu, 30 Jul 2009 15:01:15 +0000</pubDate>
		<dc:creator>Hyunsik Choi</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hadoopdb]]></category>
		<category><![CDATA[map-reduce]]></category>
		<category><![CDATA[vldb]]></category>

		<guid isPermaLink="false">http://diveintodata.org/?p=155</guid>
		<description><![CDATA[With the increasingly growing volume of data, the techniques to manage big data are needed in many areas. Open source community and many companies have attempted developing solutions to deal with big data. Recently, Prof. Daniel Abadi, who is an Assistant Professor of Computer Science at Yale University, announced HadoopDB release and the paper published [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=diveintodata.org&#038;blog=12237478&#038;post=155&#038;subd=diveintodata&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<p><span class="dropcaps">W</span>ith the increasingly growing volume of data, the techniques to manage big data are needed in many areas. Open source community and many companies have attempted developing solutions to deal with big data.</p>
<p>Recently, <a href="http://cs-www.cs.yale.edu/homes/dna/" target="_blank">Prof. Daniel Abadi</a>, who is an Assistant Professor of Computer Science at Yale University, announced <a href="http://dbmsmusings.blogspot.com/2009/07/announcing-release-of-hadoopdb-longer.html" target="_blank">HadoopDB release and the paper</a> published in <a href="http://vldb2009.org/" target="_blank">VLDB’09</a>. HadoopDB is an open source analytical database, being developed by him and his students. The paper states that HadoopDB is a hybrid of both MapReduce and parallel  database and it takes the best features from both.</p>
<p><img style="display:inline;margin-left:0;margin-right:0;" title="Hadoop Logo" src="http://hadoop.apache.org/images/hadoop-logo.jpg" alt="Hadoop Logo" width="198" height="47" align="right" />Actually, MapReduce has made controversial issues from a database point of view. Formerly, there was some debates. Representatively, <a href="http://pages.cs.wisc.edu/~dewitt/" target="_blank">Prof. David Dewitt</a>, who is well known as a great master of (parallel) database, critiqued that <a href="http://databasecolumn.vertica.com/2008/01/mapreduce-a-major-step-back.html" target="_blank">MapReduce is a major step backwards</a>. On the other hand, proponents of MapReduce argue that MapReduce outperforms parallel database in respect of scalability, fault tolerance, and flexibility to unstructured data.</p>
<p>This paper concludes that HadoopDB is close to the performance of parallel databases while it is similar score on fault tolerance and feasibility in heterogeneous systems as Hadoop.</p>
<p>In sum, HadoopDB is a hybrid system of MapReduce and parallel DBMS. It is quite interesting achievement. I respect their decision to release HadoopDB as open source because their achievement will more broadly contribute to Hadoop and data analytical database. Still, I do not read this paper completely, and sooner I will discuss HadoopDB in detail.</p>
<h3>Some interesting points:</h3>
<ul>
<li>They carried out experiments on a 100 node of amazon EC2 cluster.</li>
<li>They try to deal with semantic web data (i.e., RDF) by HadoopDB.</li>
<li>HadoopDB is a full open source project.</li>
<li>HadoopDB isn’t well suited for real-time data yet.</li>
<li>I can participate in his presentation at the session at VLDB.</li>
</ul>
<h3>See Also:</h3>
<ul>
<li><a href="http://news.idg.no/cw/art.cfm?id=9D2C109A-1A64-6A71-CE90BD44D98F12B1" target="_blank">Yale researchers create database-Hadoop hybrid</a>, Computer World</li>
<li><a href="http://radar.oreilly.com/2009/07/hadoopdb-an-open-source-parallel-database.html" target="_blank">HadoopDB: An Open Source Parallel Database</a>, <a href="http://radar.oreilly.com/" target="_blank">O’REILLY radar</a></li>
<li><a href="http://databasecolumn.vertica.com/2008/01/mapreduce-a-major-step-back.html" target="_blank">MapReduce: A major step backwards</a></li>
<li><a href="http://databasecolumn.vertica.com/2008/01/mapreduce-continued.html" target="_blank">MapReduce: A major step backwards (II)</a></li>
</ul>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/diveintodata.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/diveintodata.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/diveintodata.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/diveintodata.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/diveintodata.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/diveintodata.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/diveintodata.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/diveintodata.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/diveintodata.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/diveintodata.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/diveintodata.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/diveintodata.wordpress.com/155/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/diveintodata.wordpress.com/155/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/diveintodata.wordpress.com/155/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=diveintodata.org&#038;blog=12237478&#038;post=155&#038;subd=diveintodata&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://diveintodata.org/2009/07/31/hadoopdb-releases/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4213567e11cad51fc02bc2038e9ace27?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Hyunsik Choi</media:title>
		</media:content>

		<media:content url="http://hadoop.apache.org/images/hadoop-logo.jpg" medium="image">
			<media:title type="html">Hadoop Logo</media:title>
		</media:content>
	</item>
		<item>
		<title>Three nice articles that address Very Large Data Base</title>
		<link>http://diveintodata.org/2008/09/26/three-nice-articles-that-address-very-large-data-base/</link>
		<comments>http://diveintodata.org/2008/09/26/three-nice-articles-that-address-very-large-data-base/#comments</comments>
		<pubDate>Thu, 25 Sep 2008 23:23:38 +0000</pubDate>
		<dc:creator>Hyunsik Choi</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[bigTable]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hbase]]></category>
		<category><![CDATA[map-reduce]]></category>
		<category><![CDATA[vldb]]></category>

		<guid isPermaLink="false">http://diveintodata.org/2008/09/three-nice-articles-that-address-very-large-data-base/</guid>
		<description><![CDATA[Big Data: The futhre of biocuration, Nature Greenplum MapReduce for the Petabyte Database Aster nCluster: In-Database MapReduce<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=diveintodata.org&#038;blog=12237478&#038;post=47&#038;subd=diveintodata&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
			<content:encoded><![CDATA[<ul style="list-style-type:disc;">
<li><a title="[http://www.nature.com/nature/journal/v455/n7209/full/455047a.html]로 이동합니다." target="_blank" href="http://www.nature.com/nature/journal/v455/n7209/full/455047a.html">Big Data: The futhre of biocuration, Nature</a>
</li>
<li><a title="[http://www.greenplum.com/resources/mapreduce/]로 이동합니다." target="_blank" href="http://www.greenplum.com/resources/mapreduce/">Greenplum MapReduce for the Petabyte Database</a></li>
<li><a title="[http://www.asterdata.com/product/mapreduce.html]로 이동합니다." target="_blank" href="http://www.asterdata.com/product/mapreduce.html">Aster nCluster: In-Database MapReduce</a></li>
</ul>
<p></p>
<br />  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/diveintodata.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/diveintodata.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/diveintodata.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/diveintodata.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gofacebook/diveintodata.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/facebook/diveintodata.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gotwitter/diveintodata.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/twitter/diveintodata.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/diveintodata.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/diveintodata.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/diveintodata.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/diveintodata.wordpress.com/47/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/diveintodata.wordpress.com/47/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/diveintodata.wordpress.com/47/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=diveintodata.org&#038;blog=12237478&#038;post=47&#038;subd=diveintodata&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://diveintodata.org/2008/09/26/three-nice-articles-that-address-very-large-data-base/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/4213567e11cad51fc02bc2038e9ace27?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">Hyunsik Choi</media:title>
		</media:content>
	</item>
	</channel>
</rss>
