This page refers to the Fall 2012 offering of CSE 704 only. The information on this page does not necessarily apply to every offering of CSE 704.
Fall 2012
23474
Web-scale data management systems
This seminar explores techniques and systems built to scale data storage, analytics, stream processing, and transaction processing to the massive data sizes and rates seen in many modern application domains. Topics covered may include large-scale data storage systems (Cassandra, BigTable/HBase, PNUTS), distributed data analysis frameworks (MapReduce/Hadoop, Dryad), large-scale data analytics systems (Pig, HadoopDB, Greenplum), distributed transaction processing systems (H-Store, Dynamo, Percolator), and stream processing systems (Storm, Borealis, DBToaster). A small number of papers will be selected for the course, approximately one per lecture. Students are expected to submit a short summary/critique of each week's paper and to participate in the discussion of it. Each student will also be expected to volunteer to present the basic ideas behind one of the papers and then lead that week's discussion. This course meets weekly on Mondays from 9:00 AM to 11:00 AM in Davis 113A.
CSE 462, 486, 562 (or equivalents), and familiarity with the Java programming language.
Ph.D.: This course does not fulfill core area or core course requirements.
M.S.: This course does not fulfill core area or core course requirements.