| Week |
Topic |
Presenter |
Material |
| September 5 |
No class |
|
Yanif is out of town |
| September 12 |
Welcome, SQL+DBMS intro |
Yanif Ahmad |
Course overview, and background material (relational algebra,
SQL, DBMS architecture refresher). |
|
| Section: Data models |
|
|
| September 19 |
Sequences, streams |
Yanif Ahmad |
Botan et al.: SECRET: A Model for Analysis of the Execution Semantics of Stream Processing Systems. PVLDB 3(1): 232-243 (2010) |
|
W. Lam et al.: Muppet: MapReduce-Style Processing of Fast Data. PVLDB 5(12): 1814-1825 (2012) |
| September 26 |
No class |
|
Yanif is out of town |
| October 3 |
Nested, NoSQL, NewSQL |
Naveen, Yanif |
A. Lakshman, P. Malik: Cassandra: a decentralized structured storage system. Op. Sys. Rev:44(2), 2010 |
| A. Thomson et al.: Calvin: fast distributed transactions for partitioned database systems. SIGMOD 2012. |
| October 10 |
Arrays, scientifc data |
Lakshmisha, Yanif |
J. Buck et al.: SciHadoop: array-based query processing in Hadoop. SC 2011. |
| A. Seering et al.: Efficient Versioning for Scientific Array Databases. ICDE 2012: 1013-1024 |
|
Short paper 1 due. |
|
|
| October 17 |
Graphs, recursion and constraints |
Frank, Yanif |
J. Mondal, A. Deshpande: Managing large dynamic graphs efficiently. SIGMOD 2012: 145-156 |
| Liu et al.: Cologne: A Declarative Distributed Constraint Optimization Platform. PVLDB 5(8): 752-763 (2012) |
|
Midterm out |
|
|
|
| Section: Architectures |
|
|
| October 24 |
I/O |
Yanif |
**E. Nightingale et al.: Flat Datacenter Storage. OSDI 2012. |
| **B. Xie et al.: Characterizing Output Bottlenecks in a Supercomputer. SC 2012. |
| October 31 |
Networking |
Andong, Yanif |
Corbett et al.: Spanner: Google's Globally-Distributed Database. OSDI 2012. |
| Zaharia et al.: Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster Computing. NSDI 2012. |
|
Midterm in |
|
|
| November 7 |
GPUs |
Nick, Vaibhav |
Mudalige et al.: Designing OP2 for GPU Architectures. Journal of Par. and Dist. Computing, 2012. |
| **S. Lee, J. Vetter: Early Evaluation of Directive-Based GPU Programming Models for Productive Exascale Computing. SC 2012. |
|
| Section: Applications |
|
|
| November 14 |
Large web data |
Abhijeet, Raghu |
S. Melnik et al.: Dremel: Interactive Analysis of Web-Scale Datasets. PVLDB 3(1): 330-339 (2010) |
| A. Hall et al.: Processing a Trillion Cells per Mouse Click. PVLDB 5(11):1436-1446, 2012. |
|
Short paper 2 due. |
|
|
| November 21 |
No class |
|
Thanksgiving |
| November 28 |
Large-scale learning |
Olivia, Aric |
A. Smola, S. Narayanamurthy: An Architecture for Parallel Topic Models. PVLDB 3(1): 703-710 (2010) |
| X. Feng et al.: Towards a unified architecture for in-RDBMS analytics. SIGMOD 2012: 325-336 |
| December 5 |
Crowdsourcing |
Svitlana, Debu |
M. Franklin et al.: CrowdDB: answering queries with crowdsourcing. SIGMOD 2011: 61-72 |
| Parameswaran et al.: CrowdScreen: algorithms for filtering data with humans. SIGMOD 2012: 361-372 |
|
Final projects due. |
|
|