CSE 710

Parallel and Distributed File Systems

Spring 2015

 

 

Instructor:

 

Prof. Tevfik Kosar

Office: 338J Davis Hall

Phone: 645-2323

Email: tkosar@buffalo.edu

Office hours: Tue 11:00am - 1:00pm

 

Course Description:

 

As the data requirements of commercial and scientific applications continue to grow, the ability to share large amounts of data across widely distributed sites (e.g., data centers, clouds, clusters, supercomputers) becomes increasingly important.

 

This seminar will discuss state-of-the-art research, development, and deployment efforts in parallel and distributed file systems on cluster, grid, and cloud infrastructures. We will read and discuss two papers every week in one of the following areas:

 

·      File System Design Decisions

·      Performance, Scalability, and Consistency Issues in File Systems

·      Traditional Distributed File Systems

·      Parallel Cluster File Systems

·      Wide Area Distributed File Systems

·      Cloud File Systems

·      Commercial vs Open Source File System Solutions

 

Course Location and Time:

 

The seminar meets Wednesdays, 10:00am-12:00pm, in 113A Davis Hall. The first class will be on Wednesday, January 28th, 2015.

 

Reading List:

 

The "tentative" reading list for this seminar is available here.

 

Projects:

 

The seminar will include a term project in which teams implement and present core components of a parallel or distributed file system. Some project ideas include:

Project-1: FuseDLS: Design and Implementation of a Fuse-based file system interface to a Cloud-hosted Directory Listing Service
Project-2: MDS: Design and Implementation of a Distributed Metadata Server for Global Name Space in a Wide-area File System
Project-3: SmartFS: Design and Implementation of a Serverless Distributed File System for Smartphones
Project-4: PowerFS: Energy-Aware File System Design
Detailed project descriptions are available here.

Project Milestones are:

Mar 2nd: System Design Document & Literature Survey due
Mar 4th - 11th: Midterm Presentations
Apr 22nd - May 6th: Final Presentations and Demos
May 11th: Final Reports due

 

Grading:

 

This is a research course. There will be team projects and paper presentations, but no exams. Each student will present 1-2 papers from the reading list provided above. Additionally, each student is expected to read all papers, submit questions and comments about the papers, attend classes, and join the discussion of the papers. Grading will be P/F.

 

Useful Links:

 

·      How to Read a Paper, by S. Keshav.

·      Reviewing a Technical Paper, by M. Ernst

 

Piazza Page:

 

We will also discuss the papers online on the course Piazza page. Details will be posted here soon.

 

 

Seminar Schedule:


| Date | Week | Papers to be Discussed | Presenter |
|---|---|---|---|
| Jan. 28 | 1 | Introduction: Wide Area Distributed File Systems | Tevfik |
| Feb. 4 | 2 | The Sun Network File System: Design, Implementation and Experience (NFS) | Kyle |
| | | Scale and Performance in a Distributed File System (AFS) | Aniket |
| Feb. 11 | 3 | Disconnected Operation in the Coda File System | Narasimha |
| | | Serverless Network File Systems (xFS) | Sayaritra |
| Feb. 18 | 4 | PVFS: A Parallel File System for Linux Clusters | Ji |
| | | Lustre: A Scalable, High-Performance File System | Gokhan |
| Feb. 25 | 5 | GPFS: A Shared-Disk File System for Large Computing Clusters | Archana |
| | | Panache: A Parallel File System Cache for Global File Access | Subhadeep |
| Mar. 4 | 6 | Midterm Presentations | Teams 1-3 |
| Mar. 11 | 7 | Midterm Presentations | Teams 4-6 |
| Mar. 18 | | SPRING BREAK | |
| Mar. 25 | 8 | Energy-Efficiency and Storage Flexibility in the Blue File System | Zulkar |
| | | Evaluation and Analysis of GreenHDFS | Luigi |
| Apr. 1 | 9 | The Google File System | Anvay |
| | | Ceph: A Scalable, High-Performance Distributed File System | Surya |
| Apr. 8 | 10 | Distributed Directory Service in the Farsite File System | Hrishikesh |
| | | Flexible, Wide-Area Storage for Distributed Systems with WheelFS | Samved |
| Apr. 15 | 11 | BlueSky: A Cloud-Backed File System for the Enterprise | Sean |
| | | Replication, History, and Grafting in the Ori File System | Shreyas |
| | | CalvinFS: A Scalable File System Built on Top of a Distributed DBMS | Hitpal |
| Apr. 22 | 12 | Final Project Presentations & Demos | Teams 1-2 |
| Apr. 29 | 13 | Final Project Presentations & Demos | Teams 3-4 |
| May 6 | 14 | Final Project Presentations & Demos | Teams 5-6 |