Master Thesis Defense on "Dataset versioning in Hops Filesystem" (Monday, Aug 28, 9:00am)

Time: Monday, 28 August 2017, 09:00 - 10:30

Kungliga Tekniska högskolan
KTH Kista Degree projects, Master-level (Examensarbete, Master)

Location: Ada Room

Info:

Student:  Braulio Grana Gutiérrez
Date and Time:  09:00am, Monday, 28th August 2017
Place: Ada room, 4th floor
Examiner:  Šarūnas Girdzijauskas
Supervisor:  Jim Dowling
Title: Dataset versioning in Hops Filesystem
Opponent: Adrián Ramírez del Río


Abstract
As awareness of the potential of Big Data grows, more and more companies
are creating their own Data Science divisions, and their projects are
becoming large and complex, handled by big multidisciplinary teams. Furthermore,
with the expansion of fields such as Deep Learning, Data Science is becoming a
very popular research field in both companies and universities.

In this context, it becomes crucial for Data Scientists to be able to reproduce
their experiments in a reliable way. This Master Thesis project presents the
design of a snapshotting system for the distributed file system HopsFS, which is
based on Apache HDFS and developed at the Swedish Institute of Computer Science
(SICS), along with comments and discussion on the implementation of that system.

The contributions of this project are not only building the snapshotting
system for HopsFS, but also improving on previous solutions designed for both
HopsFS and HDFS by solving problems such as the incomplete block problem, and
adding new uses for the system, such as automatic snapshots that allow users
to undo the last few changes to a file.

Last modified 2017-08-23 11:47
