STAR/ALICE Data Management

This repository holds code for data management tasks in the STAR and ALICE projects.

Motivation

STAR and ALICE have invested substantial money and time in designing the data management architecture for their experiment data, down to the level of the micro-DSTs, but local data analyses remain completely ad hoc.

Goal

Provide a local, lightweight data management infrastructure that allows analyses to be parallelized more seamlessly and their results to be easily queried and shared. The infrastructure will be generally applicable to data analysis workflows in STAR, ALICE, and other nuclear physics projects.

Approach

Automated distillation of metadata from analysis data files, with user annotations where required
Automated capture of the provenance of the data analysis process
Storage of intermediate data and final data products in a schema-less database
Fast sub-setting of data and metadata for high-throughput parallel analyses, including use of MapReduce frameworks such as Hadoop
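
To make the approach above more concrete, here is a minimal Python sketch of the first three ideas and a simple sub-setting query. It is not code from this repository: the function names, the record fields (`run`, `experiment`, `sha256`), and the example values are all hypothetical, and an in-memory list of dictionaries stands in for whatever schema-less database is actually used.

```python
import hashlib
import json
import os
import tempfile
import time


def distill_metadata(path, annotations=None):
    """Build a schema-less metadata record (a plain dict) for one data file."""
    with open(path, "rb") as fh:
        digest = hashlib.sha256(fh.read()).hexdigest()
    record = {
        "path": os.path.abspath(path),
        "size_bytes": os.path.getsize(path),
        "sha256": digest,
    }
    # User annotations are merged in as free-form keys; there is no fixed schema.
    record.update(annotations or {})
    return record


def provenance_entry(step, inputs, outputs, params):
    """Describe one analysis step: what ran, on which inputs, with which parameters."""
    return {
        "step": step,
        "inputs": list(inputs),
        "outputs": list(outputs),
        "params": dict(params),
        "timestamp": time.time(),
    }


def subset(records, **criteria):
    """Return the records whose fields equal every given criterion."""
    return [r for r in records
            if all(r.get(k) == v for k, v in criteria.items())]


def demo():
    """Catalogue two throwaway files and select one of them by annotation."""
    with tempfile.TemporaryDirectory() as tmp:
        records = []
        # Hypothetical file names and run numbers, for illustration only.
        for name, run in [("a.dat", 1001), ("b.dat", 1002)]:
            path = os.path.join(tmp, name)
            with open(path, "wb") as fh:
                fh.write(name.encode())
            records.append(
                distill_metadata(path, {"run": run, "experiment": "STAR"}))
        prov = provenance_entry(
            "histogram", [r["sha256"] for r in records], ["hist.json"], {"bins": 50})
        selected = subset(records, run=1002)
        # Every record is JSON-serializable, so it could go into a document store.
        json.dumps(records + [prov])
        return records, prov, selected
```

Calling `demo()` returns the two metadata records, one provenance entry, and the subset matching `run=1002`. Because each record is an independent dictionary, a selected subset could in principle be split across workers or passed to a MapReduce job, although this sketch does not do so.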

People

Dan Gunter
Keith Beattie
Jeff Porter
Lavanya Ramakrishnan
