archive:appds:main
Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| archive:appds:main [05/07/2024 19:03] – removed - external edit (Unknown date) 127.0.0.1 | archive:appds:main [05/07/2024 19:19] (current) – ↷ Links adapted because of a move operation admin | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| + | {{archive: | ||
| + | ====== APPDS ====== | ||
| + | Astroparticle Physics Data Storage (APPDS) is a project for design and development of distributed data storage for cosmic ray experiments such as [[: | ||
| + | |||
| + | ===== Annotation ===== | ||
| + | |||
| + | The project will address the problem of management of contemporary scientific data. | ||
| + | Nowadays the exponential growth of the amount of experimental data can be observed. While there was | ||
| + | 1-10 Tb of data per year in astrophysics 10-15 years ago, new experimental facilities generate data sets | ||
| + | ranging in size from 100’s to 1000's of terabytes per year. It can be illustrated by a growth of the amount | ||
| + | of data acquired by satellites. While the Integral satellite [http:// | ||
| + | ground 1.2 Gb of data per day in 2002, now the Gaia spacecraft (2013) transfers about 5 Gb of data per | ||
| + | day. The other example is the ground-based experiment LSST [https:// | ||
| + | gigapixels per image with an exposure every 15 seconds. It is expected to take ~10 petabytes of | ||
| + | information per year. | ||
| + | |||
| + | These trends give rise to a number of emerging issues of big data management. It's obvious that various | ||
| + | activities must be performed continually across all stages of the data life cycle to help support effective | ||
| + | data management (see an example in https:// | ||
| + | collection and storage of data, its processing and analysis, refining the physical model, making | ||
| + | preparations for publication, | ||
| + | modern science in general and astroparticle physics in particular is open science, the model of free | ||
| + | access to data (e.g. Paul A. David, Industrial and Corporate Change, Volume 13, Number 4, pp. 571–589; | ||
| + | http:// | ||
| + | members but to all levels of an inquiring society, amateur or professional. This approach is especially | ||
| + | important in the age of Big Data, when a complete analysis of the experimental data cannot be performed | ||
| + | within one collaboration. | ||
| + | |||
| + | The present project will strive to develop an open science system to be able to collect, store, and analyze | ||
| + | astrophysical data having the TAIGA [http:// | ||
| + | [https:// | ||
| + | The novelty of the proposed approach can be seen in developing integrated solutions including: | ||
| + | * development and adaptation of distributed data storage algorithms and techniques with a common meta-catalog to provide a common information space of the distributed repository; | ||
| + | * development and adaptation of data transmission algorithms as well as simultaneous data transmission from several data repositories thus significantly reducing load time; | ||
| + | * development of machine-learning techniques for identifying mass groups of particles and their properties in a fully remote access mode; | ||
| + | * installation of the KCDC-based prototype system of Big Data analysis and exporting the experimental data from KASKADE and TAIGA for testing technology of data life cycle management. | ||
| + | |||
| + | We will also create an educational system on the HubZero platform [https:// | ||
| + | astroparticle physics. | ||
| + | |||
| + | This is an innovative approach that will be used in astroparticle physics research for the first time. | ||
| + | Plans are underway to expand the number of experiments by exporting data from other scientific | ||
| + | collaborations, | ||
| + | It's noteworthy that the suggested approach can be used not only in the specified field of science but also | ||
| + | adapted to other scientific disciplines. | ||
| + | |||
| + | ===== Collaboration ===== | ||
| + | |||
| + | * Russian institutes: | ||
| + | * SINP MSU, Moscow | ||
| + | * IGU, Irkutsk | ||
| + | * IDSTU, Irkutsk | ||
| + | * Germany institutes: | ||
| + | * KIT, Karlsruhe | ||
| + | |||
| + | ====== Principal members ====== | ||
| + | |||
| + | * Germany principal members | ||
| + | * [[mailto: | ||
| + | * [[mailto: | ||
| + | * Russian principal members | ||
| + | * [[mailto: | ||
| + | * [[mailto: | ||
| + | * [[mailto: | ||
| + | * [[mailto: | ||
| + | |||
| + | Mailing list: [[mailto: | ||
| + | |||
| + | ====== Project structure ====== | ||
| + | |||
| + | ===== WP's ===== | ||
| + | |||
| + | * [[.wp1: | ||
| + | * [[.wp2: | ||
| + | * [[.wp3: | ||
| + | * [[.wp4: | ||
| + | |||
| + | ====== Schedule and Milestones ====== | ||
| + | |||
| + | Considering the above mentioned specific tasks, we defined the schedule and a couple of | ||
| + | milestones to be reached within this project: | ||
| + | |||
| + | ===== 2018 ===== | ||
| + | |||
| + | 1. Development of standard astrophysical data format and preparation of the TAIGA data (wp1) \\ | ||
| + | 2. Preparing the environment to move KCDC to the computing facilities in KIT and MSU (wp2) \\ | ||
| + | 3. Workout of a coherent concept for a dedicated Data Life Cycle Lab (wp2) \\ | ||
| + | 4. Defining physics tasks for common data analysis (wp3) \\ | ||
| + | 5. Creating webpage and announcing KRAD to the world (wp4) | ||
| + | |||
| + | ===== 2019 ===== | ||
| + | |||
| + | 1. Inclusion of TAIGA data within KCDC (wp1) \\ | ||
| + | 2. Final installation of extended KCDC at large-scale computing facilities in KIT and MSU (wp2) \\ | ||
| + | 3. Installing the data life cycle lab (wp2) \\ | ||
| + | 4. Performing multi-messenger analysis using the extended KCDC (wp3) \\ | ||
| + | 5. Publishing of the Data Life Cycle Lab (wp4) | ||
| + | |||
| + | ===== 2020 ===== | ||
| + | |||
| + | 1. Generalization of the Data Life Cycle Lab (wp2) \\ | ||
| + | 2. Workout of a concept for global astroparticle physics data centre (wp2) \\ | ||
| + | 3. Finalizing and publication of specific data analyses using the new environment (wp3) | ||
| + | |||
