Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1010039

AXS: A Framework for Fast Astronomical Data Processing Based on Apache Spark


Zečević, Petar; Slater, Colin T.; Jurić, Mario; Connolly, Andrew J.; Lončarić, Sven; Bellm, Eric C.; Golkhou, V. Zach; Suberlak, Krzysztof
AXS: A Framework for Fast Astronomical Data Processing Based on Apache Spark // Astronomical journal, 158 (2019), 1; 37, 14 doi:10.3847/1538-3881/ab2384 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 1010039 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
AXS: A Framework for Fast Astronomical Data Processing Based on Apache Spark

Autori
Zečević, Petar ; Slater, Colin T. ; Jurić, Mario ; Connolly, Andrew J. ; Lončarić, Sven ; Bellm, Eric C. ; Golkhou, V. Zach ; Suberlak, Krzysztof

Izvornik
Astronomical journal (0004-6256) 158 (2019), 1; 37, 14

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
astronomical databases ; catalogs, data analysis

Sažetak
We introduce AXS (Astronomy eXtensions for Spark), a scalable open-source astronomical data analysis framework built on Apache Spark, a widely used industry-standard engine for big- data processing. Building on capabilities present in Spark, AXS aims to enable querying and analyzing almost arbitrarily large astronomical catalogs using familiar Python/AstroPy concepts, DataFrame APIs, and SQL statements. We achieve this by (i) adding support to Spark for efficient on-line positional cross- matching and (ii) supplying a Python library supporting commonly used operations for astronomical data analysis. To support scalable cross-matching, we develop a variant of the ZONES algorithm capable of operating in distributed, shared-nothing architecture. We couple this to a data partitioning scheme that enables fast catalog cross-matching and handles the data skew often present in deep all-sky data sets. The cross-match and other often-used functionalities are exposed to the end users through an easy-to-use Python API. We demonstrate AXS’s technical and scientific performance on Sloan Digital Sky Survey, Zwicky Transient Facility, Gaia DR2, and AllWise catalogs. Using AXS we were able to perform on-the-fly cross-match of Gaia DR2 (1.8 billion rows) and AllWise (900 million rows) data sets in ∼30 s. We discuss how cloud-ready distributed systems like AXS provide a natural way to enable comprehensive end-user analyses of large data sets such as the Large Synoptic Survey Telescope.

Izvorni jezik
Engleski

Znanstvena područja
Fizika, Računarstvo



POVEZANOST RADA


Projekti:
KK.01.1.1.01.0009 - Napredne metode i tehnologije u znanosti o podatcima i kooperativnim sustavima (EK )

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb

Profili:

Avatar Url Sven Lončarić (autor)

Avatar Url Petar Zečević (autor)

Citiraj ovu publikaciju:

Zečević, Petar; Slater, Colin T.; Jurić, Mario; Connolly, Andrew J.; Lončarić, Sven; Bellm, Eric C.; Golkhou, V. Zach; Suberlak, Krzysztof
AXS: A Framework for Fast Astronomical Data Processing Based on Apache Spark // Astronomical journal, 158 (2019), 1; 37, 14 doi:10.3847/1538-3881/ab2384 (međunarodna recenzija, članak, znanstveni)
Zečević, P., Slater, C., Jurić, M., Connolly, A., Lončarić, S., Bellm, E., Golkhou, V. & Suberlak, K. (2019) AXS: A Framework for Fast Astronomical Data Processing Based on Apache Spark. Astronomical journal, 158 (1), 37, 14 doi:10.3847/1538-3881/ab2384.
@article{article, author = {Ze\v{c}evi\'{c}, Petar and Slater, Colin T. and Juri\'{c}, Mario and Connolly, Andrew J. and Lon\v{c}ari\'{c}, Sven and Bellm, Eric C. and Golkhou, V. Zach and Suberlak, Krzysztof}, year = {2019}, pages = {14}, DOI = {10.3847/1538-3881/ab2384}, chapter = {37}, keywords = {astronomical databases, catalogs, data analysis}, journal = {Astronomical journal}, doi = {10.3847/1538-3881/ab2384}, volume = {158}, number = {1}, issn = {0004-6256}, title = {AXS: A Framework for Fast Astronomical Data Processing Based on Apache Spark}, keyword = {astronomical databases, catalogs, data analysis}, chapternumber = {37} }
@article{article, author = {Ze\v{c}evi\'{c}, Petar and Slater, Colin T. and Juri\'{c}, Mario and Connolly, Andrew J. and Lon\v{c}ari\'{c}, Sven and Bellm, Eric C. and Golkhou, V. Zach and Suberlak, Krzysztof}, year = {2019}, pages = {14}, DOI = {10.3847/1538-3881/ab2384}, chapter = {37}, keywords = {astronomical databases, catalogs, data analysis}, journal = {Astronomical journal}, doi = {10.3847/1538-3881/ab2384}, volume = {158}, number = {1}, issn = {0004-6256}, title = {AXS: A Framework for Fast Astronomical Data Processing Based on Apache Spark}, keyword = {astronomical databases, catalogs, data analysis}, chapternumber = {37} }

Časopis indeksira:


  • Current Contents Connect (CCC)
  • Web of Science Core Collection (WoSCC)
    • Science Citation Index Expanded (SCI-EXP)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font