Automated and traceable processing for large-scale high-throughput sequencing facilities

dc.contributor.authorPireddu, Luca
dc.contributor.authorCuccuru, Gianmauro
dc.contributor.authorLianas, Luca
dc.contributor.authorVocale, Matteo
dc.contributor.authorFotia, Giorgio
dc.contributor.authorZanetti, Gianluigi
dc.date.accessioned2014-05-16T07:53:52Z
dc.date.available2014-05-16T07:53:52Z
dc.date.issued2013
dc.description.abstractScaling up production in medium and large high-throughput sequencing facilities presents a number of challenges. As the rate of samples to process increases, manually performing and tracking the center’s operations becomes increasingly difficult, costly and error prone, while processing the massive amounts of data poses significant computational challenges. We present our ongoing work to automate and track all data-related procedures at the CRS4 Sequencing and Genotyping Platform, while integrating state-of-the-art processing technologies such as Hadoop, OMERO, iRODS, and Galaxy into our automated workflows. Currently, the core system is in its testing phase and it is on schedule to be in production use at CRS4 by May 2013. The results thus far obtained are encouraging and the authors are confident that the CRS4 Platform will increase its efficiency and capacity thanks to this system. In the near future, the integration components will be released as as open source software.IT
dc.description.pagenumber23-24IT
dc.description.statusPubblicatoIT
dc.identifier.doi10.14806/ej.19.A.626IT
dc.identifier.issn2226-6089
dc.identifier.urihttp://hdl.handle.net/11050/908
dc.language.isoenIT
dc.relation.ispartofEMBnet.journal. The Next NGS Challenge Conference: Data Processing and Integration 14-16 May 2013, Valencia, SpainIT
dc.relation.ispartofseries19;Suppl. A
dc.rightsAttribuzione - Non commerciale - Condividi allo stesso modo 3.0 Italia*
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/it/*
dc.subjectngsIT
dc.subjectautomationIT
dc.subjectbioinformaticsIT
dc.subjectdata analysisIT
dc.subjecthigh-performance computingIT
dc.subject.een-cordisEEN CORDIS::SCIENZE BIOLOGICHE ::Ricerca sul genoma ::BioinformaticaIT
dc.titleAutomated and traceable processing for large-scale high-throughput sequencing facilitiesIT
dc.typeArticoloIT
File
Original bundle
Ora in mostra 1 - 1 di 1
Caricamento...
Immagine di anteprima
Nome:
626-3699-2-PB.pdf
Dimensione:
328.03 KB
Formato:
Adobe Portable Document Format
Descrizione:
Articolo in Open Access
License bundle
Ora in mostra 1 - 1 di 1
Caricamento...
Immagine di anteprima
Nome:
license.txt
Dimensione:
2.06 KB
Formato:
Item-specific license agreed upon to submission
Descrizione:
collections