BYTE-Sea project

Objectives

BYTE-Sea coordinates the development of interfaces for data access, progress monitoring, and management of the ATLASea programme.

1. Interoperability and Data Publication

The BYTE-Sea project ensures the interoperability and security of data produced by the ATLASea programme, and facilitates its dissemination in line with FAIR principles and Open Science practices. In collaboration with the SEQ-Sea project, it supports the validation and publication of genomic data in reference repositories (such as European Nucleotide Archive – ENA), including raw reads, assembled genomes, and both structural and functional annotations.

2. Data Visualisation

A data portal allows users to track analysed species, sampling events, and the production and publication of genomic data in relation to taxonomic and biodiversity reference systems.

This portal also provides access to genome visualisation tools (e.g., Genome Browser) and similarity search functions, and will soon include first-level analysis tools (for functional and comparative genomics).

A monitoring portal, currently under development and restricted to consortium members, will enable tracking of sample processing—from taxonomic validation through DNA preparation and sequencing, to the publication of assemblies and annotations in public repositories.

3. Annotation

Predicting gene structure and function in eukaryotes is a complex task that largely depends on existing knowledge and available resources for the species in question. To meet this challenge, the ATLASea programme—which aims to sequence genomes from a wide range of under-studied species with few or no close relatives already sequenced—has adopted a systematic approach.

In collaboration with the SEQ-Sea project, this approach initially involves automating structural and functional annotation pipelines. Subsequently, a community annotation environment will be established to refine gene structure and function predictions following their initial release.

4. Comparative analysis

A dedicated portal is being developed to provide access to all genomic data on marine organisms generated by the ATLASea programme and other consortia. This environment will offer access to high-quality genome assemblies (including chromosome-level sequences).

Available data will include genomic sequences, gene annotations, coding and non-coding regions, and functional annotations. The portal will also offer tools for sequence search, evolutionary and functional analysis, and data visualisation in environmental and biological contexts.

Resources

The project’s main digital infrastructure is hosted at the IFBcore facility (IDRIS-Orsay) and within the BigOuest platform, a federation of the ABiMS (Roscoff) and GenOuest (Rennes) infrastructures.

We rely on storage and computing resources at these data centres, ensuring high data and portal availability by distributing access across three locations: Orsay, Rennes, and Plouzané.

Organisation

Lead organisation : Institut Français de Bioinformatique (IFB)

Manager : Erwan Corre

Station Biologique de Roscoff
Team
Institut de Biologie de l’École Normale Supérieure (IBENS)
Team
Institut de Recherche en Informatique et Systèmes Aléatoires (IRISA)
Team
Institut Français de Bioinformatique (IFB)
Team
Ifremer
Team
Loraine Guegen
Mark Hoebeke
Annie Lebreton
Alexandre Nicaise
Gildas Le Corguille
Lucile Jeusset
Samuel Lalam
Alexandra Louis
Hugues Roest Crollius
Anthony Bretaudeau
Théo Foulquier
Romane Libouban
Mateo Boudet
Nicole Charrière
Julien Seiler
Pauline Auffret
Patrick Durand
Laura Leroi
Yaëlle Pihan

Scroll to Top