The institute of experimental medicine (IMEX) has a bioinformatics service whose objective is to advise the different research groups on modern computational biology methods that allow them to achieve their scientific goals. Bioinformatics has a transversal role, so it collaborates with the rest of the laboratories, providing a high added value to research and development activities.
Responsable: Martín Ledesma, PhD
E-mail: imex.bioinf@gmail.com
Bioinformatics services
Description: The service includes the processing of NGS data, including quality analysis, filtering and pre-processing of the data. Alignment, variants call (SNPs + indels and copy number variations), variants analysis and differential gene expression analysis.
Methodology: Reads quality analysis using FASTQC / FASTX-TOOLKIT, reads filtering using Trimmommatic or FASTX-TOOLKIT. Alignment with the reference genome (GRCh38 / hg38 or b37 / hg19) using BWA or Bowtie. Pre-processing and variants call using the GATK toolkit and in accordance with the good practices indicated by the Broad Institute. Variants analysis with Variant Effect Predictor or SnpEff. Differential expression analysis with Bioconductor packages.
Description: The service includes the processing NGS data, including quality analysis, filtering and pre-processing of the reads. Alignment with the reference genome, characterization of structural variants (SV), de novo assembly, assembly optimization, prediction and annotation of genes, search for coding genes or genomic regions of interest.
Methodology: Reads quality analysis using FASTQC / FASTX-TOOLKIT, reads filtering using Trimmommatic or FASTX-TOOLKIT. Different tools are used for de novo assembly (SPAdes, ABBySS, Flye, Unicycler) and also for the characterization of SVs (Lumpy, SvABA, Manta). To search for regions or genes of interest, the BLAST and HMMER tools are used, using databases associated to each particular requirement. Phylogenetic analysis based on ANI or genes of interest.
Description: The service includes the NGS data processing, including quality analysis, filtering and pre-processing of the reads, filtering of host reads, taxonomic classification of reads, assembly and prediction of genes.
Methodology: Reads quality analysis using FASTQC / FASTX-TOOLKIT, reads filtering using Trimmommatic or FASTX-TOOLKIT. The taxonomic profile is made with the Kaiju, Kraken or Centrifuge tools. The assembly is done with metaSPAdes, MEGAHIT. Binning analysis with the MetaBAT or CONCOCT tools.
Technical specifications
Two High-performance computers for massive data analysis: 1 Asus Z370-P Prime 1151 Motherboard, 1 Kingston A400 120gb Sata SSD Disk, 1 Seagate Ironwolf 7200 256MB NAS RAID 6T Hard Drive, 1 Gigabyte GTX 1060 mini lTX GF video card OC 3GB DDDR5, 1 Source of 700W Gigabyte 8700h 80 plus bronze, 1 Micro Intel Core I7 8700k 4.70ghz Cofee Lake 1151, 3 memory DDR4 16GB 2400 Mushkin Blackline.