data2biology - Professional Bioinformatics Services and Courses

What we can do for you.

From data to biology.

Initial meeting

Project discussion and workflow development.

Data processing

Quality control, data analysis, results visualisations.

Results delivery

Final report and resulting files.

Final remarks

Extra time to review and discuss the results.

Data analysis and software development.

We offer a wide array of bioinformatic services, including analysis of next-generation sequencing data. We have been working on model and non-model organisms, including viruses, bacteria, fungi, plants, animals, and human. We support study design and build analytical pipelines for customized data analysis with Python and in R environment. We also design webpages and biological databases, both for the needs of your research projects as well as for publication purposes. Find the details below!

Steps:

Quality filtering
Adapter trimming
Discarding rRNA-mapped reads or contaminants (per request)

Output files include:

quality report (HTML or PDF formats)
filtered reads in FASTQ format (per request)

Steps:

Estimating gene and transcript expression values
Differential expression analysis
Enrichment analysis of KEGG pathways and Gene Ontology terms

Output files include:

Tab-separated files with calculated expression values for genes and transcripts; these files might be open in a spreadsheet, such as MS Excel (the expression values are provided in FPKM/RPKM and TPM units)
A list of differentially expressed genes, with fold change values and adjusted P-values
Raw expression values for each gene in each condition as a single table (per request)
A heatmap of differentially expressed genes in PDF format
MA and volcano plots - essential diagnostic plots for differential expression analysis, in PDF format

Steps:

Read mapping against a reference genome sequence
Quality control of mapping results and post-processing them
Ab initio assembly of the transcriptome (detection of novel genes and splicing isoforms, including antisense and intronic transcripts)
Quality filtering of the transcriptome (if required)
Comparison with a reference transcriptome/annotations (if available)
Possible further steps: Expression analysis; Annotation of transcriptomes

Output files include:

Quality report from read mapping
A file in BAM format containing raw results of read mapping (per request)
The transcriptome in GFF/GTF and FASTA formats; the GFF/GTF files might be used for direct visualization of the transcriptome in genome browsers, such as IGV
Sashimi plots for genes of interest (per request)

Steps:

De novo assembly of the transcriptome
Calculation of quality measures
Possible further steps: Expression analysis; Annotation of transcriptomes

Output files include:

A transcriptome in FASTA format
Quality report for the transcriptome assembly

Steps:

Identification of protein-coding genes
Annotation of proteins: finding structural and functional domains; similarity search against databases of annotated proteins in other organisms
Assignment of KEGG pathways and Gene Ontology (GO) terms
Identification of tRNAs, rRNAs, snoRNAs, snRNAs
Identification of long non-coding RNAs (lncRNAs)
Identification of circRNAs
Possible further steps: identification of microRNAs

Output files include:

A single tab-delimited file (to be open in a spreadsheet program, such as MS Excel) with most of annotation results, including assigned KEGG and GO terms and found protein domains
FASTA and GTF/GFF/BED files with identified open reading frames and different classes of non-coding RNAs

Identification of miRNAs and phased siRNAs
Quantification and differential expression
IsomiR and miRNA editing analysis
Degradome-Seq data processing
Identification of sRNA targets
Data upload to miRBase

Support in the preparation, measurement and analysis of proteomics and peptidomics data.

Colaboration in research planning and sample preparation.

Assistance with biological interpretation of data and preliminary validation of results.

Services in the field of mass spectrometry data analysis:

Quality control.
Identification of proteins and peptides using both sequence database searches and de novo approaches.
Data preparation and processing for quantitative LC/MS experiments.
DIA data analysis (SWATH).
Statistical analysis and visualisation.
Identification of sample preparation and measurement problems.

SNPs, small INDELs, structural variants
Germline and somatic alterations (e.g. tumor vs normal tissue)
Whole Genome Sequencing (WGS), Exome-Seq, targeted sequencing

Automated annotation of prokaryotic and eukaryotic genomes
De novo genome assembly and annotation
Genome annotation with RNA-Seq data
Identification of non-coding RNAs (long non-coding RNAs, microRNAs and other ncRNAs)
Identification of repetitive elements
Setting up stand-alone (IGV) and online (JBrowse) genome browsers
Building online databases with genomic and transcriptomic data

ChIP-Seq: Pol II, histone modifications
Finding RNA editing events
Finding targets for microRNAs
Analysis of splicing (different variants)
Data upload to NGS databases (Sequence Read Archive, Gene Expression Omnibus)

PCR/qPCR primers design
Custom qPCR probes design

Want to discuss a project?

Don't hesitate to contact us to arrange a consultation!

Get in touch

What we can do for you.

From data to biology.

Initial meeting

Data processing

Results delivery

Final remarks

Data analysis and software development.

Data pre-processing

Expression analysis

Ab initio transcriptome assembly (genome dependent)

De novo transcriptome assembly (does not require genome sequence)

Annotation of transcriptomes

Small RNA Analysis

Proteomics and peptidomics

Genome Variants Calling

Genome assembly and annotation

Other NGS analyses

Primers and probes design

Want to discuss a project?