Explore tens of thousands of sets crafted by our community.
Bioinformatics Tools
20
Flashcards
0/20
PICARD
Primary Use: Manipulating high-throughput sequencing (HTS) data and formats. Example Command: java -jar picard.jar SortVcf I=input.vcf O=sorted.vcf
QIIME
Primary Use: Microbial community analysis from raw DNA sequencing data. Example Command: qiime demux emp-single --i-seqs emp-single-end-sequences.qza --m-barcodes-file sample-metadata.tsv --m-barcodes-column BarcodeSequence --o-per-sample-sequences demux
GATK
Primary Use: Variant discovery in high-throughput sequencing data. Example Command: gatk HaplotypeCaller -R reference.fa -I input.bam -O output.vcf
Kraken
Primary Use: Assigning taxonomic labels to short DNA sequences with high accuracy. Example Command: kraken --db minikraken_20171019_8GB --fastq-input input.fastq --output output.txt --report report.txt
Bioconductor
Primary Use: Analysis and comprehension of high-throughput genomic data. Example Command: source('https://bioconductor.org/biocLite.R'); biocLite('GenomicFeatures')
FastQC
Primary Use: Quality control for high throughput sequence data. Example Command: fastqc input.fastq -o output_dir
BEDTools
Primary Use: A powerful toolset for genome arithmetic. Example Command: bedtools intersect -a fileA.bed -b fileB.bed > output.bed
SAMtools
Primary Use: Manipulating alignments in the SAM format. Example Command: samtools view -bS input.sam > output.bam
Bowtie 2
Primary Use: Fast and memory-efficient tool for aligning sequencing reads to long reference sequences. Example Command: bowtie2 -x genome_index -U reads.fq -S output.sam
HISAT2
Primary Use: Aligning sequences to a reference genome. Example Command: hisat2 -x genome_index -f -U input.fasta -S output.sam
BLAST
Primary Use: Finding regions of local similarity between sequences. Example Command: blastn -query input.fasta -db nt -out results.out
Cufflinks
Primary Use: Transcriptome assembly and differential expression analysis for RNA-Seq. Example Command: cufflinks -p 8 -o output_dir input.bam
MAFFT
Primary Use: Multiple sequence alignment. Example Command: mafft --auto input.fasta > output.fasta
MUSCLE
Primary Use: Multiple sequence alignment program for protein and nucleotide sequences. Example Command: muscle -in input.fa -out output.fa
Clustal Omega
Primary Use: Multiple sequence alignment. Example Command: clustalo -i input.fasta -o output.fasta -v
SnpEff
Primary Use: Variant annotation and effect prediction tool. Example Command: java -Xmx4g -jar snpEff.jar eff -v genome.versions input.vcf > output.vcf
Seurat
Primary Use: Tool designed for QC, analysis, and exploration of single-cell RNA-seq data. Example Command: Seurat::CreateSeuratObject(raw.data = raw.data)
TopHat
Primary Use: Aligning RNA-Seq reads to a genome in order to identify splice junctions. Example Command: tophat2 -p 8 -G genes.gtf -o output genome_index reads.fq
Galaxy
Primary Use: Enables accessible, reproducible, and transparent computational research. Example Command: Through a web interface rather than command line.
Trimmomatic
Primary Use: A flexible read trimming tool for Illumina NGS data. Example Command: trimmomatic PE -phred33 input_forward.fq input_reverse.fq output_forward_paired.fq output_forward_unpaired.fq output_reverse_paired.fq output_reverse_unpaired.fq ILLUMINACLIP:TruSeq3-PE.fa:2:30:10 LEADING:3 TRAILING:3 SLIDINGWINDOW:4:15 MINLEN:36
© Hypatia.Tech. 2024 All rights reserved.