Hands on Workshop: Introduction to Bioinformatics
The purpose of this course is to provide graduate students with the theoretical knowledge and practical skills for the analysis of high throughput sequence data. The course will entail data retrieval and assembly, alignment techniques, variant calling, gene expression analyses, hypothesis testing, and population genomic and phylogenomic approaches. The course will be presented as a series of short lectures and lab exercises over a one week period during the mid semester break of the second semester.
Dr. Kathryn Hodgins, Dr. Gregory Owens, Dr. Matt McGee, Dr. Sonika Tyagi
A mix of lecture and tutorial exercises running in a 2-hour block.
Prior exposure to R and Unix command line is recommended. However, we will be running a crash course on command line the first day.
30 Sept 2019 – 4 Oct 2019 10am to Noon, 1pm to 3pm
Room 144 (Digilabs), Building 17 School of Biological Sciences
The course material is organized in several topics, with slides and coding examples.
To get up to speed on working with a Unix system, take a look at the unix help file. There are some resources there that will help you find the specific command you need for each task.
Topic 1 Workshop overview & sequencing technology and approaches (Day 1 AM)
Lecture: Workshop overview and goals (Kay) Sequencing technology and approaches (Matt Tinning, NGS manager of AGRF) Tutorial: R intro (Sonika)
Topic 3 Sequence file formats and quality control/trimming (Day 2 AM)
Lecture: Sequence file formats and quality control/trimming (Kay) Tutorial: QC GBS and RNAseq reads (Kay)
Topic 4 Sequence alignment (Day 2 PM)
Lecture: Alignment algorithms and tools (Greg) Tutorial: Alignment of NGS data (Greg)
Topic 5 Variant calling (Day 3 AM)
Lecture: SNP and variant calling (Greg) Tutorial: Call variants for NGS data (Greg)
Topic 6 Population genomics (Day 4 AM)
Lecture: Population genomics (Greg) Tutorial: Population genomics (Greg)
Topic 7 Population genomics and plotting in R (Day 4 AM)
Tutorial: Population genomics and plotting in R (Greg)
To access the course content offline, you may download an up-to-date snapshot archive of the site content from this location: https://github.com/khodgins/Bioinformatics_Introduction/archive/master.zip
But the recommended approach is to use git
which tracks changes and supports incremental updates:
git clone https://github.com/khodgins/Bioinformatics_Introduction.git
If the course content changes, you can update your local copy by going into to the Bioinformatics_Introduction directory that was created by the previous command and invoking the command:
git pull
You may use any of the materials provided here, and modify them in any way, provided there is appropriate attribution according the license found below and included with this project.
Copyright (C) 2015 S. Evan Staton, Sariel Hubner, Sam Yeaman
Modified work (c) 2016, 2017, 2018 Gregory Owens, Kathryn Hodgins
Modified work (c) 2019 Gregory Owens, Kathryn Hodgins, JS Legare
Modified work (c) 2019 Gregory Owens, Kathryn Hodgins, S Tyagi, M McGee
This program is distributed under the MIT (X11) License, which should be distributed with the package. If not, it can be found here: http://www.opensource.org/licenses/mit-license.php
The license file is here
This site is powered by GithubPages, and the code backing it is on GitHub here.