This website uses cookies in order to improve our services. If you proceed visiting this website you accept the usage of cookies. For more info please read our Data Privacy statement.

1000 Plant Genomes project

uablertalogoThe 1000 plants (oneKP or 1KP) initiative is an international multi-disciplinary consortium that has generated large-scale gene sequencing data for over 1000 species of plants. Major supporters include Alberta Ministry of Innovation and Advanced Education, Musea Ventures (Somekh Family Foundation), Beijing Genomics Institute in Shenzhen (BGI-Shenzhen), China National GeneBank (CNGB), iPlant Tree-of-Life (iPToL) Grand Challenge, Compute Canada (Westgrid), Alberta Innovates Technology Futures (AITF-iCORE Strategic Chair). The sample selection was originally based on a series of overlapping sub-projects with scientific objectives that could be addressed by sequencing multiple plant species (links on left). As more collaborators joined 1KP, however, the objectives evolved and are now exemplified by the diverse collection of papers described by the links below.

Source: University of Alberta


The 100,000 Genomes Project

Genomics England logo colourGenomics England is a company owned by the UK Department of Health and was set up to deliver the 100,000 Genomes Project. This flagship project will sequence 100,000 genomes from around 70,000 people. Participants are NHS patients with a rare disease, plus their families, and patients with cancer.

The aim is to create a new genomic medicine service for the NHS – transforming the way people are cared for. Patients may be offered a diagnosis where there wasn’t one before. In time, there is the potential of new and more effective treatments.

The project will also enable new medical research. Combining genomic sequence data with medical records is a ground-breaking resource. Researchers will study how best to use genomics in healthcare and how best to interpret the data to help patients. The causes, diagnosis and treatment of disease will also be investigated. We also aim to kick-start a UK genomics industry. This is currently the largest national sequencing project of its kind in the world.

Source: Genomics England


100K Genome Project

FDALogoMicrobiology is embracing high-throughput genomics because many long-standing questions can be quickly addressed and entire microbial communities can be examined quickly. The lack of food-related bacterial genomes is hindering advancements to improve the safety and security of the world food supply. The 100K Foodborne Pathogen Genome Sequencing Project directly addresses the lack of genomic information by sequencing 100,000 foodborne pathogen isolates for the most important worldwide outbreak organisms. 

UCDLogoTo close this knowledge gap and enable worldwide contribution, the genome sequence diversity of the top foodborne pathogens will be produced using NGS with BGI@UCDavis. This new facility will have ~10 HiSeq 2000 instruments for use by this project. A small number of whole genomes will be “finished” to completion for use as reference genomes. The vast majority of isolates will be sequenced and assembled as draft genomes for eventual public release. This approach will enable systematic definition of a biomarker gene sets associated with persistence, serotype diversity, location, antibiotic resistance, pathogenesis, and host association – just to name a few applications.

Source: UC Davis School of Veterinary Medicine

MSSNG is a groundbreaking collaboration between Google and Autism Speaks to create the world’s largest genomic database on autism.

cloudegoogleBy sequencing the DNA of over 10,000 families affected autism, MSSNG will answer the many questions we still have about the disorder.

T​hanks to the Google Cloud, this vast sea of information will be made accessible for free to researchers everywhere. The greatest minds in science from around the world will be able to study trillions of data points in one single database. Our pilot program of 1,000 whole genomes has led to new discoveries about the disorder. With over 10,000 whole genomes and the help of the global science community we can far outreach what has been accomplished so far. MSSNG will identify many subtypes of autism, which may lead to more personalized and more accurate treatments. For questions about MSSNG, please contact us at This email address is being protected from spambots. You need JavaScript enabled to view it. or call (646) 385-8593.

Source: MSSNG

Genome 10K

genome10KThe Genome 10K project aims to assemble a genomic zoo—a collection of DNA sequences representing the genomes of 10,000 vertebrate species, approximately one for every vertebrate genus. The trajectory of cost reduction in DNA sequencing suggests that this project will be feasible within a few years. Capturing the genetic diversity of vertebrate species would create an unprecedented resource for the life sciences and for worldwide conservation efforts.

The growing Genome 10K Community of Scientists (G10KCOS), made up of leading scientists representing major zoos, museums, research centers, and universities around the world, is dedicated to coordinating efforts in tissue specimen collection that will lay the groundwork for a large-scale sequencing and analysis project.

Source: Genome 10K

Human ENCODE Project

The National Human Genome Research Institute (NHGRI) launched a public research consortium named ENCODE, the Encyclopedia Of DNA Elements, in September 2003, to carry out a project to identify all functional elements in the human genome sequence. The project started with two components - a pilot phase and a technology development phase.ENCODE logo
The pilot phase tested and compared existing methods to rigorously analyze a defined portion of the human genome sequence (See: ENCODE Pilot Project). The conclusions from this pilot project were published in June 2007 in Nature and Genome Research []. The findings highlighted the success of the project to identify and characterize functional elements in the human genome. The technology development phase also has been a success with the promotion of several new technologies to generate high throughput data on functional elements.
Source: National Human Genome Research Institute


International Cancer Genome Consortium 

ICGClogoCurrently, the ICGC has received commitments from funding organizations in Asia, Australia, Europe, North America and South America for 88 project teams in 17 jurisdictions to study over 25,000 tumor genomes. Projects that are currently funded are examining tumors affecting: the biliary tract, bladder, blood, bone, brain, breast, cervix, colon, eye, head and neck, kidney, liver, lung, nasopharynx, oral cavity, ovary, pancreas, prostate, rectum, skin, soft tissues, stomach, thyroid and uterus.

Source: International Cancer Genome Consortium


craigventurelogoThe influenza genome sequencing project (IGSP) was initiated in 2005 to investigate influenza evolution by providing a public data set of complete influenza genome sequences from collections of isolates representing diverse temporal, spatial and species distributions. One of NIAID's primary goals for the project is to dramatically improve the availability of influenza genomic sequence in the public domain. We will sequence the complete genomes of a large collection of human influenza isolates, as well as a select number of avian and other non-human influenza strains important in the evolution of viruses with pandemic potential. The strains will be chosen to represent many subtypes with a wide geographical and chronological distribution. All the data generated will be released to the public domain in accordance with NIAID's data and reagent sharing and release guidelines.

Source: J. Graig Venter Institute