
DNA Quality control
Microsynth performs for every project a quality control of the input DNA. Depending on the project, this is done by agarose gel electrophoresis, Agilent bioanalyzer electrophoresis or Sanger sequencing reactions.
Microsynth propose both shotgun reads and or 3kb, 8kb and 20 kb long-tag paired end reads.

Microsynth prepares the DNA for the FLX run according to one of the two options mentioned above, performs the sequencing run on either a full large PTP (picotiterplate), on a 1/2, 1/4, 1/8 or 1/16 PTP.
An electronical assembling is made with the software "GS de Novo Assembler" and or " GS reference Mapper".
The sequencing data will be delivered on a DVD. The data consist in Raw Reads (FASTA format, Quality Scores, Sff files) and the Assembly data (Fasta formats and Ace files)
Advantages of the FLX sequencing system in whole genome sequencing
In resequencing or comparative genomics projects, large discrepancies between the sequenced genome and the reference genome can happen and make any alignment impossible. The long reads (400 bases) of the FLX Titanium system give the unique advantage, to allow also an assembling as a de Novo project by using the "GS de Novo Assembler" and to obtain by this way usable datas.
Microsynth also provides a library generation service for Sanger sequencing. If desired a hybrid approach of Sanger sequencing and 454 sequencing can be performed to profit from the advantages of both technologies.
Example of de Novo assembly results
|
Organism 1 De Novo Shotgun |
Organism 1 De Novo Paired end |
Organism 2 De Novo Shotgun |
Total of bases |
153.4 Mb |
115.6 Mb |
133,7 Mb |
Total of reads |
607’102 |
694’566 |
572’232 |
Large contigs (>500bp) |
|
|
|
Nb of large contigs |
82 |
305 |
87 |
Largest Contig |
158.4 kb |
59 kb |
421.4 kb |
Avg contig size |
33,7 kb |
9 kb |
57 kb |
N50 contig size |
60.7kb |
17.4 kb |
226.2 kb |
Nb of bases |
2,763 Mb |
2,762 Mb |
4,971 Mb |
All contigs |
|
|
|
Total nb. of contigs |
141 |
963 |
141 |
Total Nb of bases |
2.777 Mb |
2.883 Mb |
4,984 Mb |
Scaffold |
- |
- |
- |
Nb. of scaffolds |
- |
16 |
- |
Nb. of bases |
- |
2,765 Mb |
- |
Largest Scaffold |
- |
1.555Mb |
- |
Avg. Scaffold size |
- |
172.8 kb |
- |
N50 Scaffold size |
- |
1.555 Mb |
- |
Organism. 1 : Clostridium chauvoei – with the courtesy of Prof. J. Frey, Bakteriologie, Fakultaet, Universität Bern
Organism 2 : Escherichia coli strain – with the courtesy of Prof. R. Stephan, Institut für Lebensmittelsicherheit & Hygiene, Universität Zürich
Depending on their sizes several BACs can be sequenced during one run. Two options can be used:
The preparation of the BAC-DNA libraries for the FLX , the sequencing runs, the electronical assembling and the transmission of the datas are similar to what is described above in the whole genome sequencing section.
It is important to be aware of the fact that, unlike with Sanger sequencing technology, by BAC sequencing a possible contamination with E. coli DNA cannot be excluded and may result in a loss of total capacity.
cDNA for direct sequencing or RNA for cDNA libraries preparation (cloning of full-length cDNA, cloning of small non-coding RNA) and subsequent sequencing can be sent to Microsynth.
The preparation of the cDNA for the FLX , the sequencing runs and the transmission of the datas are similar to what is described above in the whole genome sequencing section.
Electronical assembling : If de Novo, the alignment is done electronically with the software "GS de Novo Assembler".If a reference sequence exist, the alignment is done electronically with the software "GS Reference Mapper"
The procedure for preparing a DNA sample for Amplicon Sequencing consists of a simple PCR amplification, but requires special Fusion Primers which must be designed by the user according to the specific requirements of the experiment.
The Fusion Primers must contain the GS FLX Titanium Primer A and Primer B (which includes a four base library key sequence TCAG) , are always the same, as dictated by the requirements of the GS FLX Genome Sequencer System. An optional Multiplex Identifier (MID) sequence may be added to allow further sorting of the samples.
They have the following characteristics:
Forward primer (Primer A)
I. Primer A: 5’ CGTATCGCCTCCCTCGCGCCATCAG-MID-template-specific-sequence-3'
II. Primer B: 5’ CTATGCGCCTTGCCAGCCCGCTCAG-MID-template-specific-sequence-3'
The 3’-part of each Fusion Primer is specific to each Amplicon and is used by the Amplicon Variant Analyser software to assign reads:
The amplicons are pooled (equimolar pooling) in 1, 1/2, 1/4, 1/8 or 1/16 run. The number of amplicons that can be pooled together is a compromise between ensuring a high probability of finding a variant and pooling the largest possible number of amplicons.
As an example : if a desired sentivity for a variant within a sample is 10%, 5% or 2%, ans 50 reads pro amplicons, the number of amplicons that can be pooled in 1/16 of a PTP plate (30'000 reads) will be respectively 60, 30 and 12.
The sequencing runs and the transmission of the data are similar to what is described above in the whole genome sequencing section.
Electronical analysis is made with the software « GS Amplicon Variant Analyser software »
Example of deep sequencing of PCR products
To pool several samples within one region the PCR products were barcoded by fusionprimers. Six different fusionprimers with a 6 bp barcode were used to deep sequence totally 36 samples on six regions.
Sample | Region | Barcode | Reads per region | Reads per sample |
1 | 1 | 1 | 9476 | 1688 |
2 | 2 | 1866 | ||
3 | 3 | 1220 | ||
4 | 4 | 1479 | ||
5 | 5 | 1293 | ||
6 | 6 | 1925 | ||
7 | 2 | 1 | 11769 | 2171 |
8 | 2 | 2626 | ||
9 | 3 | 1276 | ||
10 | 4 | 1321 | ||
11 | 5 | 2571 | ||
12 | 6 | 1800 | ||
13 | 3 | 1 | 9444 | 1887 |
14 | 2 | 1463 | ||
15 | 3 | 1165 | ||
16 | 4 | 1840 | ||
17 | 5 | 1494 | ||
18 | 6 | 1599 | ||
19 | 4 | 1 | 8465 | 1263 |
20 | 2 | 1984 | ||
21 | 3 | 1177 | ||
22 | 4 | 1433 | ||
23 | 5 | 1337 | ||
24 | 6 | 1268 | ||
25 | 5 | 1 | 11892 | 2614 |
26 | 2 | 2161 | ||
27 | 3 | 1947 | ||
28 | 4 | 2039 | ||
29 | 5 | 1297 | ||
30 | 6 | 1855 | ||
31 | 6 | 1 | 8734 | 1388 |
32 | 2 | 1394 | ||
33 | 3 | 1440 | ||
34 | 4 | 2127 | ||
35 | 5 | 992 | ||
36 | 6 | 1401 |
Table1: Number of reads received per region and sample after equimolar pooling of the PCR products. With the courtesy of Prof I. Roditi, Institute of Cell Biology, University of Bern
The long reads obtained with the Titanium series enables diversity, metabolic and microbial abundance studies of the metagenomes of interest.
Two approaches are possible for metagenomic projects: