Data cleaning


Correction exercises

Recover ESTs chromatograms


Translate chromatograms


Filter vector sequences


Mask repeats

Assembly

If everything went right you must see 3 contigs:

>dataset.fasta.Contig1
GGCCGCAATGCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNGGGGATTTATATGACTTTTATTTTAATATCATTTTAATA
TCGAATATGTCACAAGATTTAATGACCAAATTAGTCATTTACATTTTTCA
AAGTATCTAAATATAAAATTGCTAAAAATTCAAATACAGATAGCAAGCTA
CAAATATTTCTTTTGTTTTTGGGGGGGGGGTAGGAATGGAAGACATTAAA
AAAGGACGCTCCTGTTGGCTGAACAAAGATCAGAATGAAAGAAGAATTCA
TCAGCTTGACCCCTTGTGCAAAGAAATACACTCAGCTTTAAGATGCTGTT
TTAGACACATCTCTTCCTGTACAACAATTTAAAAAATGTTTCTAATGCAG
GTCCTCAGTGAAACACCCCTCTCCCTGAAACGTGAGAAACAGCAGCTTTC
CGCCTGCTTGAAAAAGGAGGCAGGCGCTGTAAGGAGAAAAGGAATGGGGA
ATGTGGCCCCTTCCAAGGCAGCGAAGCAGAGAACATGCAGGTGGAACTGC
CAGCTGAAACCTACACTTGCTACTTAAGAAACGGAAAGACCATGTCTGGC
TGGGAGTGCAGCGTGCCTGAGGACCAGCCACCGCCCCTCCTCCTGGCACC
CCACACTGTNCACGCAGCTCCTGGGCTCTGAGTCAGAAAGCTGGTGTGCT
GAAGGGGAAGTGACACGGCCTTGACACACCACTAGGTCTTTGTCTAACTT
TAGTAAGACCTACCCTCGAGCAAATGACTCATTTAACAAATGGCATGNTA
AACCTATTGCTAGAC
>dataset.fasta.Contig2
GGCCGCAATGCNNNNNNNNNNNNNNNNNNNNNNNNNNGTATTTATATGAC
TTTTATTTTAATATCATTTTAATATCGAATATGTCACAAGATTTAATGAC
CAAATTAGTCATTTACATTTTTCAAAGTATCTAAATATAAAATTGCTAAA
AATTCAAATACAGATAGCAAGCTACAAATATTTCTTTTGTTTTTGGGTGG
GGGGTAGGAATGGAAGACATTANAAAAGGACGCTCCTGTTGGCTGAACAA
AGATCAGAATGAAAGAAGAATTCATCAGCATGACTTGTAATGGTGGCTGC
TAAGCATATCCTGTACAACAATTTAAAAAATGTTTCTAATGCAGGTCCTC
AGTGAAACACCGGTCCCCGGCCCTGGCTGGGGACAGTAAGGACATCACCG
CAGGAGGGACACTGAAGAGGCTGTCGAGGACTGCAGAGGCATCTGGTGTG
GCCAGAGGCGTGGTGTCAGGGGCATCTGATCCCTTGCTGTTCCACCCAGG
GAGCCGGACGCACGGACACAGGTCTCCCTCCGCTCACCCCTCTCCCTGAA
ACGTGAGAAACAGCAGCTTTTCCCCCTCGTGCCGAATTCTTT
>dataset.fasta.Contig3
GGCACGAGATCATTTTAATATCGAATATGTCACAAGATTTAATGACCAAA
TTAGTCATTTACATTTTTCAAAGTATCTAAATATAAAATTGCTAAAAATT
CAAATACAGATAGCAAGCTACAAATATTTCTTTTGTTTTTGGGTGGGGGG
TAGGAATGGAAGACATTAGAAAAGGACGCTCCTGTTGGCTGAACAAAGAT
CAGAATGAAAGAAGAATTCATCAGCATGACCCCTTGTGCAAAGAAATACA
CTCAGCTTTAAGATGCTGTTTTAGACACATCTCTTCCTGTACAACAATTT
AAAAAATGTTTCTAATGCAGGTCCTCAGTGAAACACCGGTCCCCGGCCCT
GGCTGGGGACAGTAAGGACATCACCGCAGGAGGGACACTGAAGAGGCTGT
CGAGGACTGCAGAGGCATCTGGTGTGGCCAGAGGCGTGGTGTCAGGGGCA
TCTGATCCCTTGCCTGTTCCCACCCAGGGAGCCGGACGGCACGGACACAG
GTCTCCCTCCGCTCACCCCTCTCCCTGAAACGTGAGAAACAGCAGCTTTC
CGCCTGCTTGAAAAAGGAGGCAGGCGCTGTAAGGAGAAGAGGAATGGGGA
ATGTGGGCCCTTCCAAGGCAGCGAAGCAGAGAACATGCAGGTGGAACTGC
CAGCTGAAACCTACACTTGCTACTTAAGAAACGGAAAGACCATGTCTGGC
TGGGAGTGCAGCGTGCCTGAGGACCAGCCACCGCCCCTCCTCCTGGCACC
CCACACTGTCCACGCAGCTCCTGGGCTCTGAGGTCAGAAGCTGGGGTGTG
CTGAAGGGGAAAGTGACACGGCCTTGGACACACCACTAGGTCTCTGTTCC
TAACTTTTAGTAAGACACTACCCTCGAAGCAAATGCACTCATTTAAACAA
ATGGCATAGTTAAAACTATTGACTAAGACCTAAACATTTCTTCTGAGAAA
TCGAACCATAGCTTTTCCAATCTGCCTGTTCAATATGGGAACAGATTTTA
AAGAGAGTAACATAATCAATCCTCCGCCCCAAAGAATGGTCCATCATACC
AACCATGATAAAGAGTACACCTGTTTATTAAAAGGAAAAACAGAAATGTA
CATTTTTGTTGTTTGCTTTAAGAAATTTGATCAAGTTGCAAGGAAATGTG
TGGGCACGGCTCTGTACATCCTCGGGCAGGGTGGCAGGCATTGAAGGTGG
CTGGGCGGCACGTGGCTCCTCTAGGGGGTGGTCTGACCCCCAAGCATCGC
TTATCAAAGCCACTGCCAAGCAGACTTCCGTCCCATGGCAATGTCCCCAG
CGCTCCCTCCTAGGGGGCCCCCGACACCTTCCCCGAGAGCCCACCTGCCC
TGTGCCAGTGAGCCAGGGGCTGGCCTCCCCCCGAGGACTTCAGGACACCG
GGTGGGCTCTAGGGCACTTGGCCCTGGCAGGCAGGCTCTGAGCTACGGAA
CAGATTGAGGCTGGAGGCCAGTGTCACTGCTGCCCGGATTTCTTCTCCTT
CTGGAAGCTGAAGCTCGTACGTCTGTTCAGTTTTCTTTGTTTTTGAGCAA
AGTAGTCAGCCCGGGCAGTGCTAGCTCGCGATTCCAGGATGTAGTTAACC
TTGAGCACAATTTCATTGACGGTAGCAGCGTCCGATTCAAAGTAGAGGTG
TTTATAGTCGTGATTGCTTAGATACGTGAGTTTAAATATTGCGTGACTGG
GGCTTTTCTCTTCAGCAAGGTCACAGGCACAGAGCAGGTCGGAATCGATT
GAGATGGGTTTCTGCTTAATCCAAAACTTAGTGCTGGCTTTCTGATTCGT
AACAGGGTCTATCTCTACTTTGTCTCCAGAGATACCTAGCTGTACGTCGG
TTGTGAATCGCAGTCTGTGGATCATGCTGACTTTGAATGACTTGTAATGG
TGGCTGCTAACAGTATCCTGTACTGTGGCTATGTCAATTTGCGAATCCTC
CTCAAAAACCCCGTCTGCCCTTGAACTGTTCTCGCGGACCAGGCAGAACT
CGCATGCGCTCTGGCTCTCCAAAGTGCTGTCCAGGTCAACGGCGACATTG
GGCTCGCTCTGCTTCTCCAGGCGGTACTGAGGGCTCGGTTCTCTCG


... time for a beer!