SPIDBARSPIDBAR
Species Identification using DNA Barcode

 

Home

Help

  • Input
  • Output
  • Dataset


    There are two input file needed to be supplied by the user in similar to BLOG 2.0 (http://dmb.iasi.cnr.it/blog-downloads.php), one is reference dataset and other is query dataset. In both reference and query sets, the barcode sequences should be in the FASTA format, where the name of the sequences must be in BOLD (http://www.boldsystems.org/) format. However in case of query set, if the name of barcode sequence is not known, a hypothetical name must be supplied and it should be also in BOLD format. An example of reference set, query set (with known species name) and query set (with hypothetical name) is provided below.

    REFERENCE SET

    >EF079971|Ametrida centurio|ROM 98798|COI
    ACATTGTACTTACTATTTGGTGCTTGAGCAGGAATAGTAGGTACCGCACTAAGCCTACTTATTCG
    TGCAGAACTTGGACAACCTGGGGCTCTATTAGGTGACGACCAAATCTATAATGTTATCGTTACAG
    CCCACGCTTTCGTAATGATTTTCTTTATAGTAATACCCATCATGATTGGAGGGTTCGGCAACTGA
    CTTGTACCACTAATAATTGGCGCACCTGACATAGCATTCCCACGAATAAATAACATAAGCTTCTG
    ACTTCTCCCACCCTCTTTCCTGCTTCTACTGGCCTCCTCAACAGTCGAAGCTGGTGTTGGGACTG
    CTTATTT------
    >EF079972|Ametrida centurio|ROM 98849|COI
    ACATTGTACTTACTATTTGGTGCTTGAGCAGGAATAGTAGGTACCGCACTAAGCCTACTTATTCG
    TGCAGAACTTGGACAACCTGGGGCTCTATTAGGTGATGACCAAATCTATAATGTTATCGTTACGG
    CCCACGCTTTCGTAATGATTTTCTTTATAGTAATGCCCATCATGATTGGAGGGTTCGGCAACTGA
    CTTGTACCACTAATAATCGGCGCACCTGACATAGCATTCCCACGAATAAATAACATAAGCTTCTG
    ACTTCTCCCACCCTCTTTCCTACTTCTACTGGCCTCCTCAACAGTTGAAGCTGGTGTTGGGACTG
    TAGTC----------
    >EF079973|Ametrida centurio|ROM 100832|COI
    ACATTGTACTTACTATTTGGTGCTTGAGCAGGAATAGTAGGTACCGCACTAAGCCTACTTATTCG
    TGCAGAACTTGGACAACCTGGGGCTCTATTAGGTGATGACCAAATCTATAATGTTATCGTTACAG
    CCCACGCTTTCGTAATGATTTTCTTTATAGTAATGCCCATCATGATTGGAGGGTTCGGCAACTGA
    TGTCA--

    QUERY SET (With known species name)

    >EF079975|Ametrida centurio|ROM 101098|COI
    ACATTGTACTTACTATTTGGCGCTTGAGCAGGGATAGTAGGTACCGCACTAAGCCTACTTATTCG
    TGCAGAACTTGGACAACCTGGGGCTCTATTAGGTGATGACCAAATCTATAATGTTATCGTTACAG
    CCCACGCTTTCGTAATGATTTTCTTTATAGTAATGCCCATCATGATTGGAGGGTTCGGCAACTGA
    CTTGTACCACTAATAATCGGCGCACCTGACATAGCATTCCCACGAATAAATAACATAAGCTTCTG
    ACTTCTCC----
    >EF079991|Anoura caudifer|ROM 115346|COI
    ACTCTGTACTTACTATTCGGCGCCTGAGCTGGCATAGTAGGTACCGCACTAAGCCTTCTCATCCG
    TGCTGAGCTAGGCCAACCCGGAGCCCTGTTAGGTGATGATCAAATTTACAATGTAATCGTAACAG
    CCCATGCCTTTGTAATAATTTTCTTCATAGTTATGCCAATTATAATCGGAGGTTTTGGCAATTGA
    CTAATCCCCCTAATAATTGGAGCACCTGATATAGCATTTCCTCGGATGAATAATATAAGCTTCTG
    ACTTC---

    QUERY SET (With hypothetical species name)

    >A1|S1 P1|B1 C1|D1
    ACTCTATACTTACTGTTTGGTGCCTGAGCCGGTATAGTAGGCACTGCACTTAGCCTTCTCATCCG
    CGCCGAATTGGGCCAACCTGGAGCTTTATTAGGTGATGACCAAATCTATAATGTAATCGTAACAG
    CTCATGCATTCGTGATAATTTTCTTCATAGTGATACCAATCATAATTGGAGGCTTTGGTAACTGA
    CT-----
    >A2|S2 P2|B2 C2|D2
    ACTCTATACTTACTGTTTGGTGCCTGAGCCGGTATAGTAGGCACTGCACTTAGCCTTCTCATCCG
    CGCCGAATTGGGCCAACCTGGAGCTTTATTAGGTGATGACCAAATCTATAATGTAATCGTAACAG
    CTCATGCATTCGTGATAATTTTCTTCATAGTGATACCAATCATAATTGGAGGCTTTGGTAACTGA
    C---

    Team: Prabina Kumar Meher, Tanmaya Kumar Sahu and A. R. Rao

    Indian Agricultural Statistics Research Institute, (ICAR), Library Avenue, New Delhi - 110012