The user must supply two input files, as in BLOG 2.0 (http://dmb.iasi.cnr.it/blog-downloads.php): a reference dataset and a query dataset. In both the reference and query sets, the barcode sequences must be in FASTA format, and the sequence names must follow the BOLD (http://www.boldsystems.org/) format; in the examples, each name consists of four fields separated by "|". If the name of a barcode sequence in the query set is not known, a hypothetical name must be supplied, and it must also be in BOLD format. Examples of a reference set, a query set with known species names, and a query set with hypothetical names are provided below.
REFERENCE SET
>EF079971|Ametrida centurio|ROM 98798|COI
ACATTGTACTTACTATTTGGTGCTTGAGCAGGAATAGTAGGTACCGCACTAAGCCTACTTATTCG
TGCAGAACTTGGACAACCTGGGGCTCTATTAGGTGACGACCAAATCTATAATGTTATCGTTACAG
CCCACGCTTTCGTAATGATTTTCTTTATAGTAATACCCATCATGATTGGAGGGTTCGGCAACTGA
CTTGTACCACTAATAATTGGCGCACCTGACATAGCATTCCCACGAATAAATAACATAAGCTTCTG
ACTTCTCCCACCCTCTTTCCTGCTTCTACTGGCCTCCTCAACAGTCGAAGCTGGTGTTGGGACTG
CTTATTT------
>EF079972|Ametrida centurio|ROM 98849|COI
ACATTGTACTTACTATTTGGTGCTTGAGCAGGAATAGTAGGTACCGCACTAAGCCTACTTATTCG
TGCAGAACTTGGACAACCTGGGGCTCTATTAGGTGATGACCAAATCTATAATGTTATCGTTACGG
CCCACGCTTTCGTAATGATTTTCTTTATAGTAATGCCCATCATGATTGGAGGGTTCGGCAACTGA
CTTGTACCACTAATAATCGGCGCACCTGACATAGCATTCCCACGAATAAATAACATAAGCTTCTG
ACTTCTCCCACCCTCTTTCCTACTTCTACTGGCCTCCTCAACAGTTGAAGCTGGTGTTGGGACTG
TAGTC----------
>EF079973|Ametrida centurio|ROM 100832|COI
ACATTGTACTTACTATTTGGTGCTTGAGCAGGAATAGTAGGTACCGCACTAAGCCTACTTATTCG
TGCAGAACTTGGACAACCTGGGGCTCTATTAGGTGATGACCAAATCTATAATGTTATCGTTACAG
CCCACGCTTTCGTAATGATTTTCTTTATAGTAATGCCCATCATGATTGGAGGGTTCGGCAACTGA
TGTCA--
QUERY SET (With known species name)
>EF079975|Ametrida centurio|ROM 101098|COI
ACATTGTACTTACTATTTGGCGCTTGAGCAGGGATAGTAGGTACCGCACTAAGCCTACTTATTCG
TGCAGAACTTGGACAACCTGGGGCTCTATTAGGTGATGACCAAATCTATAATGTTATCGTTACAG
CCCACGCTTTCGTAATGATTTTCTTTATAGTAATGCCCATCATGATTGGAGGGTTCGGCAACTGA
CTTGTACCACTAATAATCGGCGCACCTGACATAGCATTCCCACGAATAAATAACATAAGCTTCTG
ACTTCTCC----
>EF079991|Anoura caudifer|ROM 115346|COI
ACTCTGTACTTACTATTCGGCGCCTGAGCTGGCATAGTAGGTACCGCACTAAGCCTTCTCATCCG
TGCTGAGCTAGGCCAACCCGGAGCCCTGTTAGGTGATGATCAAATTTACAATGTAATCGTAACAG
CCCATGCCTTTGTAATAATTTTCTTCATAGTTATGCCAATTATAATCGGAGGTTTTGGCAATTGA
CTAATCCCCCTAATAATTGGAGCACCTGATATAGCATTTCCTCGGATGAATAATATAAGCTTCTG
ACTTC---
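The named records above all share the same header layout, so a file can be sanity-checked before submission. The following is a minimal sketch, not part of the tool itself: the function name check_fasta is hypothetical, and the rules it applies (exactly four non-empty "|"-separated fields, and only A, C, G, T, N or "-" in sequence lines) are assumptions inferred from the examples rather than a documented specification.

```python
import re

# Assumption: sequence lines contain only nucleotides, N, or gap characters.
ALLOWED = re.compile(r"^[ACGTNacgtn-]*$")


def check_fasta(text):
    """Return a list of (line_number, message) problems found in FASTA text."""
    problems = []
    have_header = False
    header_line = 0
    seq_len = 0
    for lineno, raw in enumerate(text.splitlines(), start=1):
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # Close the previous record before starting a new one.
            if have_header and seq_len == 0:
                problems.append((header_line, "record has no sequence"))
            # Assumption: a header has exactly four non-empty fields,
            # as in ">EF079971|Ametrida centurio|ROM 98798|COI".
            fields = [f.strip() for f in line[1:].split("|")]
            if len(fields) != 4 or not all(fields):
                problems.append(
                    (lineno, "header does not have four non-empty fields")
                )
            have_header = True
            header_line = lineno
            seq_len = 0
        else:
            if not have_header:
                problems.append((lineno, "sequence data before first header"))
            if not ALLOWED.match(line):
                problems.append((lineno, "unexpected character in sequence"))
            seq_len += len(line)
    if have_header and seq_len == 0:
        problems.append((header_line, "record has no sequence"))
    return problems
```

An empty list means no problems were found under these assumed rules; it does not guarantee that the tool will accept the file.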
QUERY SET (With hypothetical species name)
>A1|S1 P1|B1 C1|D1
ACTCTATACTTACTGTTTGGTGCCTGAGCCGGTATAGTAGGCACTGCACTTAGCCTTCTCATCCG
CGCCGAATTGGGCCAACCTGGAGCTTTATTAGGTGATGACCAAATCTATAATGTAATCGTAACAG
CTCATGCATTCGTGATAATTTTCTTCATAGTGATACCAATCATAATTGGAGGCTTTGGTAACTGA
CT-----
>A2|S2 P2|B2 C2|D2
ACTCTATACTTACTGTTTGGTGCCTGAGCCGGTATAGTAGGCACTGCACTTAGCCTTCTCATCCG
CGCCGAATTGGGCCAACCTGGAGCTTTATTAGGTGATGACCAAATCTATAATGTAATCGTAACAG
CTCATGCATTCGTGATAATTTTCTTCATAGTGATACCAATCATAATTGGAGGCTTTGGTAACTGA
C---
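When many query sequences lack names, the hypothetical headers can be generated rather than typed by hand. The sketch below simply reproduces the numbering pattern of the example above (A1|S1 P1|B1 C1|D1, A2|S2 P2|B2 C2|D2, and so on); the function names are hypothetical, and any other placeholder values with the same four-field layout should presumably work equally well.

```python
def hypothetical_header(index):
    """Build a placeholder header such as '>A1|S1 P1|B1 C1|D1' for record `index`."""
    # The same number is reused in every field, following the example pattern.
    return ">A{0}|S{0} P{0}|B{0} C{0}|D{0}".format(index)


def name_unlabelled(sequences):
    """Return FASTA text for a list of sequence strings, one placeholder header each."""
    lines = []
    for i, seq in enumerate(sequences, start=1):
        lines.append(hypothetical_header(i))
        lines.append(seq)
    return "\n".join(lines) + "\n"
```

For example, name_unlabelled(["ACGT", "TTGA"]) yields two records headed ">A1|S1 P1|B1 C1|D1" and ">A2|S2 P2|B2 C2|D2".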