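BLAST results

It finally worked O(∩_∩)O haha~ I'm a little bit happy (*^▽^*)

The BLAST installed through conda would not run, and I still don't know why. A fresh BLAST install under my own home directory runs without problems.

Database building: I ran into some problems with grep, sed, awk and for loops, which I'll write up in a separate post, together with the FASTA formatting issues. Also, it turns out that BLAST can align many query sequences against many database sequences in one run, so everything can be batch-processed in a single pass~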
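Because a single query file may hold many sequences, batch processing mostly comes down to collecting every query record into one FASTA file. The following is only a minimal sketch under the assumption that each sample sits in its own FASTA file; `merge_fasta` and the file names in the usage comment are hypothetical and are not taken from the original run.

```shell
# merge_fasta OUT IN... (hypothetical helper)
# Concatenates several FASTA files into one multi-record query file.
# OUT must not be one of the input files.
merge_fasta() {
    local out="$1"
    shift
    : > "$out"    # create or truncate the output file
    local f
    for f in "$@"; do
        # awk ends every printed line with a newline, so a file that lacks
        # a final newline cannot glue its last sequence line onto the next
        # header. sub() strips a Windows carriage return if one is present.
        awk '{ sub(/\r$/, ""); print }' "$f" >> "$out"
    done
}

# Usage (hypothetical file names):
#   merge_fasta testseq sample1.fa sample2.fa sample3.fa
```

Counting the headers afterwards with `grep -c '^>'` on the merged file is a quick check that no record was lost or fused.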

# Build the database

/home/hmguang/biosoft/blast/blast/ncbi-blast-2.9.0+/bin/makeblastdb -in refdata.fasta -dbtype nucl

# Run the alignment

/home/hmguang/biosoft/blast_project/blast/ncbi-blast-2.9.0+/bin/blastn -query testseq -out result.txt -db refdata.fasta -evalue 1e-5
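For batch runs, the tabular format (`-outfmt 6`) is often easier to post-process than the default pairwise report: it writes one line per hit with the columns qseqid, sseqid, pident, length, mismatch, gapopen, qstart, qend, sstart, send, evalue and bitscore. The wrapper below is a sketch, not the command used for the result that follows; `run_blastn` and the `BLAST_BIN` variable are hypothetical names introduced here for illustration.

```shell
# run_blastn QUERY DB OUT (hypothetical wrapper)
# BLAST_BIN is an assumed variable holding the bin/ directory of the
# BLAST+ install; when it is unset or empty, blastn is looked up on PATH.
run_blastn() {
    local query="$1"
    local db="$2"
    local out="$3"
    # Same E-value cutoff as the command above, but tabular output.
    "${BLAST_BIN:+$BLAST_BIN/}blastn" \
        -query "$query" -db "$db" -out "$out" \
        -evalue 1e-5 -outfmt 6
}

# Usage (output file name is hypothetical):
#   run_blastn testseq refdata.fasta result.tsv
```

Keeping the install location in one variable also avoids retyping the long absolute path for every command.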

Result:

BLASTN 2.9.0+

Reference: Zheng Zhang, Scott Schwartz, Lukas Wagner, and Webb
Miller (2000), "A greedy algorithm for aligning DNA sequences", J
Comput Biol 2000; 7(1-2):203-14.

Database: refdata.fasta
           16 sequences; 411,130 total letters

Query= YW607_F06

Length=524
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

NC_000017.11:4932277-4935023Homosapienschromosome17_GP1BA,GRCh38....  350     8e-98
NC_000017.11:4932277-4935023Homosapienschromosome17_GP1BA_core,GR...  350     8e-98

>NC_000017.11:4932277-4935023Homosapienschromosome17_GP1BA,GRCh38.p13PrimaryAssembly
Length=2747

 Score = 350 bits (189),  Expect = 8e-98
 Identities = 194/197 (98%), Gaps = 0/197 (0%)
 Strand=Plus/Minus

Query  1    TACAGCGAGTTCTCTTGGAGGAGAAGGGTGTCGAGATTCTCCAGCCCATTCAGGAGCCCA  60
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  930  TACAGCGAGTTCTCTTGGAGGAGAAGGGTGTCGAGATTCTCCAGCCCATTCAGGAGCCCA  871

Query  61   GCGGGGAGCTCAGTCAAGTTGTTGTTAGCCAGACTGAGCTTCTCCAGCTTGGGTGTGGGC  120
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  870  GCGGGGAGCTCAGTCAAGTTGTTGTTAGCCAGACTGAGCTTCTCCAGCTTGGGTGTGGGC  811

Query  121  GTCAGGAGCCCTGGGGGCAGGGTCTTCAGCTCATTGCCTTTCAGGTAGAGCTCTTGGAGT  180
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  810  GTCAGGAGCCCTGGGGGCAGGGTCTTCAGCTCATTGCCTTTCAGGTAGAGCTCTTGGAGT  751

Query  181  TCGCMAGTACCACGCAG  197
            |||| |  |||||||||
Sbjct  750  TCGCCAAGACCACGCAG  734

>NC_000017.11:4932277-4935023Homosapienschromosome17_GP1BA_core,GRCh38.p13PrimaryAssembly
Length=357

 Score = 350 bits (189),  Expect = 8e-98
 Identities = 194/197 (98%), Gaps = 0/197 (0%)
 Strand=Plus/Minus

Query  1    TACAGCGAGTTCTCTTGGAGGAGAAGGGTGTCGAGATTCTCCAGCCCATTCAGGAGCCCA  60
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  295  TACAGCGAGTTCTCTTGGAGGAGAAGGGTGTCGAGATTCTCCAGCCCATTCAGGAGCCCA  236

Query  61   GCGGGGAGCTCAGTCAAGTTGTTGTTAGCCAGACTGAGCTTCTCCAGCTTGGGTGTGGGC  120
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  235  GCGGGGAGCTCAGTCAAGTTGTTGTTAGCCAGACTGAGCTTCTCCAGCTTGGGTGTGGGC  176

Query  121  GTCAGGAGCCCTGGGGGCAGGGTCTTCAGCTCATTGCCTTTCAGGTAGAGCTCTTGGAGT  180
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  175  GTCAGGAGCCCTGGGGGCAGGGTCTTCAGCTCATTGCCTTTCAGGTAGAGCTCTTGGAGT  116

Query  181  TCGCMAGTACCACGCAG  197
            |||| |  |||||||||
Sbjct  115  TCGCCAAGACCACGCAG  99

Lambda      K        H
   1.37    0.632     1.16

Gapped
Lambda      K        H
   1.28    0.460    0.850

Effective search space used: 207467130

Query= YW614_E07

Length=284
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

NC_000005.10:52989326-53094779Homosapienschromosome5_ITGA2,GRCh38...  291     3e-80
NC_000005.10:52989326-53094779Homosapienschromosome5_ITGA2_4core,...  291     3e-80

>NC_000005.10:52989326-53094779Homosapienschromosome5_ITGA2,GRCh38.p13PrimaryAssembly
Length=105454

 Score = 291 bits (157),  Expect = 3e-80
 Identities = 161/164 (98%), Gaps = 0/164 (0%)
 Strand=Plus/Plus

Query  1      TTGTCAGCAACCAAAACAAAARGTTAACATTTTCAGTAACGCTGAAAAATAAAAGGGAAA  60
              ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||||||
Sbjct  83807  TTGTCAGCAACCAAAACAAAAGGTTAACATTTTCAGTAACGCTGAAAAATAAAAGGGAAA  83866

Query  61     GTGCATACAACACTGGAATTGTTGTTGATTTTTCAGAAAACTTGTTTTTTGCATCATTCT  120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  83867  GTGCATACAACACTGGAATTGTTGTTGATTTTTCAGAAAACTTGTTTTTTGCATCATTCT  83926

Query  121    CCCTGCCGGTATGTGATGAGACCCTGTACTTAYGTCCACCATGC  164
              ||||||||||||||||||||||||||||||||  ||||||||||
Sbjct  83927  CCCTGCCGGTATGTGATGAGACCCTGTACTTACTTCCACCATGC  83970

>NC_000005.10:52989326-53094779Homosapienschromosome5_ITGA2_4core,GRCh38.p13PrimaryAssembly
Length=1671

 Score = 291 bits (157),  Expect = 3e-80
 Identities = 161/164 (98%), Gaps = 0/164 (0%)
 Strand=Plus/Plus

Query  1     TTGTCAGCAACCAAAACAAAARGTTAACATTTTCAGTAACGCTGAAAAATAAAAGGGAAA  60
             ||||||||||||||||||||| ||||||||||||||||||||||||||||||||||||||
Sbjct  1022  TTGTCAGCAACCAAAACAAAAGGTTAACATTTTCAGTAACGCTGAAAAATAAAAGGGAAA  1081

Query  61    GTGCATACAACACTGGAATTGTTGTTGATTTTTCAGAAAACTTGTTTTTTGCATCATTCT  120
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  1082  GTGCATACAACACTGGAATTGTTGTTGATTTTTCAGAAAACTTGTTTTTTGCATCATTCT  1141

Query  121   CCCTGCCGGTATGTGATGAGACCCTGTACTTAYGTCCACCATGC  164
             ||||||||||||||||||||||||||||||||  ||||||||||
Sbjct  1142  CCCTGCCGGTATGTGATGAGACCCTGTACTTACTTCCACCATGC  1185

Lambda      K        H
   1.42    0.646     1.21

Gapped
Lambda      K        H
   1.28    0.460    0.850

Effective search space used: 109283972

Query= YW621_D08

Length=266
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

NC_000017.11:c44389649-44372181Homosapienschromosome17_ITGA2B_7co...  329     5e-92
NC_000017.11:c44389649-44372181Homosapienschromosome17_ITGA2B_7co...  329     5e-92

>NC_000017.11:c44389649-44372181Homosapienschromosome17_ITGA2B_7core,GRCh38.p13PrimaryAssembly
Length=17469

 Score = 329 bits (178),  Expect = 5e-92
 Identities = 179/180 (99%), Gaps = 0/180 (0%)
 Strand=Plus/Minus

Query  1     GCCTTTCTKAGGTCCCAGATCCTTTAAGGCCCATGCCCTCTGCCTCCTCACCAGCTCACG  60
             |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  9315  GCCTTTCTGAGGTCCCAGATCCTTTAAGGCCCATGCCCTCTGCCTCCTCACCAGCTCACG  9256

Query  61    GGTGTCTTGGTCTGAGGTAGGACACAGCTCTTCACAGCAGGATTCAGTGAATCTTGCACC  120
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  9255  GGTGTCTTGGTCTGAGGTAGGACACAGCTCTTCACAGCAGGATTCAGTGAATCTTGCACC  9196

Query  121   AGTAGCTGGACAGAGGCCTTCACCACTGGCTGAGCTCTGATGGGATAGGGTGATGGGGTA  180
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  9195  AGTAGCTGGACAGAGGCCTTCACCACTGGCTGAGCTCTGATGGGATAGGGTGATGGGGTA  9136

>NC_000017.11:c44389649-44372181Homosapienschromosome17_ITGA2B_7core,GRCh38.p13PrimaryAssembly
Length=1773

 Score = 329 bits (178),  Expect = 5e-92
 Identities = 179/180 (99%), Gaps = 0/180 (0%)
 Strand=Plus/Minus

Query  1    GCCTTTCTKAGGTCCCAGATCCTTTAAGGCCCATGCCCTCTGCCTCCTCACCAGCTCACG  60
            |||||||| |||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  632  GCCTTTCTGAGGTCCCAGATCCTTTAAGGCCCATGCCCTCTGCCTCCTCACCAGCTCACG  573

Query  61   GGTGTCTTGGTCTGAGGTAGGACACAGCTCTTCACAGCAGGATTCAGTGAATCTTGCACC  120
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  572  GGTGTCTTGGTCTGAGGTAGGACACAGCTCTTCACAGCAGGATTCAGTGAATCTTGCACC  513

Query  121  AGTAGCTGGACAGAGGCCTTCACCACTGGCTGAGCTCTGATGGGATAGGGTGATGGGGTA  180
            ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  512  AGTAGCTGGACAGAGGCCTTCACCACTGGCTGAGCTCTGATGGGATAGGGTGATGGGGTA  453

Lambda      K        H
   1.36    0.630     1.15

Gapped
Lambda      K        H
   1.28    0.460    0.850

Effective search space used: 101888816

Query= YW665_C09

Length=307
                                                                      Score     E
Sequences producing significant alignments:                          (Bits)  Value

NC_000007.14:80602207-80679277Homosapienschromosome7_CD36,GRCh38....  416     5e-118
NC_000007.14:80602207-80679277Homosapienschromosome7_CD36_core,GR...  416     5e-118

>NC_000007.14:80602207-80679277Homosapienschromosome7_CD36,GRCh38.p13PrimaryAssembly
Length=77071

 Score = 416 bits (225),  Expect = 5e-118
 Identities = 228/229 (99%), Gaps = 1/229 (0%)
 Strand=Plus/Plus

Query  1      TAGGTCAATCTATGCTGTATTTGAATCCGACGTTAATCTGAAAGGAATCCCTGTGTATAG  60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  68768  TAGGTCAATCTATGCTGTATTTGAATCCGACGTTAATCTGAAAGGAATCCCTGTGTATAG  68827

Query  61     ATTTGTTCTTCCATCCAAGGCCTTTGCCTCTCCAGTTGAAAACCCAGACAACTATTGTTT  120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  68828  ATTTGTTCTTCCATCCAAGGCCTTTGCCTCTCCAGTTGAAAACCCAGACAACTATTGTTT  68887

Query  121    CTGCACAGAAAAAATTATCTCAAAAAATTGTACATCATATGGTGTGCTAGACATCAGCAA  180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  68888  CTGCACAGAAAAAATTATCTCAAAAAATTGTACATCATATGGTGTGCTAGACATCAGCAA  68947

Query  181    ATGCAAAGAAGGTGAGTAAATAACCTCAGTAGCACAG-CCATACCATAA  228
              ||||||||||||||||||||||||||||||||||||| |||||||||||
Sbjct  68948  ATGCAAAGAAGGTGAGTAAATAACCTCAGTAGCACAGTCCATACCATAA  68996

>NC_000007.14:80602207-80679277Homosapienschromosome7_CD36_core,GRCh38.p13PrimaryAssembly
Length=1580

 Score = 416 bits (225),  Expect = 5e-118
 Identities = 228/229 (99%), Gaps = 1/229 (0%)
 Strand=Plus/Plus

Query  1     TAGGTCAATCTATGCTGTATTTGAATCCGACGTTAATCTGAAAGGAATCCCTGTGTATAG  60
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  865   TAGGTCAATCTATGCTGTATTTGAATCCGACGTTAATCTGAAAGGAATCCCTGTGTATAG  924

Query  61    ATTTGTTCTTCCATCCAAGGCCTTTGCCTCTCCAGTTGAAAACCCAGACAACTATTGTTT  120
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  925   ATTTGTTCTTCCATCCAAGGCCTTTGCCTCTCCAGTTGAAAACCCAGACAACTATTGTTT  984

Query  121   CTGCACAGAAAAAATTATCTCAAAAAATTGTACATCATATGGTGTGCTAGACATCAGCAA  180
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Sbjct  985   CTGCACAGAAAAAATTATCTCAAAAAATTGTACATCATATGGTGTGCTAGACATCAGCAA  1044

Query  181   ATGCAAAGAAGGTGAGTAAATAACCTCAGTAGCACAG-CCATACCATAA  228
             ||||||||||||||||||||||||||||||||||||| |||||||||||
Sbjct  1045  ATGCAAAGAAGGTGAGTAAATAACCTCAGTAGCACAGTCCATACCATAA  1093

Lambda      K        H
   1.35    0.626     1.14

Gapped
Lambda      K        H
   1.28    0.460    0.850

Effective search space used: 118733338

  Database: refdata.fasta
    Posted date:  Nov 12, 2019  3:50 PM
  Number of letters in database: 411,130
  Number of sequences in database:  16

Matrix: blastn matrix 1 -2
Gap Penalties: Existence: 0, Extension: 2.5
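A report like this gets long quickly once there are more queries, so a one-line-per-hit summary can help. The awk sketch below is an illustration only: `summarize_blast` is a hypothetical helper, it assumes the default pairwise layout of `result.txt` with each subject header starting with `>` at the beginning of a line, and it keeps only the first line of a subject title that wraps.

```shell
# summarize_blast REPORT (hypothetical helper)
# Prints: query, subject, bit score, E-value, identities (tab-separated),
# one line per alignment found in a default-format BLAST report.
summarize_blast() {
    awk -v OFS='\t' '
        /^Query= /      { query = $2 }              # current query name
        /^>/            { subject = substr($0, 2) } # subject title, ">" dropped
        /Score = /      { score = $3; evalue = $NF }
        /Identities = / {
            ident = $3 " " $4                       # e.g. 194/197 (98%),
            sub(/,$/, "", ident)                    # drop the trailing comma
            print query, subject, score, evalue, ident
        }
    ' "$1"
}

# Usage:
#   summarize_blast result.txt
```

The output can then be filtered with the usual tools, for example `summarize_blast result.txt | grep -v _core` to hide the hits against the shorter core sequences.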