AlfalfaGEDB Alfalfa Gene Editing Database

M. sativa cultivar XinJiangDaYe / MS.gene22188


Query id Subject id identity % alignment length mismatches gap openings q. start q. end s. start s. end e-value bit score
MS.gene22188.t1 MTR_3g450930 99.535 215 1 0 1 215 1 215 1.41e-160 442
MS.gene22188.t1 MTR_3g450790 77.209 215 49 0 1 215 1 215 7.20e-128 359
MS.gene22188.t1 MTR_5g090920 70.968 217 60 2 1 215 1 216 3.84e-111 317
MS.gene22188.t1 MTR_5g090910 69.907 216 64 1 1 215 1 216 5.73e-109 311
MS.gene22188.t1 MTR_3g064700 56.338 213 92 1 1 212 1 213 1.90e-88 259
MS.gene22188.t1 MTR_5g090910 69.079 152 46 1 65 215 8 159 2.62e-73 219
MS.gene22188.t1 MTR_1g088850 46.190 210 110 3 1 208 1 209 9.80e-63 194
MS.gene22188.t1 MTR_1g026140 44.495 218 115 3 3 214 5 222 9.51e-62 192
MS.gene22188.t1 MTR_1g088840 43.333 210 117 2 2 210 3 211 2.66e-58 183
MS.gene22188.t1 MTR_1g088825 43.333 210 117 2 2 210 3 211 2.66e-58 183
MS.gene22188.t1 MTR_1g088845 42.857 210 118 2 2 210 3 211 4.38e-55 175
MS.gene22188.t1 MTR_1g088840 45.122 164 89 1 47 210 2 164 1.42e-47 154
MS.gene22188.t1 MTR_1g088840 48.673 113 57 1 98 210 4 115 4.95e-32 113
MS.gene22188.t1 MTR_8g098420 30.994 171 114 3 1 167 1 171 1.77e-18 80.1
MS.gene22188.t1 MTR_8g098420 30.994 171 114 3 1 167 1 171 3.78e-18 80.5
MS.gene22188.t1 MTR_1g492670 53.030 66 30 1 145 210 2 66 8.81e-17 72.0
MS.gene22188.t1 MTR_4g134370 31.928 166 98 5 11 170 25 181 1.55e-12 64.7
MS.gene22188.t1 MTR_4g134380 30.178 169 103 5 11 173 25 184 5.86e-12 63.2
MS.gene22188.t1 MTR_8g098430 27.485 171 116 4 3 167 5 173 4.24e-11 60.8
MS.gene22188.t1 MTR_4g134380 30.323 155 93 5 11 159 25 170 5.45e-11 59.7
Query id Subject id identity % alignment length mismatches gap openings q. start q. end s. start s. end e-value bit score
MS.gene22188.t1 AT2G30860 60.280 214 85 0 1 214 1 214 3.58e-101 292
MS.gene22188.t1 AT2G30870 58.605 215 89 0 1 215 1 215 1.73e-97 283
MS.gene22188.t1 AT3G03190 52.336 214 101 1 1 213 1 214 1.12e-75 227
MS.gene22188.t1 AT5G17220 50.711 211 103 1 1 210 1 211 4.30e-74 223
MS.gene22188.t1 AT2G30860 58.824 153 63 0 1 153 1 153 3.13e-66 201
MS.gene22188.t1 AT3G62760 46.890 209 108 3 1 207 1 208 1.59e-65 202
MS.gene22188.t1 AT1G02930 42.512 207 116 3 3 208 4 208 1.14e-55 176
MS.gene22188.t1 AT1G02930 42.512 207 116 3 3 208 4 208 1.14e-55 176
MS.gene22188.t1 AT2G02930 44.286 210 112 3 3 208 4 212 7.19e-55 174
MS.gene22188.t1 AT4G02520 43.062 209 114 3 3 207 4 211 9.75e-55 174
MS.gene22188.t1 AT1G02920 42.512 207 117 2 3 208 4 209 2.34e-54 173
MS.gene22188.t1 AT2G47730 42.857 210 117 3 5 212 54 262 8.97e-54 173
MS.gene22188.t1 AT2G47730 42.857 210 117 3 5 212 54 262 8.97e-54 173
MS.gene22188.t1 AT1G49860 43.367 196 108 2 16 208 19 214 9.70e-51 165
MS.gene22188.t1 AT1G02950 40.686 204 118 3 4 205 26 228 2.27e-49 161
MS.gene22188.t1 AT1G02950 40.686 204 118 3 4 205 26 228 2.27e-49 161
MS.gene22188.t1 AT1G02950 40.686 204 118 3 4 205 28 230 2.70e-49 161
MS.gene22188.t1 AT1G02950 40.686 204 118 3 4 205 38 240 4.03e-49 161
MS.gene22188.t1 AT1G02940 39.713 209 123 3 4 210 40 247 3.58e-46 153
MS.gene22188.t1 AT1G02940 39.713 209 123 3 4 210 40 247 3.58e-46 153
MS.gene22188.t1 AT1G02940 39.713 209 123 3 4 210 53 260 4.61e-46 154
MS.gene22188.t1 AT1G02940 39.713 209 123 3 4 210 65 272 5.38e-46 154
MS.gene22188.t1 AT1G02950 39.583 144 85 2 63 205 60 202 7.94e-32 115
MS.gene22188.t1 AT5G41210 28.655 171 118 3 1 167 2 172 2.19e-13 67.4
MS.gene22188.t1 AT5G41240 28.070 171 119 2 1 167 1 171 2.52e-13 68.6
MS.gene22188.t1 AT5G41220 26.901 171 121 2 1 167 1 171 2.97e-11 62.4

Find 63 sgRNAs with CRISPR-Local

Find 0 sgRNAs with CRISPR-GE


CRISPR-Local

CRISPR-Local
sgRNA_sequence on_target_score Position Region
GGTACATGTTGATGGCTTTA+AGG 0.187628 3.4:+40822959 MS.gene22188:CDS
CGTTTCGGGCAAGCATAGTC+TGG 0.293826 3.4:-40822894 None:intergenic
CAACTTGGTTATGAATCTTT+TGG 0.307735 3.4:+40837020 MS.gene22188:CDS
TAGTCACCATCTTGAATAAC+AGG 0.313539 3.4:-40836555 None:intergenic
ACGATGGTTGTGAAGGTATA+TGG 0.320484 3.4:+40822870 None:intergenic
TTCAATGGGAAAAGAGAATA+TGG 0.323458 3.4:+40837218 MS.gene22188:CDS
GACACACAATCACCCGTTTC+GGG 0.336118 3.4:-40822908 None:intergenic
AGCTGAATACCTTAAGTTGC+AGG 0.348647 3.4:+40822995 MS.gene22188:CDS
GATCTTAGCCATCTTCCATT+TGG 0.361097 3.4:+40837180 MS.gene22188:CDS
GACTGGTGGGTTGAAGTTAT+GGG 0.372883 3.4:-40836995 None:intergenic
AGACACACAATCACCCGTTT+CGG 0.378935 3.4:-40822909 None:intergenic
GATGGTGACTATACTCTCTA+TGG 0.382544 3.4:+40836567 MS.gene22188:CDS
AGACTGGTGGGTTGAAGTTA+TGG 0.387257 3.4:-40836996 None:intergenic
GTACATGTTGATGGCTTTAA+GGG 0.392713 3.4:+40822960 MS.gene22188:CDS
AGACTATGCTTGCCCGAAAC+GGG 0.405607 3.4:+40822896 MS.gene22188:CDS
TCATTATTTGATGAATTCAA+TGG 0.409469 3.4:+40837203 MS.gene22188:CDS
AAAGGTGTTGGATATTTATG+AGG 0.436190 3.4:+40837107 MS.gene22188:CDS
AATCAAGGGACTGATTTGCT+TGG 0.451580 3.4:+40836922 MS.gene22188:CDS
TTTGAAACGGTACATGTTGA+TGG 0.459046 3.4:+40822951 MS.gene22188:CDS
TTCATCAAATAATGACCAAA+TGG 0.459253 3.4:-40837195 None:intergenic
CATTATTTGATGAATTCAAT+GGG 0.460909 3.4:+40837204 MS.gene22188:CDS
TGGAAAGACAATAGAAGAAA+GGG 0.463681 3.4:+40836942 MS.gene22188:CDS
CAGACTATGCTTGCCCGAAA+CGG 0.465174 3.4:+40822895 MS.gene22188:CDS
TTGTTTCATATGCAGCCCTT+TGG 0.467394 3.4:+40836525 MS.gene22188:intron
GGTGGGATGATATTAGCAAT+AGG 0.491315 3.4:+40837268 MS.gene22188:CDS
ATATTAGCAATAGGCCATCT+TGG 0.491886 3.4:+40837277 MS.gene22188:CDS
TTACTCCCTGTTATTCAAGA+TGG 0.495181 3.4:+40836549 MS.gene22188:CDS
TTGGAAAGACAATAGAAGAA+AGG 0.498497 3.4:+40836941 MS.gene22188:CDS
CAATAGGCCATCTTGGAAGA+AGG 0.501103 3.4:+40837284 MS.gene22188:CDS
AATAATGACCAAATGGAAGA+TGG 0.513184 3.4:-40837188 None:intergenic
AAAGGATATTGAGTTTGAAA+CGG 0.517622 3.4:+40822938 MS.gene22188:CDS
GGGAAAAGAGAATATGGTAA+AGG 0.528479 3.4:+40837224 MS.gene22188:CDS
AGAAAAGCTTGGAAAGGTGT+TGG 0.539764 3.4:+40837095 MS.gene22188:CDS
CAACCCACCAGTCTACAACT+TGG 0.543299 3.4:+40837005 MS.gene22188:CDS
TTCATAACCAAGTTGTAGAC+TGG 0.547963 3.4:-40837012 None:intergenic
TCAAATTACGATGGTTGTGA+AGG 0.550550 3.4:+40822863 None:intergenic
GCTATCAAAGACTAAGTACT+TGG 0.554970 3.4:+40837134 MS.gene22188:CDS
CAAGAGAGTGAAGAAAAGCT+TGG 0.555057 3.4:+40837084 MS.gene22188:CDS
TCAAAGACTAAGTACTTGGC+TGG 0.557175 3.4:+40837138 MS.gene22188:CDS
GATTGTGTGTCTGATTGAAA+AGG 0.560625 3.4:+40822920 MS.gene22188:CDS
TGTAGAACCTTCTTCCAAGA+TGG 0.563715 3.4:-40837291 None:intergenic
AATAGAAGAAAGGGGTCTTG+TGG 0.568965 3.4:+40836951 MS.gene22188:CDS
CACAAACCCTCTAATCACAT+AGG 0.571439 3.4:-40837327 None:intergenic
GTTTCGGGCAAGCATAGTCT+GGG 0.573332 3.4:-40822893 None:intergenic
AAAGGGGTCTTGTGGAGCAA+TGG 0.573472 3.4:+40836959 MS.gene22188:CDS
TGGCTTTAAGGGAGAGCACA+AGG 0.573525 3.4:+40822971 MS.gene22188:CDS
GAGTGAAGAAAAGCTTGGAA+AGG 0.576328 3.4:+40837089 MS.gene22188:CDS
AGAGGAAGCATGTGAATGCT+TGG 0.577585 3.4:+40837247 MS.gene22188:CDS
ATAACCAAGTTGTAGACTGG+TGG 0.589687 3.4:-40837009 None:intergenic
ACAAACCCTCTAATCACATA+GGG 0.591213 3.4:-40837326 None:intergenic
TGTTGGATATTTATGAGGAG+AGG 0.593358 3.4:+40837112 MS.gene22188:CDS
AATAACAGGGAGTAATCCAA+AGG 0.597435 3.4:-40836541 None:intergenic
CAGAATCTAGGGCAATCCTA+AGG 0.599160 3.4:+40836878 MS.gene22188:intron
AAGAGAATATGGTAAAGGAG+AGG 0.603130 3.4:+40837229 MS.gene22188:CDS
TAACCAAGTTGTAGACTGGT+GGG 0.611018 3.4:-40837008 None:intergenic
GGAAGCATGTGAATGCTTGG+TGG 0.612663 3.4:+40837250 MS.gene22188:CDS
GAAGCATGTGAATGCTTGGT+GGG 0.616766 3.4:+40837251 MS.gene22188:CDS
CAGATGGAAGTCCCACTAGT+GGG 0.622928 3.4:-40837047 None:intergenic
ATAACAGGGAGTAATCCAAA+GGG 0.626677 3.4:-40836540 None:intergenic
TCAGATGGAAGTCCCACTAG+TGG 0.646315 3.4:-40837048 None:intergenic
GGAAAGACAATAGAAGAAAG+GGG 0.672825 3.4:+40836943 MS.gene22188:CDS
GAAGATGGCTAAGATCAGCA+AGG 0.700476 3.4:-40837173 None:intergenic
AGTCACCATCTTGAATAACA+GGG 0.712338 3.4:-40836554 None:intergenic

CRISPR-GE

badsite warning sgRNA_sequence Strand Position Region GC_content


Chromosome Type Strat End Strand Name
chr3.4 gene 40822873 40837335 40822873 ID=MS.gene22188
chr3.4 mRNA 40822873 40837335 40822873 ID=MS.gene22188.t1;Parent=MS.gene22188
chr3.4 exon 40822873 40823016 40822873 ID=MS.gene22188.t1.exon1;Parent=MS.gene22188.t1
chr3.4 CDS 40822873 40823016 40822873 ID=cds.MS.gene22188.t1;Parent=MS.gene22188.t1
chr3.4 exon 40836540 40836588 40836540 ID=MS.gene22188.t1.exon2;Parent=MS.gene22188.t1
chr3.4 CDS 40836540 40836588 40836540 ID=cds.MS.gene22188.t1;Parent=MS.gene22188.t1
chr3.4 exon 40836881 40837335 40836881 ID=MS.gene22188.t1.exon3;Parent=MS.gene22188.t1
chr3.4 CDS 40836881 40837335 40836881 ID=cds.MS.gene22188.t1;Parent=MS.gene22188.t1
Gene Sequence

>MS.gene22188

ATGGTTGTGAAGGTATATGGCCCAGACTATGCTTGCCCGAAACGGGTGATTGTGTGTCTGATTGAAAAGGATATTGAGTTTGAAACGGTACATGTTGATGGCTTTAAGGGAGAGCACAAGGAAGCTGAATACCTTAAGTTGCAGGTTCTTTTTCTCTGTTTTAATTTTTCTATTATACTTTTGGTCCCAAGTTTTTTTTTTTTTTTTTAATTCAATCTACAATATGTTGGGTTGAATTCAAATGGTCTATGGTCGATAATTGCCACCTTTATGCAGGTGGCATATAGGTGGGTGTTTTTAGGCGGTATAGATTTCAATTGATAATTTTAATTTGCCTTTTAAAACCAAATTTAAAATATGTATTATTTAATCTCAATTCAACGGCCATCAAAACACTAAATGTATGATAGTCATATGATCCAAATTGACTTGAGCCGGCTCATTGGATTGAATAAAATTATAATTTGATGGAAAAAAAAAAATTAGAACAAACCCAAAGACCAAAAAAATATGGTGTTTCTTACTTTCTCTTGGGTGAAAACCCTTTTTGGTACAAAATAGAAAAGTTCTTTAGGATAAGGGTCGATTCTCACAAATGCAGAATAATGTATTTGAATTCTATCTAGTGAGAAGATCATGCAATCAAATGATCTGAAATTTCTGATTAAGATAATATGACTCATATTTTGACTTTATAGAATAGATCCAAAACACAGTTCAAGTTTGATCTTTAGATTCCTGTAACTGCGTGATGCAATTGCAAATACGTGTAATTTGAGCTACCCGGTTTTCAACAAAGGACCGAAATGTAAAATTGCACTGCATGTGGATTTTTACAATTAGGCAATACAAGTATATTTTATGGATTTGGATTTAACGTTGTAAATGCGTCCACACATATATTCATCCTAACTGTCCGATCTAAAATCGACGTCCGAAATTGATTATAGTTGATCTATGTATTGTATGATCTCAACCGTCCGATTTAAAATCAATGGTTGTGATTTAAAACAGCGTACATCTGTGTGCTTTTATAACATTGGGGAATGTGGATCCATCCCTGTTTTATACGTAATGATAGTCAAATTTTTGTGGGGTCCAACCTTTGCCAGCCAACTTATATTTGACAAAATGGAAGGTTGCATGTGATGACAAGTAGCAAAGTGTATGGTGTACAGTTGGTGTAGTGCATGAGTAACACATGTTGTGGTGCATGCATTGCAAGGGACACTTGGTGTTTTATTAACGTATCCCAGCCGTATCGTATCTTGATTTTTGAAATTTTTCCGTATCGGCGTATCGGTGCAGTATCGTATCCGTATCGTATCTCGTATCCGGGCTTCACAGCAAGAGTCACATGGTGTGGTGCATTCATTGCAAGGGACACTTGGTGTTTTGCATGAAAGGCAAATGACAAGTGTCTTCATGCATGTAAGGCAACTTTAGCTTGATTCTCTTATAGATAGGAGCATGAGCAAGAATTCACAACACACCACAAGAGAATAACAACAAAAAACAAGAGAGAAAGAGAGCCTTGCTCAAAAGAGAAAAGTGATTCTCATAAAGAGTTTCTTGTGAATAGTGACCCAATTTGTGAGAGTGTTCCTTACAAATTGAGAGATATATTGTAGTGAGATCCACCTAGGGGTGAAGTTGTTGTATTTCCATTATCTTTATAGTGGAAGTTTAAGAGGCTCAAAAATCCTGTGGTTTTTTTCTTTCTCATATTGAGAAGGTTTTCCACGCTAAAATTCTATGTGTCATTACGCTTATATTTTATGTTCCCGCTAGTGCCTATTTTGAGTTCACGGATGAGGGAAATTATGGAGGTTATTTCCCAACAACTTTGTCCATCTATTTCTTTCAGTAGAGTTAGATTTTCAATCTACCAATTCATTTAGATCATCCACATATAAGTCATTAAAATCTGGAACAGTGAAACATGACTTGAACGAGGTTGGCCCGTCAATTTTTTCTTTTATTTAATCAACAACTTTTTATTGTATGTATGATTTCAATAACTTATTTTTAAACAAAAACAAGATTAATGCATTTCATATAAAATAAGAGTTTCAGTGTATGTTGGAATGATGGAAACTAACTTTGGATGGAGTTGCCCTAACTAAGTTATGCGAAACTTTATTGGCTTGTCTCCTAACGAACCTAACATGAGAGGTTGCAAGATCTATAAGTAAACGATGACAACAGTCCTTGATGATTGCACCGAAATCTGAGCTGCTGTCCCTCTTTCCATAAATATTATCTACCAACGTTTTCGAATCCACCTCAATTAAAGTCCATATTCCGCAGATGAAAGTTACTTATTTCAATAACTTTTATTCATGTATCCCGTAAAAAAAATTATATTCTGAAGGTGAGAAAAAACACAAGAAGGGGGGTTGAATTGTGTTTTTGGAAAATTCTTCGTTTCTCCTAAAAACTGAACACACTGATCAGAAGCTAAGTGAGACAGCTTCTGAAGTGATAATCAGAATCTAAATGCAGCGGAATAAATCAGAGAGAGAGAAGAAGAACGACACAAAGCAATTATACTGGTTCCTTCCACAATCCGGAAGTAGTCCAGTCCCCCTTGCACTTCCAAGGAGATTTCACTATGAGTAATATCTGATTACAAATGCTCAAGCACACAAGCAAGAGACTTCCAATGCTCAAGCACACAAGCAAGAGACTTCCTTTGCTCAAGAACACAATCAAGAGACTTCCTGCTCAAGCACAACTGCAAGAGACTTCCTAACTAAACAAAATTACACAGAAAATTGTTTAAGGTTGAACACTTGATATACAATCAGAGGTGCTCACAATACAAATCAGATACAGACTCAATGGACTTAAGAATTTCTAAGATATGACTTTGAGACACAGAAATTCTAAGTGAATGCAGAACAATTTCAGCAGAGGTTTGGTACTTGTGAAAATCTTTGTGAGTCTTTCGTTCAGCGTTCTTGCATTACCAATTCTCTAGGTCTTCACTCCTTTATATAGAGGTGTGAAAGAGACGTTGTGAAATTGCCAGCACACCAAAGAGTCGTTGTTGAAGCTTGATCTTTCACCAATGATCTTTTGCCTGATTTGGTTGTTGTCCTATGAAGAACCAATTTGAAATCCATTCCTTATCCAAAGGAACATCTATGCAGGCGAGACCGTATTCTGGAGCTGTTTTCTTGTCTTGAAGTGCAGACACAAAACAGAGTAGTGGAGAACGTGGTTGTACAATTCGTACAAAGTACAATGTCAACGCTTTCGTCCAAATTGCAAACACATGCCTTCAAACACCTTTGAAATGACCGTTGCAAACTCCACTAATCCTTTGTTTGTGTTTTCGAATCCAATAGACAGCTTTTGCATTGATGATTCTTCAATTCTTGATTCTTTGCAAATCCAGTTGATATTCAGCTTCTGGTGAAATGGCTTCTGATGAGGCAGCTTCTGATGATGCAGCTTCTGATGAGATGACTTCTGCAGCTTCTGATGATGCAGCTTCTGATGATGTGACTTCTGGACCTCTTGTCTTCAGGAGCACTATTCTTCAGGAGCAGCTTACTTCAGGAGCGCTTTTTCTTTAGGAGCAAGTTGCCTTCAGGAGCGTCCAAGCTTCAAATTTTTGTTGTTTCCATAGCTCTTTATTTGCTCATTCAGAACCAATGATGATGTAACTTTATAATTATGGTCCTGTACACTTGAACAAATATTAGCTAATCCAATTTACAATTTTTAATACCTTGTTATCATCAAAACTCTTTAAGGTTTATTGTTAAACACATTTTGTTCCAACAATCTCCCCCTTTTTGATGATGACAAACAAAAGTATTTAAAATTGATCAATTGTTGTTAATCTAATTAACAAGTTGACTTTGGGGTTCTGAGGTTTGTAAGCTCAGAATCTGAGGTTTGTAAGCTCAAACTCCCCCTGAGTCTGATAGTCTTTAAGGTGAGGTTTGTAAGCTCCCCCTAAGTTCTATCCAGGTTAATTAAATTCATTTTGCGCGCAATAATTATTATCAGAGCTTAAAAGCAAAACTGAATCAATGAAAGAATTTACTTATAGATAAGAAGAATGGTTCAATTAACTTTACTTGAATTTTGATACTCCTTCGTTCATTCATAGCTTATACCAATAAGGTAAAAAGATAGAAGTTTTTCAAAAATTTCAAATGCCATTAATGGGGCTCAGAGCCTTTAAATACTTGTCACATATACTCCCCCTTTTGTCATTATCAAAAAAAATTAAAATAAATCTAAGCACAAAAGACAAAAAGATATGAGAAAAACTCAAACGTGAGAGAGTATACTAAGCAATGAACTAAATAAATCAGAGCAAAAATCATGTATATTCAATTGTCAGAAGCAGCTAAACAGTCAGAATCAGTAAGCATTCAGAAGCAAAGAGAATTAAAACAGAATGGTTTCAGAAGCAAGGACTAGGAATTCAAAACAAACAACAAACAAAAAGAAAAACACAAGGGTGTGCAACTAAGTGTTGGTTTGATGTTGTTTCTTCATCTCCTCCATTTCTTCAATCTTCTTCTGCAACAGTTCTTCAAGCTGTTGTTGCCTCTGGAGAAGAGCCTGTTCCCTTTCCTTAGCTTCTTGCTGCTGCTGTGCCATCTGTTTCTTCAGAAGCTTACACAACCTTGCTTCTCTGGTCAAGTAATCCGTTTCTAAAAAGAAAGGCTGGTTTGCCAGGAGCAGTGTAGAACTTCTTATTCTGAGACTCTGATTCAGATGCTCCATTAACTTCCTGACAGAAGCTTCCTTGCTTCTGATGCACTTAGCCCTTAAGCTTTCAATTAGCTCAGTTGCCCGGTTCTGAAGTGATGCCCAGGAGTCTTCATATGCACTTGTGAGATCAATTGTCCTTCGTTGATCCACAAGATTAAGCAATTGACTAAGGATTGTCTGAAGCTCAGAATTGATATATCCAGATTCCAGGATGTTTGTGGGTGGATGTTCAAGGATTTCAAGATCAGAATCACTGATTTCTAAGTCTTGAATAGGGTTAGGAACTGGAATGGTAAGGTGTGTTTGTGTGTTAGAAGATGAGGGCTGGTCAGAATCAATGTGTGTGGTGGAGGTTGAGGCTTGAACATTGACTAGAGAGATGTCAACATCCATGGGAGTTGGTTCAGGAGAGTGTTCAGGTAAAGGTTGAAATTCAGTTTCAGATGGTGGTTGTTGTGGTGAAGTTGATGTTTGAGTGTTGAGTTGGGGGTCAATGGTAGCTGTTGGTGATTGTGGGGGTGTTGTGGGTTTATGTGGTTCAGGTGTTTGTTGATGAGGGTTTTCTGAAACAACATTTTCAGAAGCAGCTGCTTCTGGAGTTGTTTCAGAGGCTTTTTCTGAGGTGGATTTGTCTATGGGTAACTCACCTGAGTAATGGAGAGCAAGATTGTCTAAGACAGAAGTGTCTTCTATTGGTTCAGGTACAGATGTTGAAGAAGGTGAAATGGGGATAGAGGTGACTGGAATCTGAATTACCTGATCTGCAGGGCCAGCTTCAGAAGTGATCTCTTGTGGATCAGCAGCTTGAGGAACTACTTCTGAAGTTCCCTCTTCTGGAGCATTTTCAAACATGAGCATCATATCGTCTTCCTCATTGTCTTCAGTGGGCTTACCAGCAATGGCTCTTACTTCAGCCAATTTATCTTTGTAGAACCTATCACAATCTTCCAATCCCAAGGCCTTGAGCTTTTCATCTCTTTGATGAGTGTATTGTGCTTTTAGATCCTTTTTGTCTGCAATAAGTTCAGAAGCACGTTCATCTGCCATCCTTTTCCTTATTGGAGTCATCTCAACCATAGGACTCACAAGTTGAAGGGGAGCCTTCTTCTTCTTCAGCTTGGGATCAGCCTCTTCTTTTTCCATTTCTTCAAGAGCTTCCTCCAGCTGCTCTTGAGTGATTACCAAGTCTGCCTTACCTTTTCTTCTCTTCTGAGGAATAGAAGCAGCACCAATCTTCTGCATCTTTCCTTTATCCTTGATAGAAACAGGTTTAGCTTCAGACTTAGCAGCTTTAGCAATAGCAGCTTCTGACTCAGTACTGGTTCTGGCCTTCTTTCCTTTAGTAGGAGCAGCATCAGATTCAGAACTAGTTCTGGCCCTTTTCCCTTTAGGTGCATCAGAAACAGCAGCTTCTGACACAGCAACTGCTTTGGCCTTCTTGCCTTTTCCTTTAGCAAGAGCAGCTTCCTCCTGAGACCTCGCATCTTCCTCCAAGACGTACTCAGCTACTAATCTAGCTAAGTGTTCAGGATTACTTTCCTTAACTATGGAAGGAAAGTCTCTGATAATCTCAGATTCTTCTGTGGTCAACTCCAACCATTTCTCATCTTTAGGAGCTTCTTTGAAGAACTTCATAGAGTGTAAGCTCCTGCTACTGATAGTCCTTTCAGCAGTAGAGACTTTGAAACATTTATCAGAAGCACTCCTTGTCCTTTTTAAGAACTCCAGCACCCTTCCTTGATGAAAGATTTCAGACAGCAATCTTCCATAGGGTACTTGTTTTCTTCCTCTGATAGTGCTTTCATTTATGGCCCAGCAGAGATGATCCATTACATATCTTGGGAAGTTTATCTTTTCCAAAATAGCCATGAAGTAAATTGCTAGCTTCTGTTGATTGTTGGGTTGGTCAGATCCTCCTGCTCTTTGAAAGAAGCAATCATTACTGAATTTCACCAACATCCTGTTGTGAATCTCCATTTCAGAAGGCTTTGCATCCTTCTTCCCACCCAAAAGAGTGTTGTAGTAACCTTGTAGCAGGACACCTTCCTTCCTGACTATTTCTTGGAATCTTCCCTCTGCATGCCTTCTACAAGCTTTAGCTATAACTTCCTCAGTGATGGTTATAGGAATGCCCATCACTTCAGACCTTATTTCTAAGCTGTCAAATGATGTAGGATTCATATCCATGACCATTTTCCCTTTCTTTGACCCAGCTTGTTTGCTTACAACTTTTGCAGGCTTTTCATTGTACACTTCAGCTTTTAACCAAAAGTCCTTGACCAAATTCACATATGTTGGCCCATTTAGCATTCTGAAATACTCAGACAGCTCCTGTGTTTCTATTAAGCCTTCCATGTTTACTTCGTACTTCTTGAGTGAAGTAAAATCCACTGGTGACTCTATTTGGACTATCAAGGATTCCCATGGAATATCCATAGTCTTTTCTTTGTAGATGGACCCAGATTCTGATACGCTTTGCTTGTTTGAAGTAGATTCTGAAACTTTTGAGCTTGATGATGCCATTTGATGATTGACGAAGATTTGAAGAAGATGAAGATTTTTAGGGTTCTAAGGTTGAGAGAATGCGCTTACTTCGCAAACGGTTTGAGAGCAAATGTGTTTGTTGTGAGTGAGTAGTGTGTGAATTGACAGAGTGTTAAAAGTAAAACGATTGCAATTCAAAAGAGACAAACAAACATAACATTAATGACAAAATTGGGGGCGCGTGAAAAGTACTTTCAAAGCTTTTAAATCATCATTGCCCTTTTAGTTATACACCAACACATACAGGCCACGTAAGAAGTTCACAGAGTAACTACCAAGTTAGTGGGCCAGAAGTTACTTTAAAAAGTAACTCCTAACGTTTCCACTAATCCAACTATTCAGAAGCAACAAATCCAGCTTCTGGACTTCAGATGCTGATGTCATCATCAGAAGCGAGTTTTCCTCAGAACCTCAATAAAGAGCTTCTGATGGACTACTGCTTCTGAACTAACTTCAGAACCTAAAATCCTATTCAGGAGCAAGCAAACATTTAATCAGACACGAAGTGCATGTTCAAATTTTTCTTTATAAAATCAAATCTTTCAACAGTTAAAGGCTTAGTAAATATATCAGCCCATTGATTTTCAGTATCAATGAATTGTATATCTAAAATTCCTTTTTGAACATAATCTCTGATAAAATGGTGTTTGATTTCAATGTGCTTAGCTCTTGAATGCAAAATTGGATTTTTAGACAAACAAATAGCAGCAGTATTATCACAATAAATGGGAATACTGTTAGCAGTTATCTGATAATCTTCCAACTGATGTTTCATCCAAAGTAGTTGTGTACAACAACTTGCAGCTGAAATGTATTCTGCTTCTGCTGTAGACATAGCAATAGTTGTTTGTCTTTTGCTAGCCCAGGATATAAGATTATCACCCAGAAATTGACAATTGCCACTGGTTGATTTTCTTTCAATCCTATCACCAGCATAATCAGCATCACAGAATCCAATTAGCTTATAATCTAAGGATTTCCTATAAAGGAGTCCAAGATTAGTTGTTCCCTTCAAATACCTGAAGATTCTCTTGACAGCAGTCATATGAGATTCTCTAGGATCTGATTGGAATCTTGCACACAAGCATACACTGAATAAAATATCTGGCCTAGATGCAGTGAGGTATAACAAAGAACCAATCATACCTCTATACAGCTTCTGATCTACTTTAGCTCCTTCATCTTCTTTGCTCAGGTTGCACGTAGGATGCATTGGAGTGTCCATCACTTTACAATCTTCAAGTTTGAACTTCTTCAGAAGCTCCTTTGTGTACTTTGATTGATGAACATAGACTCCATCTTTGCATTGATTGATTTGAATGCCAAGGAAGAACTTCAGCTCTCCCATCATGCTCATTTCAAATTCATCCTGCATTAACTCAGAAAATTCCTTGCAAAGAGATGCATTAGTAGAACCAAATATTATATCATCAACATATATTTGCACAACAAGAATATCTTTCCCAAGGGTCCTTCTGAAGAGTGTAGTGTCAACCTGTCCTCTCTTAAAATCATTTTTGATTAAGAAATTACTTAGTCTATCATACCAAGCTCTGGGAGCTTGTTTCAAGCCATATAGTGATTTCTTCAATTTAAAAACATGGTCAGGATATTTAATATCCTCAAATCCAGGAGGTTGCTTAACATACACTTCTTCTTCTATGACACCATTGAGAAATGCACTCTTGACATCCATCTGATATAATATTATGCCATGATTAACTGCATAGGAAAGAAGTAACCTGATTGCTTCCAATCTTGCAACTGGAGCAAACGTTTCAGTGTAATCAATGCCTTCTTGTTGACTATACCCTTGTGCAACAAGTCTGGCTTTGTTTCTTGTTACTTCACCTTGCTCATTCAGCTTGTTTCTGAATACCCATTTTGTTCCAATAATGTTCTTGTGCTGAGGTTTGGGTACCAGATCCCACACATCATTTCTTTGAAATTGATTCAACTCTTCTTGCATAGCTAGGATCCATCCATCATCTGAGAGTGCTTCATCAACTGTTGTAGGTTCAATCATTGAGATTAGTCCAATCATTGACTCTTCTTGTCTGAAATGTGATCTTGTTCTTCTTGGACTATCTTTGTTTCCAATGATTAGCTCCTCAGGATGTGAAGACTTGTGTTTGAATGTGTTTCTGGGAGGATCTTCCTGATGTGAGCCATCTTGTGCTTTATCAGAAGCAGTTTCATCTTGCGCTTCTGATGTGGGTTCAGCTTCTGGACTATCTTCAGACTCAGCTATCTGATCAGGTTCTGGAGTAGCTTCAGGAACCTGTATCTCTGCAAAACTTTCACCCTGCTTTGAAGTTTTATCTTCAAGCTCCTTGTCATCAAATTTAACGTGCATAGATTCTTCAACACAGAGTGTTTCTGAATTATACACTCTGTATGCCTTTGACCTTTCAGAGTAACCTAAAAAGATTCCTCTTTGGGCCTTGGCATCAAATTTCTTTAGGTAGAGCTTGTTGTTCAGAATGTAACAAGTACATCCAAATTGATGAAAGTAAGAGATATTGGGTTTTCTTCCTTTAAAGAGTTCATATGCTGTTTTCTCCAAAATAGGTCTAATATAGATCCTGTTTTGAACATAGCATGCTGTATTCACTGCTTCTGCCCAAAAATGTTTTGGTAAATGGTTTTCATGAATCATGGTTCTGGCCATTTCTTGTAAAGTTCTATTCTTTCTCTCTACAACTCCATTTTGTTGTGGTGTTCTAGGAGAAGAGAATTCATGGAAAATTCCATGTTTTTCACAAAAAGCTTCAAATGGCTCATTTTCAAATTCTCCACCATGATCACTTCTGACTTTCAAAATTTTTGATTCTTTTTCAGATTGAATTTGAGTGCAGAAGCTACTGAATACATCACACGCAACATCTTTACTCTTTATGAATTTCACCCAAGTCCATCTGCTGTAATCATCAACAATGACTAATCCATATCTACTTCCATATAAGGATGCAGTACTTACTGGTCCAAATAAATCAATATGGAGTAATTCTAAAGGTCTTGAGGTTGAGACAATGTCTTTGGTTTTAAAAGATGATTTCACAATTTTCCCTCTCTGACATGCACCACAAAGTGCATCTGAATGATAGTCAATGTCTGGTAGTCCTTTCACAAGTTGTAACTTACTAAGCTTAGAGATTAACCTCCAGTTAGCATGTCCCAACCTTCTGTGCCACATCCATTTCTTATCATTCACTGATAGTAGGCAGATCACCTTCTGATCAGCCAGTTCAGAAAAATTGATTTTATAGACATTGTCAACTCTCTTTCCCTTGAACACAATGGATTTGTCTTTCTTGTTGATTACTGTACAGTTGGGTTTTTCAAAAAGCACATCATAACCATTATCACAAAATTGACTTATGCTCAAAAGGTTATGTTCAAGACCATCTACCAACCATACATTATTAATTGAGGTAGAGGAATTACCAATAGTTCCTGTACCTATGATCCTGCCTGACTGGTTGCCTCCAAATTTCACATTTCCTCCCTCTTTCATTGTTAGGCTGAGGAACATGGACTTGTCTCCTGTCATGTGTCTTGAGCAACCGCTGTCTAGGTACCATGATCTTTGCTTTGGCACTGCTCTTAGACATACCTCATTTTCATCTTCTGAGTCTGAATCTGATCCAGCTTCTGAAGTTGCTGTTGCTACCAACCCAACAGCTACCTTTGCATCTTCATCAGCTTCCTCTTTATCAGATCCAGATTCACTACCTAAATCATCCCAGGTTGCCATTAAACTCTGCTTAATCTGCTTCTTGAGTTTGCTTGGTTTGAAGGTAGACCTCTTTGACTTGTCCTTTGACTTTTCTCTTTGAAGATCAGGGCATTCAGCAATGAAGTGACCTGGCTTCTTACAGTTGAAGCATCCCTTCTGATCTTCTTTCCTTGAGCTTCTGGAGTTACCCCTTCTGGAGAGAAACTTTCTGTTCTTCTTGGCCAGATATTGAAGTCTGTTTGAGAGCATGGCCATTTCTGCAGCATCTGGTTCAGAATCTGAACTTGACTGTTCTTCCTCTTCAGATTCAACCACTTTGAGAGCTTTTGATGACTTGCCTTTGGATGGAAGAGCTATTGATTTACTCTTCTTAGTAGACTCATGTTCATTAAGACTTAACTCATGAACTTTGAGAGAACTAACAAGATCTTCGACACTTAAAGTGTTTAAATCTTTAGCTTCCTCAATGGCAGTCACTTTTGGTCTCCATCTGGCAGGTAAGCTTCTGAGTATCTTGCTTACATGATCAGAAGCAACATAGCTCTTCTTTAGAATCTGTAGACCAGAAACTAAGGTTTGAAACCTTGAGTACATCTCCTCAATAGTTTCATCATCCTTCATTTTGAATAACTCATACTGATGTACAAGCAGGAGAGCCTTTGCTTCTTTGACCTTCTTGCTCCCTTCATAATTTGAACAGAGAGAAGCAAACATAGCTTTAGCAGTGGATTTGTCACTCATCTTCATGTATTCTGCTTTGGGTATGGCAGTGACAAGAGCTCCTCTGATAATATGATGCTTCTTGTAAAGTTTCCTCAGTTCTGGAGTATGTTTCTTTCTATCCACAGCAGCTCCTTCTTCATCAAGGACTAAACTACCAACGCCGTCTTCCAGAATATCCCACAATTCTTCATCTAGGCTCATGATGTAACTGTAAAAATTGGTCTTCCACCAAGAGAACTCTTCAGCATCTCCATTGAACTTTGGAATTTTTCCAACACTGGTTTTCTTACTATCACTATAATCATGTGAGACATTTTTATAGCTAGTATCACCATCAGGAGGAGGAGTAGTCATGTTTTTCTCAGATCTTTGACTGTTGCACGGTTAAGTGATACACAACAGACCAAGGCTCTGATACCAATTGAAGGTGAGAAAAAACACAAGAAGGGGGGTTGAATTGTGTTTTTGGAAAATTCTTCGTTTCTCCTAAAAACTGAACACACTGATCAGAAGCTAAGTGAGACAGCTTCTGAAGTGATAATCAGAATCTAAATGCAGCGGAATAAATCAGAGAGAGAGAAGAAGAACGACACAAAGCAATTATACTGGTTCCTTCCACAATCCGGAAGTAGTCCAGTCCCCCTTGCACTTCCAAGGAGATTTCACTATGAGTAATATCTGATTACAAATGCTCAAGCACACAAGCAAGAGACTTCCAATGCTCAAGCACACAAGCAAGAGACTTCCTTTGCTCAAGAACACAATCAAGAGACTTCCTGCTCAAGCACAACTGCAAGAGACTTCCTAACTAAACAAAATTACACAGAAAATTGTTTAAGGTTGAACACTTGATATACAATCAGAGGTGCTCACAATACAAATCAGATACAGACTCAATGGACTTAAGAATTTCTAAGATATGACTTTGAGACACAGAAATTCTAAGTGAATGCAGAACAATTTCAGCAGAGGTTTGGTACTTGTGAAAATCTTTGTGAGTCTTTCGTTCAGCGTTCTTGCATTACCAATTCTCTAGGTCTTCACTCCTTTATATAGAGGTGTGAAAGAGACGTTGTGAAATTGCCAGCACACCAAAGAGTCGTTGTTGAAGCTTGATCTTTCACCAATGATCTTTTGCCTGATTTGGTTGTTGTCCTATGAAGAACCAATTTGAAATCCATTCCTTATCCAAAGGAACATCTATGCAGGCGAGACCGTATTCTGGAGCTGTTTTCTTGTCTTGAAGTGCAGACACAAAACAGAGTAGTGGAGAACGTGGTTGTACAATTCGTACAAAGTACAATGTCAACGCTTTCGTCCAAATTGCAAACACATGCCTTCAAACACCTTTGAAATGACCGTTGCAAACTCCACTAATCCTTTGTTTGTGTTTTCGAATCCAATAGACAGCTTTTGCATTGATGATTCTTCAATTCTTGATTCTTTGCAAATCCAGTTGATATTCAGCTTCTGGTGAAATGGCTTCTGATGAGGCAGCTTCTGATGATGCAGCTTCTGATGAGATGACTTCTGCAGCTTCTGATGATGCAGCTTCTGATGATGTGACTTCTGGACCTCTTGTCTTCAGGAGCACTATTCTTCAGGAGCAGCTTACTTCAGGAGCGCTTTTTCTTTAGGAGCAAGTTGCCTTCAGGAGCGTCCAAGCTTCAAATTTTTGTTGTTTCCATAGCTCTTTATTTGCTCATTCAGAACCAATGATGATGTAACTTTATAATTATGGTCCTGTACACTTGAACAAATATTAGCTAATCCAATTTACAATTTTTAATACCTTGTTATCATCAAAACTCTTTAAGGTTTATTGTTAAACACATTTTGTTCCAACATATTCATGTATAACAAGTTAATGCATGTTTTCTGTCCAAAAAAAAAAAAAAGTTAATGCTTGTTTCATATGCAGCCCTTTGGATTACTCCCTGTTATTCAAGATGGTGACTATACTCTCTATGGTTAGCTCCTTCACTTGTTTCTATACTGTATTCCATTTAGCTAAGCATAAATAATTTTTATGTACATTATGCTGCTGGATATACCTGTATTCATGGGTAGATTTATCATTGTTGCATTCAGATTAAACAAATCACAACAATAGTAGTTTTTTCTTGAAAAACCAAAACTTCGAGTTTGATATTCAGACTATCTTGATCAAACGGTCATCAATATGCTGACAACGTTAATTCATGGTATCTGATAGTAGGAAATTATGATCAAGATTTAATCTATGTTCTTTTAATATTGCAGAATCTAGGGCAATCCTAAGGTACTATGCTGAAAAGTACAAAAATCAAGGGACTGATTTGCTTGGAAAGACAATAGAAGAAAGGGGTCTTGTGGAGCAATGGCTAGAAGTTGAAGCCCATAACTTCAACCCACCAGTCTACAACTTGGTTATGAATCTTTTGGTTTACCCACTAGTGGGACTTCCATCTGACCAAAAAGTTGTTCAAGAGAGTGAAGAAAAGCTTGGAAAGGTGTTGGATATTTATGAGGAGAGGCTATCAAAGACTAAGTACTTGGCTGGTGATTTCTTTAGCCTTGCTGATCTTAGCCATCTTCCATTTGGTCATTATTTGATGAATTCAATGGGAAAAGAGAATATGGTAAAGGAGAGGAAGCATGTGAATGCTTGGTGGGATGATATTAGCAATAGGCCATCTTGGAAGAAGGTTCTACAGCTTTATAAATACCCTATGTGA

Protein sequence

>MS.gene22188.t1

MVVKVYGPDYACPKRVIVCLIEKDIEFETVHVDGFKGEHKEAEYLKLQPFGLLPVIQDGDYTLYESRAILRYYAEKYKNQGTDLLGKTIEERGLVEQWLEVEAHNFNPPVYNLVMNLLVYPLVGLPSDQKVVQESEEKLGKVLDIYEERLSKTKYLAGDFFSLADLSHLPFGHYLMNSMGKENMVKERKHVNAWWDDISNRPSWKKVLQLYKYPM