AlfalfaGEDB Alfalfa Gene Editing Database

M. sativa cultivar XinJiangDaYe / MS.gene33593


Query id Subject id identity % alignment length mismatches gap openings q. start q. end s. start s. end e-value bit score
MS.gene33593.t1 MTR_1g030300 96.694 484 12 3 1 483 1 481 0.0 964
MS.gene33593.t1 MTR_5g012030 38.739 111 67 1 182 292 1 110 1.26e-17 86.3
MS.gene33593.t1 MTR_5g012030 30.973 113 77 1 181 293 390 501 1.33e-11 67.4
MS.gene33593.t1 MTR_3g452710 34.711 121 78 1 183 303 11 130 2.17e-17 85.1
MS.gene33593.t1 MTR_3g452710 34.711 121 78 1 183 303 11 130 2.37e-17 85.1
MS.gene33593.t1 MTR_5g063200 39.326 89 54 0 184 272 212 300 2.82e-14 75.9
MS.gene33593.t1 MTR_8g107280 41.228 114 63 2 183 293 479 591 5.72e-14 74.7
MS.gene33593.t1 MTR_3g087480 31.944 144 94 1 170 309 205 348 2.43e-13 72.8
MS.gene33593.t1 MTR_3g074230 38.202 89 55 0 184 272 192 280 8.27e-13 71.2
MS.gene33593.t1 MTR_3g074230 38.202 89 55 0 184 272 192 280 9.76e-13 70.9
MS.gene33593.t1 MTR_1g084100 37.615 109 64 3 183 290 91 196 9.37e-12 67.4
MS.gene33593.t1 MTR_2g078020 35.644 101 65 0 184 284 222 322 1.91e-11 66.6
MS.gene33593.t1 MTR_2g078020 35.644 101 65 0 184 284 222 322 2.43e-11 66.6
MS.gene33593.t1 MTR_4g094205 34.091 132 77 3 164 286 460 590 3.15e-11 66.2
MS.gene33593.t1 MTR_1g041435 30.657 137 93 2 185 321 13 147 3.45e-11 64.7
Query id Subject id identity % alignment length mismatches gap openings q. start q. end s. start s. end e-value bit score
MS.gene33593.t1 AT4G08320 50.600 417 177 8 5 408 3 403 3.87e-130 384
MS.gene33593.t1 AT4G08320 51.079 417 176 9 5 408 3 404 6.84e-126 373
MS.gene33593.t1 AT2G42810 39.496 119 71 1 183 301 13 130 1.40e-19 92.0
MS.gene33593.t1 AT2G42810 39.496 119 71 1 183 301 13 130 1.42e-19 91.7
MS.gene33593.t1 AT2G42810 39.496 119 71 1 183 301 13 130 1.42e-19 91.7
MS.gene33593.t1 AT2G42810 39.496 119 71 1 183 301 13 130 1.42e-19 91.7
MS.gene33593.t1 AT2G42810 39.496 119 71 1 183 301 13 130 1.47e-19 92.0
MS.gene33593.t1 AT4G12400 38.739 111 67 1 182 292 1 110 3.92e-18 87.8
MS.gene33593.t1 AT4G12400 38.739 111 67 1 182 292 1 110 4.55e-18 87.4
MS.gene33593.t1 AT1G12270 34.426 122 79 1 182 303 1 121 1.30e-14 76.6
MS.gene33593.t1 AT1G62740 32.432 111 74 1 182 292 1 110 5.34e-14 74.7
MS.gene33593.t1 AT1G62740 31.858 113 76 1 181 293 380 491 3.44e-12 68.9
MS.gene33593.t1 AT1G04190 31.884 138 92 2 184 321 16 151 6.87e-14 73.2
MS.gene33593.t1 AT3G17970 40.385 104 61 1 183 286 474 576 2.54e-13 72.8
MS.gene33593.t1 AT5G09420 37.607 117 68 2 183 299 488 599 2.74e-13 72.8
MS.gene33593.t1 AT2G42580 39.080 87 53 0 184 270 221 307 1.55e-11 67.4
MS.gene33593.t1 AT5G12430 32.222 180 98 8 152 309 845 1022 7.81e-11 65.1

Find 120 sgRNAs with CRISPR-Local

Find 0 sgRNAs with CRISPR-GE


CRISPR-Local

CRISPR-Local
sgRNA_sequence on_target_score Position Region
CTGTATGCCTTACTGTAATT+TGG 0.182831 1.1:+15959018 None:intergenic
ATCCCCTTGTATAATGGTTT+TGG 0.231510 1.1:+15957596 None:intergenic
GCAGTTCAATTTGAAGAATT+TGG 0.238221 1.1:-15960482 MS.gene33593:CDS
GCAAGATTGCTGGGGTTAAA+TGG 0.251602 1.1:+15957535 None:intergenic
AGTCTGTTAAAGAAAATATA+AGG 0.252895 1.1:-15958458 MS.gene33593:CDS
TGGACAATTCTTTGCTGTTC+TGG 0.271414 1.1:-15961339 MS.gene33593:CDS
TACAGAGATGCTATTGATAA+AGG 0.286696 1.1:-15958961 MS.gene33593:CDS
GGCGGCTGCAAATGCGGGTC+AGG 0.290209 1.1:-15957500 MS.gene33593:CDS
TTCTTGATCCCCTTGTATAA+TGG 0.297724 1.1:+15957590 None:intergenic
GGCTTCGTTATCTTCACTAT+TGG 0.331525 1.1:+15957273 None:intergenic
CATCAGCAGCAGCAGCATTC+TGG 0.332689 1.1:+15964906 None:intergenic
CATGCAGCTCCGGCTTCATT+TGG 0.338679 1.1:-15957565 MS.gene33593:CDS
ATCAGCAGCAGCAGCATTCT+GGG 0.339077 1.1:+15964907 None:intergenic
ATGAACCGGAAATTCGATTT+GGG 0.344456 1.1:-15957429 MS.gene33593:CDS
TTCATTAGCTCCACTACTAT+TGG 0.357311 1.1:+15957447 None:intergenic
AATGAACCGGAAATTCGATT+TGG 0.360726 1.1:-15957430 MS.gene33593:CDS
TGAGGTAGCTAGGGAGTGTT+TGG 0.366540 1.1:-15965071 MS.gene33593:CDS
ACACAGGACCCTCACGCTTC+TGG 0.369748 1.1:-15961960 MS.gene33593:intron
GTTCCTGAGGAATTTGATCT+TGG 0.372280 1.1:+15957389 None:intergenic
GGGTCAGGGGTCACATTCTC+AGG 0.379304 1.1:-15957485 MS.gene33593:CDS
TATCTTTGCAGCCTTGCAAT+TGG 0.381089 1.1:-15958493 MS.gene33593:intron
AATGATTAGATAGCTTAGTT+TGG 0.384520 1.1:+15957106 None:intergenic
AACAACTCAGCTTCTATAAC+TGG 0.389553 1.1:-15965031 MS.gene33593:CDS
AACGAAGCCGATATTCAATT+CGG 0.400107 1.1:-15957259 MS.gene33593:CDS
AATGTGATGGAGATGTTATC+AGG 0.401146 1.1:-15957178 MS.gene33593:CDS
GTCATCACCTCCATCAATAT+TGG 0.402020 1.1:+15961295 None:intergenic
AATATATCAATCAATGAATC+AGG 0.404527 1.1:+15965004 None:intergenic
ATCAAGAAGCCATGCAGCTC+CGG 0.413860 1.1:-15957575 MS.gene33593:CDS
ACGAAGCCGATATTCAATTC+GGG 0.415294 1.1:-15957258 MS.gene33593:CDS
CAATCTTGCAAGCATGTTCA+TGG 0.420391 1.1:-15957521 MS.gene33593:CDS
CGATAGATCTAAGAGAATCT+TGG 0.425389 1.1:+15959049 None:intergenic
ACAGAGATGCTATTGATAAA+GGG 0.425713 1.1:-15958960 MS.gene33593:CDS
GCGGCTGCAAATGCGGGTCA+GGG 0.428666 1.1:-15957499 MS.gene33593:CDS
GTTCATGGCGGCTGCAAATG+CGG 0.429486 1.1:-15957506 MS.gene33593:CDS
CAGTAAGGCATACAGTCGTC+TGG 0.431946 1.1:-15959010 MS.gene33593:CDS
ACGAACCTTGGTTAAGGAAA+TGG 0.437918 1.1:+15965341 None:intergenic
GAACATGCTTGCAAGATTGC+TGG 0.438892 1.1:+15957525 None:intergenic
GTCTTCATTAGGATGAAGAT+TGG 0.440511 1.1:-15961983 MS.gene33593:intron
TTCTTTAACAGACTCATTGT+TGG 0.444055 1.1:+15958467 None:intergenic
TTCTATGAAAGCTTCAAGTT+CGG 0.445568 1.1:-15964936 MS.gene33593:CDS
GTGCTGTTTACTACTGCAAC+AGG 0.445785 1.1:-15960089 MS.gene33593:intron
GCATCTCGCTTATTTGATGA+TGG 0.449447 1.1:-15961254 MS.gene33593:CDS
TCTTTAACAGACTCATTGTT+GGG 0.449540 1.1:+15958468 None:intergenic
GTCTGTTAAAGAAAATATAA+GGG 0.450970 1.1:-15958457 MS.gene33593:intron
CTGAAGGAATTGAGGTAGCT+AGG 0.451124 1.1:-15965081 MS.gene33593:CDS
TATTTCAGGACCAATATTGA+TGG 0.453614 1.1:-15961305 MS.gene33593:CDS
AAGGATACGAACCTTGGTTA+AGG 0.454831 1.1:+15965335 None:intergenic
TTCATGGCGGCTGCAAATGC+GGG 0.459741 1.1:-15957505 MS.gene33593:CDS
TTATCAGGGAATGCACCTCC+TGG 0.482324 1.1:-15957163 MS.gene33593:CDS
CTTTCATTCGTTTGATCATG+TGG 0.485673 1.1:+15957136 None:intergenic
ATTTGATCATGCGGTTGCCC+AGG 0.488809 1.1:+15957316 None:intergenic
AGGTGATGACATAGTGCAAC+TGG 0.493757 1.1:-15961282 MS.gene33593:CDS
AATTGCATCAAAGTACTGCT+TGG 0.501271 1.1:+15960149 None:intergenic
TAGTAGTGGAGCTAATGAAC+CGG 0.501612 1.1:-15957443 MS.gene33593:CDS
GCGATGAGAGAGAGGGGAAT+CGG 0.504892 1.1:+15965381 None:intergenic
CATATGCATCACGGATTGAA+AGG 0.505105 1.1:+15957357 None:intergenic
ACCATAGGTTTCGATTCCTC+AGG 0.510798 1.1:+15964872 None:intergenic
GGCATTGAGCCAAATGAAGC+CGG 0.511221 1.1:+15957556 None:intergenic
TCCTGAGGAATCGAAACCTA+TGG 0.511320 1.1:-15964873 MS.gene33593:intron
AACATGCTTGCAAGATTGCT+GGG 0.515017 1.1:+15957526 None:intergenic
CAACCGCATGATCAAATGAA+TGG 0.519572 1.1:-15957310 MS.gene33593:CDS
ACATCGTGCAGATCACAATC+AGG 0.521432 1.1:-15958313 MS.gene33593:intron
ACAACCAATGTGAAAGAAGT+AGG 0.522321 1.1:-15964964 MS.gene33593:CDS
TTCCTCCATTTCCTTAACCA+AGG 0.522872 1.1:-15965346 MS.gene33593:intron
CTTCCATTCATTTGATCATG+CGG 0.523198 1.1:+15957307 None:intergenic
TGAGAGAGAGGGGAATCGGT+GGG 0.523880 1.1:+15965385 None:intergenic
CTAACCTACTTCTTTCACAT+TGG 0.525902 1.1:+15964960 None:intergenic
CGGCTGCAAATGCGGGTCAG+GGG 0.529279 1.1:-15957498 MS.gene33593:CDS
ATGAGAGAGAGGGGAATCGG+TGG 0.535678 1.1:+15965384 None:intergenic
CAAATTCCTCAGGAACTAAG+AGG 0.537501 1.1:-15957382 MS.gene33593:CDS
GTTTGATCATGTGGTTGCCC+AGG 0.546649 1.1:+15957145 None:intergenic
AATTCCAAAACCATTATACA+AGG 0.552101 1.1:-15957600 MS.gene33593:CDS
CGAACTATGCGATGAGAGAG+AGG 0.557755 1.1:+15965373 None:intergenic
AAATTCTGATCAAATCCCTC+AGG 0.558105 1.1:-15957221 MS.gene33593:CDS
GGTAGCTGAGCATAAACTGA+TGG 0.558424 1.1:-15958343 MS.gene33593:CDS
GATTGATCCAAATTACAGTA+AGG 0.561772 1.1:-15959025 MS.gene33593:CDS
AAACCAAGATCAAATTCCTC+AGG 0.562996 1.1:-15957392 MS.gene33593:CDS
CAAGAAGATGCCAATAGTAG+TGG 0.563887 1.1:-15957457 MS.gene33593:CDS
TTTCAGGGAATGCACCTCCT+GGG 0.570365 1.1:-15957333 MS.gene33593:CDS
TGAACCGGAAATTCGATTTG+GGG 0.574309 1.1:-15957428 MS.gene33593:CDS
TGAAGGAATTGAGGTAGCTA+GGG 0.576242 1.1:-15965080 MS.gene33593:CDS
AGTAAGGCATACAGTCGTCT+GGG 0.585202 1.1:-15959009 MS.gene33593:CDS
CAAATGAAGCCGGAGCTGCA+TGG 0.592124 1.1:+15957566 None:intergenic
TATCAGGGAATGCACCTCCT+GGG 0.596121 1.1:-15957162 MS.gene33593:CDS
GGTTTAGCTTATTATGCACA+AGG 0.596547 1.1:-15958988 MS.gene33593:CDS
ATGTGATGGAGATGTTATCA+GGG 0.598519 1.1:-15957177 MS.gene33593:CDS
CGAAGCCGATATTCAATTCG+GGG 0.601355 1.1:-15957257 MS.gene33593:CDS
TGATCATGCGGTTGCCCAGG+AGG 0.603190 1.1:+15957319 None:intergenic
TCAGCATCAACACCAGGACT+AGG 0.606015 1.1:+15965100 None:intergenic
TTCCAAAACCATTATACAAG+GGG 0.607099 1.1:-15957598 MS.gene33593:CDS
TGGTGTCTTTCAAAATGTGA+TGG 0.607325 1.1:-15957191 MS.gene33593:CDS
CAAATCCCTCAGGAACTAAG+TGG 0.608012 1.1:-15957211 MS.gene33593:CDS
ATTCCTTCAGCATCAACACC+AGG 0.610682 1.1:+15965094 None:intergenic
TTCAGGACCAATATTGATGG+AGG 0.612925 1.1:-15961302 MS.gene33593:CDS
GGAATAGTTGAACCTAGTCC+TGG 0.614900 1.1:-15965112 MS.gene33593:intron
GAACTATGCGATGAGAGAGA+GGG 0.615684 1.1:+15965374 None:intergenic
CCCTGAAAACATATGCATCA+CGG 0.617541 1.1:+15957348 None:intergenic
ATTGTTGGGATCCAATTGCA+AGG 0.618606 1.1:+15958482 None:intergenic
AACCTTGGTTAAGGAAATGG+AGG 0.618743 1.1:+15965344 None:intergenic
AAGGCACCTCTTAGTTCCTG+AGG 0.622185 1.1:+15957376 None:intergenic
ATTCCAAAACCATTATACAA+GGG 0.625931 1.1:-15957599 MS.gene33593:CDS
CAGGGGTCACATTCTCAGGA+AGG 0.630548 1.1:-15957481 MS.gene33593:CDS
TGTTGATGCTGAAGGAATTG+AGG 0.634452 1.1:-15965089 MS.gene33593:CDS
GAACCGGAAATTCGATTTGG+GGG 0.635777 1.1:-15957427 MS.gene33593:CDS
AGACACCACTTAGTTCCTGA+GGG 0.637831 1.1:+15957206 None:intergenic
AGTCCTGGTGTTGATGCTGA+AGG 0.643324 1.1:-15965097 MS.gene33593:CDS
TGATCATGTGGTTGCCCAGG+AGG 0.644291 1.1:+15957148 None:intergenic
GTGTCTAAAGATGAACTATG+TGG 0.647515 1.1:-15961359 MS.gene33593:CDS
AAGACACCACTTAGTTCCTG+AGG 0.647601 1.1:+15957205 None:intergenic
TTTGCCTACCAGAAGCGTGA+GGG 0.649290 1.1:+15961952 None:intergenic
GAGAGAGAGGGGAATCGGTG+GGG 0.653558 1.1:+15965386 None:intergenic
TCAAATTAACAGATATACAG+AGG 0.654051 1.1:-15959076 MS.gene33593:CDS
ATTTGCCTACCAGAAGCGTG+AGG 0.655245 1.1:+15961951 None:intergenic
AACTATGCGATGAGAGAGAG+GGG 0.655686 1.1:+15965375 None:intergenic
GGGGATGCGATTGTGAGACA+TGG 0.664819 1.1:+15965405 None:intergenic
TTAGGATGAAGATTGGACAC+AGG 0.666066 1.1:-15961976 MS.gene33593:intron
TGATGCTAAAACTCGTCCTG+AGG 0.690915 1.1:-15964888 MS.gene33593:CDS
TCTTGCAAGCATGTTCATGG+CGG 0.708564 1.1:-15957518 MS.gene33593:CDS
GAAGCCGATATTCAATTCGG+GGG 0.723079 1.1:-15957256 MS.gene33593:CDS
ACATGCTTGCAAGATTGCTG+GGG 0.724105 1.1:+15957527 None:intergenic

CRISPR-GE

badsite warning sgRNA_sequence Strand Position Region GC_content


Chromosome Type Strat End Strand Name
chr1.1 gene 15957120 15965425 15957120 ID=MS.gene33593
chr1.1 mRNA 15957120 15965425 15957120 ID=MS.gene33593.t1;Parent=MS.gene33593
chr1.1 exon 15965347 15965425 15965347 ID=MS.gene33593.t1.exon1;Parent=MS.gene33593.t1
chr1.1 CDS 15965347 15965425 15965347 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
chr1.1 exon 15964874 15965127 15964874 ID=MS.gene33593.t1.exon2;Parent=MS.gene33593.t1
chr1.1 CDS 15964874 15965127 15964874 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
chr1.1 exon 15961961 15961994 15961961 ID=MS.gene33593.t1.exon3;Parent=MS.gene33593.t1
chr1.1 CDS 15961961 15961994 15961961 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
chr1.1 exon 15961247 15961380 15961247 ID=MS.gene33593.t1.exon4;Parent=MS.gene33593.t1
chr1.1 CDS 15961247 15961380 15961247 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
chr1.1 exon 15960461 15960527 15960461 ID=MS.gene33593.t1.exon5;Parent=MS.gene33593.t1
chr1.1 CDS 15960461 15960527 15960461 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
chr1.1 exon 15960090 15960189 15960090 ID=MS.gene33593.t1.exon6;Parent=MS.gene33593.t1
chr1.1 CDS 15960090 15960189 15960090 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
chr1.1 exon 15958950 15959113 15958950 ID=MS.gene33593.t1.exon7;Parent=MS.gene33593.t1
chr1.1 CDS 15958950 15959113 15958950 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
chr1.1 exon 15958458 15958504 15958458 ID=MS.gene33593.t1.exon8;Parent=MS.gene33593.t1
chr1.1 CDS 15958458 15958504 15958458 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
chr1.1 exon 15958314 15958364 15958314 ID=MS.gene33593.t1.exon9;Parent=MS.gene33593.t1
chr1.1 CDS 15958314 15958364 15958314 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
chr1.1 exon 15957120 15957641 15957120 ID=MS.gene33593.t1.exon10;Parent=MS.gene33593.t1
chr1.1 CDS 15957120 15957641 15957120 ID=cds.MS.gene33593.t1;Parent=MS.gene33593.t1
Gene Sequence

>MS.gene33593

ATGTCTCACAATCGCATCCCCACCGATTCCCCTCTCTCTCATCGCATAGTTCGTTCTTTCCTCCATTTCCTTAACCAAGGTTCGTATCCTTTGATTTTTGATTTTGATTTTGATCCATTTTTTGTGTAACACTGTATAGAATTTCGTTGTTTTTTTTTGTTCGTAAAACCTAGCTCAGCTAGTAATGTCGAACCCGAGATATTGAGTTTCAGATGGTGTTTGTCACTTAATTTGATTGATTATTATTTTGATTGTAATGAATGTTAATGATTTTTTTGTTGTTGTTGTTGTGGAATAGTTGAACCTAGTCCTGGTGTTGATGCTGAAGGAATTGAGGTAGCTAGGGAGTGTTTGGTAGAAGCTTTTAAGATTAACAACTCAGCTTCTATAACTGGTGAACCTGATTCATTGATTGATATATTTAAGTCATTTGATGCAAACAACCAATGTGAAAGAAGTAGGTTAGATTCTATGAAAGCTTCAAGTTCGGTTTCTGCCCAGAATGCTGCTGCTGCTGATGCTAAAACTCGTCCTGAGGAATCGAAACCTATGGTTATTTCTCACTTTTCTTTTGTTTTTGTTCAACATTATGCTGATTTACTCATGTTATTCCGTATATTGGTTAGGTTTTGTCTCGAGGGTGGACGTAGGTGACAGATTGATTATCACTCACAACAGTCGATTATACTAAAATCTAATGTTTGTTAGAACAATATTTTAACGATTGTGAGTAATAATCGATCGTGAACCATGGTCTGCCCTAGAAGCTGACGTTGTCTATTTGTTCTTACTGACTCGACTATTTATTTTTATTTTTTTGGTTCATTGATGGATTTGAGTCAATGTGTTTTCCCATACTTGTTTAGTTGGTTAAAACTTGTGCCTATCCTTTTCTCTTGAATCGTTCTTCGTGATTAAAACTGGTGAGCATGCTGTACATAGGTGTAAATATGTATAACTTCTAGTTCTGTATTATTTGATGATTAATGACAATGTAATGGCTATCTCCATAACTATATGTCCAACTAAATTATGGTTAGAAAATGGTTACGTTAAATGTGGAAAAGGACGATTGCAGGAAAAGAAATTGTGTTTAACACGTGCTAATTTTCTAGCGGTTATAGTGGAGCCCTTCAATTGTTAATTAGTGAGAGATTATGTCTATGGATTATGAAAATAAGTGTTGCCTTTGTAGGTATTTAGGATAGCAAAGTTCTTAAAAAGTCAAAACATATATGTATAGGGCACCTTGACATTGTGTCGGCTGTATCCAAGAAAGAAAAAGAGTAGTCACTTGGTGTTTTACCATGTACCAAACCCTTCAATTATGCCATGTCTCGCTTCGCATGTGATTTTTATTCCATTAAATAGTTTCATTGTTGGAATTATAGGAATTCAGGTTTAGCAGGGGGAAAAAAGAGAGCAAAATTTGAATAATCGATTCTAATATTTATATATGGAGTGGACTATATGTTATAAAAGCAATGTTGTCAAATAGCGGCAAATCGGCGACAGCTGCGCTATAGCATAGAGGATTGAAATGAATTTTCTATGAAAACCACGACGCTAACCTAATTTCGCCATTAGGGCAATAGTGGCCGCTATTCCAGTTTCTGCTAGCTGATATAGCGGCAATACTGGCAAAATGAGAATTTTAGGGTTTTTGAAAATAATAAAACTGGAGAATATGCATGTATGCCAAACATGACCTATTATGAAAAATAGAAAAACTAAAGAGTCAGCGTTACTCCTATATTCATTGTAAATTGTAATACGTAAGCCTTCCATATTTAAGCTACGGTCTGGCTCCATTCCCTTTTTTATTTGTATGCAACTAGCACTTTATTATATTGTATATAATTTATAATTGACTTATTTCAACCATTATATGCGGCTTCCACTATTTTCCCGCAACACTATAAGCTGTTTCACAAGGCGGGTTTTTGGCATTTCACTGTTTTCTGCAATCATCTATCAGAATACAATGCTAACTCTATTTGTAGAGTTAAGGCGGGTACAGCTGTATTTTTTTCCCTTTATCTCCGGTTTCGGTTATGAATTAATTAGAATCAATGTATTAATAATTAGCTTCGGTGTATTATTGTCCTTATTAGAATATCCGAGTCTGTACTCTATAAATATTGTCCTTCTCAGCCAAGTTTTCAATTTGGTCTTGCAGTTTTGGGAAAAAAATCTCTTGCTGCTATCTTTCACTGGTTAGAACCCTAGTCACCCTATTAGTTACATTTTCTTAGTCTCTACATCATGTCTGAGGAGGATGTTTCAATCGTTTCTAACGGCAAGGGTGTAATGTTCTCAAAATTAGGTCACGTAGACATTGTTGGTCACACAGATTCTGATTTTACTGGGTCTAAACTAGACAGAAAGTCTGCAACAGGTATTTGTTATTTATTGGGGGCAATATTTTGATTGGAGAAGAAACAAAATGTAACGTCACTGTCTAGTGTAGAATCCAAAGTTGTCAATTGTGAATCGCAGAAATTAATGGTTTGTTCAAATTGCGCTATGCTTTAGTGCTATAGCCACTATTTGACAACACTTGGTACTAAATCGTGTATCACGGAATTATAGTTATTTGTTAAATTTCCGCTATTTGACAACACTTTTAGCTGCTATGTTGTGCCCTTCATCATGCACTGCAAAGCTTATATGGATGAACATTCTTCAACTTGAACTTAGTTTTGGTTCGGAAAACCTATGGCACCTTTTTGTGATGACGCAGAAGCCATTGTAATTACAAACATGATCGGACTAAGCATGTTGAACTTGACGGTAACTACGTCAAGGATAGCTTAAACTTAAATGCTAACAGGATACCCTATATTAAAAGTGCTAATCAACTAGCCGATACGATGACCCATGCAGTCCGTACTGGTCCATTTGATTCAGTTATGTCCAAGTTAGTCTGTGCGATATCTATGCACCAACTTGAGGTATAGAGTTGATATGTACAGCTGTATATTCATTGTTTCCTGATTTGGTTATGAATGCGTAAGAATCAATGAACTGTTAGATTGATTGTATAATTTCCTTCTAGATTACCCAATGAGTCTGTTTATTCTTGAGATGAGATGTGTGGAACGAAGACATTAGGCCAATTTTCAACTGTATTCATACTGGACCATGAATTTTTTTAAATAATGAAACATGGAAACACTTCTAAGATATAATCTATGATACTCTAAAATGTTCCAAGATATAAGAAGGATATCATGAGATTTTTGGTAGGCTTTACTCTCTCACCTTTTAAGCTACTCACAAGTGTATGTCACACTGACGCTGAAGTATTTCATGATTTGTCTACTCTGTTTTTACTGCTCTAGAAGTACATTACTCTATCACATAAATTGTAAATTGTATTGAAATTTTGCTTTTTTGGTTCTGATTATCTCACAATGTGTCTTCATTAGGATGAAGATTGGACACAGGACCCTCACGCTTCTGGTAGGCAAATCTGTTCCCTTTATTTTATTTGAATTTTGAGTGATCATAAATAGTTATGTATGGAAAGAACATTTTCTACCATTTATCTTCTCAAGATTCACACTGAGAGGACATTTCTTGCTGGAAAAGTTTATAGTTTGTAACATTTAAATTCATTTGTTTGTTGTACTATGAGGCTGAGCATGCTCTAGTGTCTGGACCAGTTTGACAAGCATTACTACAAATATTAGTAAATACTGTAAATCGTTTTCTGTTCTTTTGCATGACAAATTTTCTCAAGCTAATATTTTCGTTGGTTGGTGGGTGGTACTTTTGGGGATCAAATTGGTAAAATTTTGTAATGTATTTTGGGAATTTTTTTATTAATATATTATAGTATTATTTATGCCCTATCTAATTGTGAATGATTAAAATTTCAGAATATTGCACATAGATTGATATTTTCACCAGAACTTTTTGCCACAGTAGTTGACTTCCTCTTTTGTTCTAGTCCAGTGATATATTGGATTTAGTTGAATTTGTATGGTATTTAGTAGAAGAGGGTCTATCTTAATATTTTCTTAAATTTTTTTGACAGCAGTGTCTAAAGATGAACTATGTGGACAATTCTTTGCTGTTCTGGAGAAAAAACACTATTTCAGGACCAATATTGATGGAGGTGATGACATAGTGCAACTGGAGAAAGCATCTCGCTTATTTGATGATGGTTTTACGGTATGACACTATATCATGAATATGCATATGTTGCATTTCCAGATGATGTTGGAACTTGAAACATACCTGTTCATTTGCTTCAGCATATGTGACTTGGCTTTTATCATAGTAGTGGTTTAAACTTTTAGAACAACAACAAACAAAAATATTTTCCACTGGGTGCGGTAAGCTACATGTAGAAACTAACAGGCAAAATACAGTTGTGTAATATACGTAATTCTTGTCACTGTCACTGCTTCTTGTCATATATACCCAATTCCTAAATTTCAAATGAGTTTTCTAATTTTAATATTAATAATGGGATAACTATTGATGAAGGTACAAGAAAAACATGGTGTTACATTTTGCTTGTGTGTAAATTTAATCTTTGTAGTAAAACGAGAGGAAATGAATGAACTGTGCCTCTTTTTTGTGTTCAAATTTCTTCAACTTATGTCAGATGCTTGTTTGGTAAATGACTTTCATACTATGAGTGTACCCCTATCCATTTTATTCCATTAGATAAGTCAGTTGGTATCTCAAGTTAGTAAAACTGTTGCTTGTTAGTGTGTGTAATTTCTTGCTGATCTTTCTGTTCAATTGTACATTAGCCAATGCCATGCCACTATCCACTTGGTGGGGTTGACTACTTGAATCAATCAATACTATTGGCCGATGGTTTTATCAAAACCAAATTTTCAGTAATACTGTCTGTTTAAGCAAAATCTGCTTCTATGCAGGAGATGGAAAAATCTGGGTGTGGGCAGTTCAATTTGAAGAATTTGGCTGAATCACTAAAAACATTAGGTATGATAAGTTACTTTGACTTTAAGAATCTTTCTTTTCCCTTCATGAATCGAAAATCTCTCTTTCTCCCTCTCTCTTACATTGTGGACACAACACATAATAGATCTTTTATTGGTTAGAATTAAAGTATGTCAAAATGTCTATATTTGCACTGCTGATTGGTTTGGTTAAATTGATTTGAATTTAGGTCAATTTAAGTTTTGCTCAAGAGTATCTTCATTGATGGATTTTCGCAAGACAATTTAATACTTGAAATTGGGTTTTTACTCAGGTAACAAGGCAATGCAATCCAAGCAGTACTTTGATGCAATTGAGTTGTATAATTGTGCAATTGCGATATATGAAAAGAGTGCTGTTTACTACTGCAACAGGTATTCCACTTGATTCTACTGTTACTCAACTATATTTCACTATACTATTTTTAGTACATCTGCGTGTGTGTAGTTTCTTGTGGAATGTAATTACCGATTGTCTAATAAATGGCTCTGTTTTTATTTTTTTTTAAATATTAAATAAATATAACTAATATCAAATATTAGTTGTGTTGCTTCCACTTTTTTTGTGAGATGCTATGTCATTCTTTAGGGTAATTTTAGGATTTTTTTGGGACGGAGAATTTTATCAAGTATTCTAGTTATGCTGCACCAAATATGTTCACCAAATAATTGTGAGAAACAAATTCTGGAAGGTTCTAAATAACTATAATTTATGGAGAATAGAGAAGAACTAAGCAATACAAAGTTCGAGCTCAAAAAAATAAAATATAATTACACTATGGTCTCATTCATCATCCCCAAGATCAAAGTCACTTAGTTCATCTTTTTCTCCATCCTCTGATGGTCTATGCTCTCCCTCTACCTCTGCATCTCCAACAATCTCTTCAATTTCACTCTCCTTCCGTATAATGATAGATTTGACCTTCTTGTCAACCTTAGCAATTCATTTTCTTGGACTTGTCTTATTAAACAAGAAACACAAAAATGCACATTAGGAGGGAGTGTGTCAAAAACTCTTGTGTTCTAATGCTCTTGATTGACGATTTAACTTGGGATATTCAGAATATTGTTTTATGCTGATTCTGTCTATTTTCCCCTAAAAATACAAAAAAAAAAAAAAAAAAAAAAAAACGAGGTAAAGTGAGTGATGGTAGTTCACATGACCCCATGTATATATGTGTTTCTCATGTTTCAAATGTATCCCTTTTTTTCTTTTTTCTGGTGGCTCTGTTCTGTGCGTACGAATATATTTAAATATTTAGTACATTTTGGAGTGGTCAATTGTCTGGCTCAGAATACCTTAGGACCAGTTTATTTAACTGATGGAGCTGCATTTTTTGATAAATATCAGGGCAGCTGCTTATACTCAAATTAACAGATATACAGAGGCAATCCAAGATTCTCTTAGATCTATCGAGATTGATCCAAATTACAGTAAGGCATACAGTCGTCTGGGTTTAGCTTATTATGCACAAGGAAACTACAGAGATGCTATTGATAAAGGGTTTAAAAAAGGTCAGTGAAGTATCATATTAATTGATTTCTTGTTTTTATAATCTGCTTGACATACTTACTGAGCCTCCTCCTAAGAATGTCGACTTCAATTTCATCTGGTGTTACAAATCAAGGTGAAATGGTGCACACAACCTTATTGTTAATTCAATGTAAATAAAAATTAGTTTTGCTAAAAATTCATCAGCTTCTATTTGTTATTTGAGATTCCTACAATTACAGGTGCATGTTTTAAACGTCGGAGTTGAAAGAAATATTAAAGAATTGATTTCACCAAAAAATTCATTCTGTATTAATCAACATCATAATCAGAGTGTGAAGTATTTAAAACCATTAGTGATTTATTACACTATTATGAGTATCCGATCCAGGCTGGTTGGAATATGGAAATGTTAAATCTGAAAGAACTCAAGTTATTTCACCCTAACATGATTCCGTATCTTTGCAGCCTTGCAATTGGATCCCAACAATGAGTCTGTTAAAGAAAATATAAGGGTTTGTTCTGTTCTGTTTTTTATTTTATTTTGTTTTATTTTCTCTTCTCTTTTTACTTTTTCTATGTGGTCATAATGCTCTTGTTAATCACAGGTAGCTGAGCATAAACTGATGGAAGAGCGACATCGTGCAGATCACAATCAGGTATTATTATCATTTCTTTTAAATTCAATCAGGTATTATACATGCTTTGTGTCAGGAATACATATCATACTAAGAACCTGGGCTCGCTGTTGCAGCGTTTTCACGTAGTTTTACACCGTAGCCGATCTTGGCCTTTAATTCGAGATCAGACAGTTCATATTTGAAACTTTTTTTCAAGTAGTAAAATCATCCCAGCCGCTGATTTGAGATCGGACAGCCCAGATCTGTTACAGCCTCCTGCTGTTAGTTGCAGCAGGAAATCCACTTTGATCATACCAGTCCTAATTAATGGGTCTTCTTCTTTTTCTTGCAAAATAACTAATTTTGTGAAAAAATTGTTGGAAAGCAATTATGGATAACCTTTATCAGATGTGATGTAATACATGATCTGGTTCTTGTGGTTATTGTTGTCTTCAACAGCTTATCTTTCATTTCAACCCGCACAGTTCACATGAGTCATACAAATTTTTTTACTTAGTTAGCCATTCCTCTATGCATTATTGTTCGTTTTCAAATTAAATCTACGAATTTACTCTAGCTATTTACTTTTCATATTAGAAACCAACATGTAGTATTGGAATGCAGTGTAACTTGTTATCTGACTTCAATGTACTCTCTCCATTGTTTCAATTTGCTTTTGTATACTGAAAACTAACACTGTAATCTTTAAAGAATTCAAGATCATCTCAAGAATTCCAAAACCATTATACAAGGGGATCAAGAAGCCATGCAGCTCCGGCTTCATTTGGCTCAATGCCATTTAACCCCAGCAATCTTGCAAGCATGTTCATGGCGGCTGCAAATGCGGGTCAGGGGTCACATTCTCAGGAAGGTCAAGAAGATGCCAATAGTAGTGGAGCTAATGAACCGGAAATTCGATTTGGGGGCAATGTTAATGTAAACCAAGATCAAATTCCTCAGGAACTAAGAGGTGCCTTTCAATCCGTGATGCATATGTTTTCAGGGAATGCACCTCCTGGGCAACCGCATGATCAAATGAATGGAAGTCATGAAGATGCCAATAGTGAAGATAACGAAGCCGATATTCAATTCGGGGGCAATGTTAATTTAAATTCTGATCAAATCCCTCAGGAACTAAGTGGTGTCTTTCAAAATGTGATGGAGATGTTATCAGGGAATGCACCTCCTGGGCAACCACATGATCAAACGAATGAAAGAACAGCACCAAACTAA

Protein sequence

>MS.gene33593.t1

MSHNRIPTDSPLSHRIVRSFLHFLNQVEPSPGVDAEGIEVARECLVEAFKINNSASITGEPDSLIDIFKSFDANNQCERSRLDSMKASSSVSAQNAAAADAKTRPEESKPMDEDWTQDPHASVSKDELCGQFFAVLEKKHYFRTNIDGGDDIVQLEKASRLFDDGFTEMEKSGCGQFNLKNLAESLKTLGNKAMQSKQYFDAIELYNCAIAIYEKSAVYYCNRAAAYTQINRYTEAIQDSLRSIEIDPNYSKAYSRLGLAYYAQGNYRDAIDKGFKKALQLDPNNESVKENIRVAEHKLMEERHRADHNQNSRSSQEFQNHYTRGSRSHAAPASFGSMPFNPSNLASMFMAAANAGQGSHSQEGQEDANSSGANEPEIRFGGNVNVNQDQIPQELRGAFQSVMHMFSGNAPPGQPHDQMNGSHEDANSEDNEADIQFGGNVNLNSDQIPQELSGVFQNVMEMLSGNAPPGQPHDQTNERTAPN