AlfalfaGEDB Alfalfa Gene Editing Database

M. sativa cultivar XinJiangDaYe / MS.gene21036


Query id Subject id identity % alignment length mismatches gap openings q. start q. end s. start s. end e-value bit score
MS.gene21036.t1 MTR_1g084190 97.561 451 11 0 1 451 1 451 0.0 925
MS.gene21036.t1 MTR_8g101730 27.033 455 270 15 51 445 8 460 1.17e-35 139
Query id Subject id identity % alignment length mismatches gap openings q. start q. end s. start s. end e-value bit score
MS.gene21036.t1 AT1G09300 68.820 449 137 2 1 449 1 446 0.0 660
MS.gene21036.t1 AT1G09300 71.078 408 116 1 42 449 10 415 0.0 623
MS.gene21036.t1 AT4G29490 25.862 406 243 14 93 445 59 459 4.80e-35 137

Find 125 sgRNAs with CRISPR-Local

Find 0 sgRNAs with CRISPR-GE


CRISPR-Local

CRISPR-Local
sgRNA_sequence on_target_score Position Region
CTCACATAAGGTTTGGCTTC+TGG 0.259772 1.4:+68825031 None:intergenic
TGAATTTGCTTCACTTCTTT+CGG 0.281191 1.4:+68817281 None:intergenic
ACCAATGTCATGCCCTAAAA+TGG 0.283309 1.4:+68825069 None:intergenic
TAGAAGTTGGGTTCAGCTTA+TGG 0.298215 1.4:+68821147 None:intergenic
GAGGAACCCCTTATCATATC+TGG 0.302925 1.4:+68823888 None:intergenic
ACATCTGTCATCATCTTTAC+TGG 0.311255 1.4:+68825163 None:intergenic
AATTAGCATTTGCAGGTTCT+AGG 0.321480 1.4:-68826248 MS.gene21036:intron
GCCGATTACTCATATATTAC+AGG 0.336669 1.4:-68825115 MS.gene21036:CDS
ACCTCGTTCAACTTTCTCAT+TGG 0.343552 1.4:+68824135 None:intergenic
GCATCTTGTCGAAATGTATA+AGG 0.350875 1.4:+68825136 None:intergenic
ATTGATCCAGGTCACTATCT+AGG 0.354401 1.4:-68821000 MS.gene21036:intron
AATTTGTCTAGGATGTGATT+TGG 0.363924 1.4:-68824218 MS.gene21036:intron
CTTTCGGAATTGATGCTGTT+AGG 0.367090 1.4:+68817297 None:intergenic
GAAGTGTTATACATTATTCT+AGG 0.368162 1.4:-68823189 MS.gene21036:CDS
ATCTGCATCAATTGCTTGTC+AGG 0.376334 1.4:-68823709 MS.gene21036:intron
ATGATAGCCAAACTCTTCTC+TGG 0.383373 1.4:+68825196 None:intergenic
GAGATTCTCATTACAGAAAC+AGG 0.389989 1.4:-68818408 MS.gene21036:CDS
CTGTTTGTGTGAAAATGTTT+TGG 0.397576 1.4:+68826374 None:intergenic
TTCAATCCCGTGGTTGGTGG+TGG 0.398757 1.4:-68823220 MS.gene21036:CDS
CATGTTTGGCAGATTAAAGA+TGG 0.406377 1.4:-68823095 MS.gene21036:intron
AAGCTGAACCCAACTTCTAT+AGG 0.407794 1.4:-68821143 MS.gene21036:intron
ATGAGAGGTGCGCAGCGAAT+GGG 0.410718 1.4:-68823451 MS.gene21036:CDS
AGTTTATTTGTGACAGATAG+AGG 0.419351 1.4:-68818339 MS.gene21036:intron
GTGGTTGGTGGTGGGCCTAA+TGG 0.419819 1.4:-68823211 MS.gene21036:CDS
ACAGTAAGCTAGTTTCTTGA+AGG 0.421447 1.4:+68823812 None:intergenic
AACAACTTCAGCCATGGAAT+GGG 0.421682 1.4:-68817248 MS.gene21036:CDS
GCCTGTAATATATGAGTAAT+CGG 0.422629 1.4:+68825114 None:intergenic
GTTGGAATATTGAAAGATGT+TGG 0.424223 1.4:-68821182 MS.gene21036:CDS
AGTACCTGCGGGTGAGATGC+AGG 0.425788 1.4:+68826173 None:intergenic
GCAGCTAGCATACCTTCAAA+AGG 0.428825 1.4:+68823496 None:intergenic
GATCCTTCCAGATATGATAA+GGG 0.429047 1.4:-68823895 MS.gene21036:CDS
GCATTGTAGCTTCTAAAAGA+TGG 0.435409 1.4:-68825286 MS.gene21036:intron
TATCAGACTCACATAAGGTT+TGG 0.437944 1.4:+68825024 None:intergenic
AGATCCTTCCAGATATGATA+AGG 0.439022 1.4:-68823896 MS.gene21036:CDS
GCTTCTCCCAGAGAAGAGTT+TGG 0.441988 1.4:-68825203 MS.gene21036:CDS
CATTACAGAAACAGGTTATG+AGG 0.448108 1.4:-68818400 MS.gene21036:CDS
CATCTGTCATCATCTTTACT+GGG 0.449826 1.4:+68825164 None:intergenic
AACTTTAGTAAGTACCTCTC+TGG 0.452530 1.4:+68819864 None:intergenic
CATTGTAGCTTCTAAAAGAT+GGG 0.460019 1.4:-68825285 MS.gene21036:intron
CCTAAAATGGCCACACCACC+AGG 0.460454 1.4:+68825082 None:intergenic
TGGTAGCTTCTCTTCTGCCC+AGG 0.468113 1.4:-68822997 MS.gene21036:intron
GGTACCAAGGCATTGGGATA+AGG 0.470535 1.4:-68818440 MS.gene21036:CDS
CCGAATGGTTGTGTATTTGT+CGG 0.470570 1.4:+68821997 None:intergenic
TGCAGCATTGAATACATTCA+AGG 0.471710 1.4:-68824172 MS.gene21036:CDS
CACAAGGTTTCCGTTGATGT+TGG 0.475869 1.4:-68826206 MS.gene21036:CDS
TAATGCAGGTACCAAGGCAT+TGG 0.480015 1.4:-68818447 MS.gene21036:intron
TTCATCAGCTTGAGCTCTGA+AGG 0.481437 1.4:+68823735 None:intergenic
TGTGTGGAACTTTGTAAGCC+TGG 0.488945 1.4:-68822029 MS.gene21036:CDS
TTTGACTGTCCTCTGAAGCC+AGG 0.489111 1.4:-68820946 MS.gene21036:CDS
ACCAGGCTACCTATAGAAGT+TGG 0.492131 1.4:+68821134 None:intergenic
GTCTAGGATGTGATTTGGCA+AGG 0.493497 1.4:-68824213 MS.gene21036:intron
AAACAACTTCAGCCATGGAA+TGG 0.496327 1.4:-68817249 MS.gene21036:CDS
AACATTGTGATACAATTTCG+AGG 0.498748 1.4:+68823869 None:intergenic
AATGAGAGGTGCGCAGCGAA+TGG 0.501064 1.4:-68823452 MS.gene21036:CDS
TGGCCACACCACCAGGTTGT+TGG 0.502948 1.4:+68825089 None:intergenic
TAATGTATAACACTTCCATT+AGG 0.503380 1.4:+68823196 None:intergenic
TTCAGAGCTCAAGCTGATGA+AGG 0.504671 1.4:-68823733 MS.gene21036:CDS
ATTTGGCAAGGGCATGTAGC+AGG 0.507631 1.4:-68824201 MS.gene21036:CDS
TCAGCGATCTCACGCGTACC+TGG 0.508919 1.4:-68823028 MS.gene21036:CDS
ATCTATCTGTTTATACCCAT+CGG 0.510938 1.4:-68823776 MS.gene21036:CDS
TACATCAGCATATACTGAAC+TGG 0.512568 1.4:-68823838 MS.gene21036:CDS
CAGAAAGGACTCAAGGAGGT+TGG 0.513664 1.4:-68821200 MS.gene21036:CDS
AGATTCAATCCCGTGGTTGG+TGG 0.514269 1.4:-68823223 MS.gene21036:CDS
AATGCAGGTACCAAGGCATT+GGG 0.515702 1.4:-68818446 MS.gene21036:intron
TGAAGGTATGCTAGCTGCAA+AGG 0.521850 1.4:-68823491 MS.gene21036:CDS
GAAGCTACCACAAGGTGGCC+AGG 0.525969 1.4:+68823010 None:intergenic
TTTATACCCATCGGCTGCGA+TGG 0.528638 1.4:-68823767 MS.gene21036:CDS
ATCAACGGAAACCTTGTGAC+TGG 0.532664 1.4:+68826211 None:intergenic
CCAGGCTACCTATAGAAGTT+GGG 0.533663 1.4:+68821135 None:intergenic
CTTACAACACCTGGCTTCAG+AGG 0.541034 1.4:+68820937 None:intergenic
CAGCTAGCATACCTTCAAAA+GGG 0.544862 1.4:+68823497 None:intergenic
AAAGATGGGATGTACACTCC+TGG 0.554857 1.4:+68819897 None:intergenic
TATTCCTCACTAGATATGCC+TGG 0.555430 1.4:+68825253 None:intergenic
TGCGGGTGAGATGCAGGGGT+TGG 0.558374 1.4:+68826179 None:intergenic
TCTAGGATGTGATTTGGCAA+GGG 0.563003 1.4:-68824212 MS.gene21036:intron
AGGCATTGGGATAAGGATTG+AGG 0.564277 1.4:-68818433 MS.gene21036:CDS
CCGACAAATACACAACCATT+CGG 0.567790 1.4:-68821997 MS.gene21036:intron
AGTTGGGTTCAGCTTATGGT+AGG 0.568189 1.4:+68821151 None:intergenic
ATGGCTGAAGTTGTTTAGCA+AGG 0.569030 1.4:+68817256 None:intergenic
GGGAAAGCACTTACAACACC+TGG 0.570813 1.4:+68820928 None:intergenic
AACCCCTGCATCTCACCCGC+AGG 0.573533 1.4:-68826177 MS.gene21036:intron
ATTAGGCCCACCACCAACCA+CGG 0.575070 1.4:+68823213 None:intergenic
CAATCCTTATCCCAATGCCT+TGG 0.576534 1.4:+68818436 None:intergenic
TTGCTAAACAACTTCAGCCA+TGG 0.577584 1.4:-68817254 MS.gene21036:CDS
TCAATCCCGTGGTTGGTGGT+GGG 0.577700 1.4:-68823219 MS.gene21036:CDS
TGAGAGGTGCGCAGCGAATG+GGG 0.579429 1.4:-68823450 MS.gene21036:intron
GTACCTGCGGGTGAGATGCA+GGG 0.580749 1.4:+68826174 None:intergenic
TACCTGCGGGTGAGATGCAG+GGG 0.581809 1.4:+68826175 None:intergenic
GACATTGGATGTGAGTTACA+TGG 0.582230 1.4:-68823056 MS.gene21036:CDS
TGATAGCCAAACTCTTCTCT+GGG 0.583126 1.4:+68825197 None:intergenic
TCTCATTGGATACGCCTCGT+TGG 0.585869 1.4:+68824149 None:intergenic
AAAGATGGGGAGATCACACC+AGG 0.587344 1.4:-68825271 MS.gene21036:CDS
ATCAGATTCAATCCCGTGGT+TGG 0.587531 1.4:-68823226 MS.gene21036:intron
TTAGGCCCACCACCAACCAC+GGG 0.589559 1.4:+68823214 None:intergenic
AGCTTACTGTAACAACGTAA+AGG 0.590674 1.4:-68823799 MS.gene21036:CDS
ACAACTTCAGCCATGGAATG+GGG 0.593125 1.4:-68817247 MS.gene21036:CDS
CTGCCAACAACCTGGTGGTG+TGG 0.593769 1.4:-68825092 MS.gene21036:CDS
TCGTTCTTATTCCAGTCACA+AGG 0.594752 1.4:-68826222 MS.gene21036:CDS
ATTACAGGCTGCCAACAACC+TGG 0.596169 1.4:-68825100 MS.gene21036:CDS
CAGGTTGAAATGCTGCAGAA+AGG 0.597743 1.4:-68821215 MS.gene21036:CDS
ACGCGTACCTGGCCACCTTG+TGG 0.599158 1.4:-68823017 MS.gene21036:CDS
CAGGTAATAACTATTGAACC+AGG 0.600635 1.4:-68819915 MS.gene21036:CDS
TCCAGGTCACTATCTAGGAA+TGG 0.601316 1.4:-68820995 MS.gene21036:intron
AGAAACTAACAAGCATTGTG+TGG 0.607256 1.4:-68822045 MS.gene21036:CDS
GGTTGGTTGTCCAACATCAA+CGG 0.613022 1.4:+68826196 None:intergenic
ATCCTTCCAGATATGATAAG+GGG 0.625213 1.4:-68823894 MS.gene21036:CDS
AATGCTGCAGAAAGGACTCA+AGG 0.627439 1.4:-68821207 MS.gene21036:CDS
TTATTCTAGGAACGATCAGA+AGG 0.629437 1.4:-68823176 MS.gene21036:intron
CTGTATCAGATTCAATCCCG+TGG 0.633552 1.4:-68823230 MS.gene21036:intron
CAACTTCAGCCATGGAATGG+GGG 0.635710 1.4:-68817246 MS.gene21036:CDS
ATCAGCATATACTGAACTGG+AGG 0.641525 1.4:-68823835 MS.gene21036:CDS
ATTGTAGCTTCTAAAAGATG+GGG 0.641791 1.4:-68825284 MS.gene21036:intron
ATTTGTCGGATACTAGCACC+AGG 0.642966 1.4:+68822011 None:intergenic
AACCCCTTATCATATCTGGA+AGG 0.645748 1.4:+68823892 None:intergenic
GATCTCTAATGCAGGTACCA+AGG 0.651356 1.4:-68818453 MS.gene21036:intron
CACACCAGGCATATCTAGTG+AGG 0.673963 1.4:-68825257 MS.gene21036:CDS
ATCTGTCATCATCTTTACTG+GGG 0.686632 1.4:+68825165 None:intergenic
ACATTGGATGTGAGTTACAT+GGG 0.693361 1.4:-68823055 MS.gene21036:CDS
GAATACGAATGCAAAATGAG+AGG 0.696219 1.4:-68823466 MS.gene21036:CDS
GCAGAAGAGAAGCTACCACA+AGG 0.709712 1.4:+68823002 None:intergenic
GAAGAGAAGCTACCACAAGG+TGG 0.733869 1.4:+68823005 None:intergenic
TCCAATGAGAAAGTTGAACG+AGG 0.743293 1.4:-68824136 MS.gene21036:intron
ACAGGCTGCCAACAACCTGG+TGG 0.744058 1.4:-68825097 MS.gene21036:CDS
GCTGCAGAAAGGACTCAAGG+AGG 0.745589 1.4:-68821204 MS.gene21036:CDS
GAATACATTCAAGGCCAACG+AGG 0.787952 1.4:-68824163 MS.gene21036:CDS

CRISPR-GE

badsite warning sgRNA_sequence Strand Position Region GC_content


Chromosome Type Strat End Strand Name
chr1.4 gene 68817242 68826421 68817242 ID=MS.gene21036
chr1.4 mRNA 68817242 68826421 68817242 ID=MS.gene21036.t1;Parent=MS.gene21036
chr1.4 exon 68826374 68826421 68826374 ID=MS.gene21036.t1.exon1;Parent=MS.gene21036.t1
chr1.4 CDS 68826374 68826421 68826374 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68826178 68826255 68826178 ID=MS.gene21036.t1.exon2;Parent=MS.gene21036.t1
chr1.4 CDS 68826178 68826255 68826178 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68825036 68825299 68825036 ID=MS.gene21036.t1.exon3;Parent=MS.gene21036.t1
chr1.4 CDS 68825036 68825299 68825036 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68824137 68824229 68824137 ID=MS.gene21036.t1.exon4;Parent=MS.gene21036.t1
chr1.4 CDS 68824137 68824229 68824137 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68823710 68823916 68823710 ID=MS.gene21036.t1.exon5;Parent=MS.gene21036.t1
chr1.4 CDS 68823710 68823916 68823710 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68823451 68823554 68823451 ID=MS.gene21036.t1.exon6;Parent=MS.gene21036.t1
chr1.4 CDS 68823451 68823554 68823451 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68823177 68823243 68823177 ID=MS.gene21036.t1.exon7;Parent=MS.gene21036.t1
chr1.4 CDS 68823177 68823243 68823177 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68822998 68823105 68822998 ID=MS.gene21036.t1.exon8;Parent=MS.gene21036.t1
chr1.4 CDS 68822998 68823105 68822998 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68821998 68822090 68821998 ID=MS.gene21036.t1.exon9;Parent=MS.gene21036.t1
chr1.4 CDS 68821998 68822090 68821998 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68821144 68821234 68821144 ID=MS.gene21036.t1.exon10;Parent=MS.gene21036.t1
chr1.4 CDS 68821144 68821234 68821144 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68820942 68821012 68820942 ID=MS.gene21036.t1.exon11;Parent=MS.gene21036.t1
chr1.4 CDS 68820942 68821012 68820942 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68819879 68819934 68819879 ID=MS.gene21036.t1.exon12;Parent=MS.gene21036.t1
chr1.4 CDS 68819879 68819934 68819879 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68818340 68818461 68818340 ID=MS.gene21036.t1.exon13;Parent=MS.gene21036.t1
chr1.4 CDS 68818340 68818461 68818340 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
chr1.4 exon 68817242 68817321 68817242 ID=MS.gene21036.t1.exon14;Parent=MS.gene21036.t1
chr1.4 CDS 68817242 68817321 68817242 ID=cds.MS.gene21036.t1;Parent=MS.gene21036.t1
Gene Sequence

>MS.gene21036

ATGCAGCGTGTTGTAAGAAAACTAACCAAAACATTTTCACACAAACAGGTATGTCAGTCCATATCTTCATTTCTTCAAATTCACCTAATTACTTTATTATTCTTATTTGTTTCAATTAATCAAGGAAATAATCATTTTAGTTAATTTGAATAATTAGCATTTGCAGGTTCTAGGTTTTCGTTCTTATTCCAGTCACAAGGTTTCCGTTGATGTTGGACAACCAACCCCTGCATCTCACCCGCAGGTACTATTTCACAACATCCACCTTAAGAATTAAATTATTATTATGTTCTTATTTGCCATTTCGTTGTAAATTAGTTGTACGCTAATAGACATTTCTACTTTTCTATTTCATAGGATTAATCAGTTTGAAAACACACCTAAGAGTAGAATGACATGGCAAAACAACGAGTTCTTACTGAATGTCAGTGTAAAACTGGTTTACACTAATGATACATGTCTTTTAAGCTCTTAGTTAATAAGAACAAGTCTCTAAGAATTTAAACTACTATGCATTGATAATATTTTTTTATTAGAATGACAATATAAAGAATCAAAGAGAAGAGTGACTCACAAAGGCGGCTAGAGTTTTATTAGAATGACAATATCATGACTTATCAAAATAAATGTTACAAGCCACTATATAGTAAAACCTAAAAGGGATAGACTACTAACTAACTTATAAATCCAAAAGGTCCGACAACATAGTGTACACTGTATAAATGTGTTTGGGCCCCTCGCCCCTTGAGCTAGCTTTTGGGATAAGATCCAAGCCCAGTCAACATGTACGTCCATCAAATTATATTTTAAGATGTAGATATTTGTTATGGTATTTAGGAAACATTATAGTGAAACATGATTATGAGTGTGTGTGTGTGTGTGTGTGTGTATTTTAAACTATCGGTACAATAAGCACAATCTTTAAATTTAGTTGAAATAGATAATTTTTAGAAGCTAACATTAGTTGGTAAAATACTACCATTGTCTAGGACTTTTGGAGTGTCTGAAGTATTCACCAAGTGTTTGACATGCTTACTATGATTATTTTCAAGCATTTAACCGAAATACTTATTTGTTAAAATTCATAACCATGTTTAAAAAAAAAGTTATGTTGCATTGTAGCTTCTAAAAGATGGGGAGATCACACCAGGCATATCTAGTGAGGAATACATCCTAAGAAGAGAAAAATTGTTGGAGCTTCTCCCAGAGAAGAGTTTGGCTATCATTGCTGCTGCCCCAGTAAAGATGATGACAGATGTCGTGCCTTATACATTTCGACAAGATGCCGATTACTCATATATTACAGGCTGCCAACAACCTGGTGGTGTGGCCATTTTAGGGCATGACATTGGTTTATGTATGTTCATGCCAGAAGCCAAACCTTATGTGAGTCTGATACGATAGCTTATTCTTCTTTTAAAATAGAACATCTTTATTGTTTCTCCCGGTATGTATTTTGCAATGCTTACCTATGCCATAAATTCTCCTGGTATCATCATCATCATCATCATCATCATTATTATCGTCGTCATCACGATCATGATCATTGTCATCTCTGGTTAACAAAACTGTATATTACATATTAGAAATTTTGTGTGTTCGTTTAAAAAATATCACCTTAAATGTTTTTGGTCCATTAATTTTTGGGTCATGCTAACAAGTTAAAGGCACTTGTTAAAGATTTAAAATTAGAAACGATTGATGAAATTTAGGTAGAAAAAGTTACTTTTTGCTGTTTCAATGTGTTAAAATGCACAATTTCCATTAATTTTTTGTTTTGAATCAGCCCTTAATTTTTGTCAGTCCTTAAAAGATCCAAAAATCATGTGCGTATAATGCCTAAAGAACCTCTGCGAGGGACTTATTAAAACGCATTATTTTTGTGAACTAGTAGTATATAGAGGCCAGGAATTCCTTTTTTGATGAAAAAAACTGATTCCAAAAATCATCTTATATTTGGGGATCAAGAACAGATTTAAGCCACATAAATACTTGAACTTTCATCCGATTATGAATAAATATATTTTCTTTTGAAATCCATGTAGTATTTCAATTGTTATCATGTTAGAGCGAACCATCCATGAATTTGGTCATCATGTGTGTGATTTTTCTTGCCATGTCTTCTAGGGTTATAATATATGAAATAACTAATTTTGATGCTAACATGACATTGAAATCTTAAATTTGTCTAGGATGTGATTTGGCAAGGGCATGTAGCAGGAGTTGATGCAGCATTGAATACATTCAAGGCCAACGAGGCGTATCCAATGAGAAAGTTGAACGAGGTGAATCACTATTATCGAGTATTAAAAACTTTTTTTACTTATGTAGCAATTATCTAGTTTGTCCTCTACACTTGAACTATTTTCATCGATACGCCTTGCTGGTGGCTTGCAGCCAATGATGCCAACTTTTCTGTCTAAAATGTGCATTTAAGTCTCCTCAATCGAACGGTTACTTCTCTTTAAAATATTTCAATTTTAACATGATTAGTTACTTATTCAGATCCTTCCAGATATGATAAGGGGTTCCTCGAAATTGTATCACAATGTTCAGACTGCTACATCAGCATATACTGAACTGGAGGCCTTCAAGAAACTAGCTTACTGTAACAACGTAAAGGATCTATCTGTTTATACCCATCGGCTGCGATGGATAAAATCTCCTTCAGAGCTCAAGCTGATGAAGGAATCTGCATCAATTGCTTGTCAGGTACTTTCAAGTTTCAACTCTATTTGATGGCTAACTATTTGAGGACTGAAAACCTAGAACTCCTAGTGAGGCTGTTATATGCAGATTTGTTTAGGAGGTCATAGATGCCAAATGGTAAGAACTTGATGATTTGAACATTTTCTGGTTTTGAATAGGCTCTTTTGTCAACAATGATGCATTCAAAAACATACCCTTTTGAAGGTATGCTAGCTGCAAAGGTCGAATACGAATGCAAAATGAGAGGTGCGCAGCGAATGGGGTGAGCATTTCTTTATATTCAGCGACTGTCCCTCTTATGCAATTTGTTCACGCCAGACTGCTTTGTCTATTGTTATTGATAACTTAAAAAGCTTCTGCAAGAGTAACAGAGAAATTCATTAATATAGAAAACTAAATGGAAGTTCAACAATCACCCACGTTCACAGTTAATGAGATAATTTTTATCTTTATTTCCTCTCTGTATCAGATTCAATCCCGTGGTTGGTGGTGGGCCTAATGGAAGTGTTATACATTATTCTAGGAACGATCAGAAGGTAAATTCATTTACTAACTTCTATCTAGTATGTGTAATTTTTTGTTTTCTTATCTTTATCATGTTTGGCAGATTAAAGATGGAGATCTTGTTTTGATGGACATTGGATGTGAGTTACATGGGTATCTCAGCGATCTCACGCGTACCTGGCCACCTTGTGGTAGCTTCTCTTCTGCCCAGGTTAGCATGGTTTGGTTCCTTTTAGAAATCCTATCTAGGTATTTGGGAAATCTTACGGCCCGTTTACTTTGAGAAAATGTTTTGTTTTCATTTCTTGTTTTCACTTCTAATTACAAAGTTGATACATTGTTTTCAATATGCTTTCTGTTTTCAACGATTTGTACAGAAAACACTGAAAACAATCTGTTGTTGTTTTCTGTTTCCTTGATAGACTATTTACTGATGATTTAATAGTAATCCAAGACAAGCAGACTGTATAGTCCTAGTTTTCCATTGTAGTCCTCATTCGATGATAGGCTTAATTGCATCGATTTTCATTTATGGATATGCATATGCTCTCATATGTATTGCACTTCTTTTGGACTGGGATTCTGTGATGTTGACCCATTTTTAATCTAATTCCATTTTTTATTTTAGTAGTATGATTTTCTAAGCTATCGTATCCAGCATCCATACCTTGAGTAAAAAAATAAAAGATATGATTCTTCTATAAACATCCTAGTTGTTTCTTGTTCCCTGACAAAATGCAGATGGTGGCATGCTAACAAGTCATTATAGCCCAAATGAATATTTTTGGACATACAAATTGTTTGAACGTGTAAAATATATCTAATTATTTTGTTCATGCATGAACATCAATTTGTGATAAATTCTGCCTCTTCATGCCCCCTTTGTTTAGGGGATCTGATGTGCCTAAAGCCTGCACATACCCGACATATGGCATGTGGGGCAGAATCTTTCTTTCCTTCTGCTATCCCAATGTCTTATAAATATTTGAAACTTGGTTTCTGTTCCATATTCATGCGAAAAGAATCTGGTATAAAATAGACTGCTCATTCAACTCGTTGCTTTACCCTTGCAGTCTTGTCACTTTTAATATGAATTTCACGATTTTTCTATATTTCAGGAAGAGCTTTATGAGCTTATACTAGAAACTAACAAGCATTGTGTGGAACTTTGTAAGCCTGGTGCTAGTATCCGACAAATACACAACCATTCGGTATCTCCTTTTTCTTATCATTTTTTCAATTTATAAAGATGCAATAAGGTCACTGTCCTTATCCTTAAATAAAAGTCATCGTCCTTCATGAGTTATTCTAATAACTTAAACTTTTTGAAGTTGATGATTGACATGGTATTGGAGTCTCTTTGATTAAGTGTGTCTGGTTTGGTTTCACATTGGGAAACTCAAAATCAAGTTTGATGAATAAATTTTGAGTCAAGGTTAACAACGGACTCCAAACTTAATTTTAACCCCTAAACATAATGCTTTTGTGGTTAGTAAAATTATGACAATTTTAATAAAATTAATTATGAATATCATCTAAAATATGGTTTTATTACTATTATTCAAACATTTACTGTGTCAAACATCGATTTGCAAATAATTATTTACAACTATAAATCAATTTACTTGAATTCATTTTTAACCAAAATTAGTTTTGCAAAATTAAGTTTCTCTAAAATCACTTTTTCAAACTTCAACTCAAACACGCTATAAGTTGTTATGAGGTTAATCCTTGCTGTTCTAATATTTTTATAAATAAATATTGAATTTTAATGTAAGGAAGGTTAAACTTTTAATATTTGCTTTGCTCTATGCGTGATGAGCACTATCTCATAATCAGTAAACAACCATAGTGTGCCAGTGGCCTGAGTATCTTCTTTATAGACCCGTCTGTGCTTAGTGTCTATCTTACTAGCTGCTTGAAGCTTATTTAGCAATAAAACAGTGAACCGTGTGAAATGGTCTGGTTGCACAGGTTGAAATGCTGCAGAAAGGACTCAAGGAGGTTGGAATATTGAAAGATGTTGGAAGCAGTTCCTACCATAAGCTGAACCCAACTTCTATAGGTAGCCTGGTCACGCCCATGTTTTAGCTGACAAAGCATGCCATATTTTTTTTTGTTACAGATATTCATAAGCTAGTTTGACATTTGAAATAAAAATATGGTGATGTATTTAGTCATGTTTTATTGATCCAGGTCACTATCTAGGAATGGATATTCATGATTGTTCAGCGATCAGCTTTGACTGTCCTCTGAAGCCAGGTGTTGTAAGTGCTTTCCCTGACTGAACAACAGTTCAATAATTAATATCAATTAATTTTCAGATGTGGGTGAGGATTATGTTCCCATAAAATCATTATATGGTAATGGTTTTTAGTCAACTGTGTCAGCGTGACTTCATAGTGTCTTCAAACTAAATAAACGTGGTTTGAAATGCACATTTTTCAGTATTTCCAAACATTGGTTTCATCTAAGGTCCTTGTAATTTGTGTCTATCCAAGTTTGAAGCACTCGTTCAGTAGTCTGCACTTTCTTTTCTTAAGTGATTTTTTAAATGATACCTTCATTCCAAAATTTCCCCTTGTAGACCTGGCAAATTATATTTCTTTATTTTCAATTTTTGTATGGTAAAGGGTAAAAAAGACTCAGCCAATCAGCTTAAAAATATTCTGCTGCATTCAAATATCATACAAAAATCATACAAATAGGGAAGTTTTAGAGTTTACACCCAGGGACACTTTCCAGTGATCATCACCACATTTTATGTATAAAATAGGATTTGCTTCTCTAAAGTAAGGGCTGCACTTAAGAGGATAAATTGTGTAAGGATAGTGTTGATTGGAGAATCAATGGTTGAGAACTTAATAACTTCATATTTAGTGTGTTATATACATGCACATCTTTTATCACTTTCTTAATGTTAATGTCCGGCAATTATTATTGCATTTAATCCTCTAAAGTGAATCTCACACTTCCGATGAGGCAAATCCATGAAAATATAATATAGTTTCTGGCCAAATTAATATAGTTTGTTATTTTTTAGCGGGGTAGGGTGGGATGGGAAGGAAGTAGTGTTGGATTGATGTGAATGAGCATTGCCACCAAGTTCTTTTTTTTTTGAAGAAATCATTGCCACCAAGTTCTATTTTTCTTTCAGTTATCTAATTGGATTTTTTACCGGCACTTGTCGTTCAAATATGCTTTACCAATTCATATGGACATAATGTATACTCATATTAACTGACAGTCTGTTCTTTTTTTAATTTGTGAGCAGGTAATAACTATTGAACCAGGAGTGTACATCCCATCTTTTTTTAATTGTCCAGAGAGGTACTTACTAAAGTTGATTGAAGAACTTGTTTTAACAAATTGATCATTACAAATTACTGGTTTACCAGCTATTTTTACAACATTCAAATTCATGGGAAGTTTCCTTCCATTAACCAGCTTCAGAAATTTTACCTCTCATTTTCCCTAGAATCTGATTGTGTACATTGGTATGTTAAAACCTGGGTTGGATAACCCTGGTTGCTTCTTATTTTCAAGGAGATTTATTATTGATTTCCCTGCTTATTCATTATCTTCTGAATTCTGTAAGTTCTTGTTTGATTCAATAAAATGGGGTGAAAGGAATGAAAACGATTAGAGTTTGTTTAATAGGAATTTTAGGAGGTTAATCATTTCCAAGGGGGAGTTCTTATTCTATCTTATGAGGGAAGATTCATTCCCTCTAAAGTTGAATGGAAATGTACCATCTATGTCCCTTATTACAAGCACCTTTGGATAATTCATGCATTAAGAAAGTTAGATAATAGATTAAACAAGCACCTTTGGATAATTCATGCATTAAGAAAATTAGATAATGGATTAAATTTGTATATTTTGTAAATTGATGTTCCGAAATTGCCTTTTAATGAAGTTTAGTTAAGCAATGTAACCACAAGGTAGTAAATTAAATTAGGGGTATTTTTTTAAGAAAATAATTTATGCATCTAATAATTATGAGAAGGGGTTATTTTTAAGCACATTTTTTCCAAGTGGCTCTTATATTTAAGGACAGATGTAGTATTTCTTTCCAACCATTAAATTGTTGCCCTGTATATTTTCTAAAAATGGTAAAAATATCATGACCTACTCCTACCTGTGCCGCATCGTCATGGCTCTACCTTTACCAACCCGTAGTTTCCACTAAACTAGCTCCACCCCAAGGGTAGGTCTGTCATAAAAAAATGTATTATTAAACTCATGTTTGCTTTGGCAGACCATGTAAAGTATAATGGAATAGTCATTTGATCATGTTAAACATGCTATTGGAAATGACTATTCAATTCCATTCTCATTCACATTCTTATCAAAATCATTCCTTTGATTTGAACCTGCTGGGACCTGAATTTCTACAGAAGTATTGCTCATAAGCTATTCAATCCATCAAGAGACATGTATGCAGTCTTGGATCTTGAATTTTGTTTAATTTTATTTGAGAGTTATGAATGTTTTTTAAGTTGATTGGTTTCGATTGCAGAACAAGTTGTACAATTGAGCTCAACATAGTATGAAGTTCATTTTTTTTGCGTCTGTATTGTACTTGTTTTACCATTTTAGTCATTCCCTTTCCTTAACTTCAGCTACATGATTGATTGATCTCAATCTCACGTTATTAGGTCACCATGTATCCCATGTGAGCATTTAAGTTGGTTTTCAGTCTTGGCATTCAATGTTTCAGTTTTGTACGTGATCTCTAATGCAGGTACCAAGGCATTGGGATAAGGATTGAGGATGAGATTCTCATTACAGAAACAGGTTATGAGGTAGTAATTTCTTTCTCTAAAAGTAGTGCAATTGAAACTAGTTTATTTGTGACAGATAGAGGTGTGCTCATTACTCTTCTTTTAAAACTTGAAATAATGAATGAAGGTAGTAGGTCCTTTCATTTTAAAATTATGTTGATTTACCTCTTTATAAACATTCCTTTTTTCTATAGGGTCCCATCTATTTAATATTTATAATGACAGACAAAAAATTCAATTGACAGTCATATATCGTATAAAATTTTAAAAGAGAGGATTATTCAAATAACAAATCAAGTTTCAAGTAAATACTTATCCTTTTAAAAAAAAAAAAAAACTAAGCACTTGCTAATGAAATTTTTGCAAGCTGTTAAAGAAGAGGACGGCGCTTCACTCAGCAAGTCATCATCAGCTTCACTTTTTGGTTTAATCTTGATTTTAGCTTTAAACATACTAGTAAGGTTGAACCTTTCATTTGAAAAATAAAAAATATTCATTGGAAACAGCAGCTTTTCAAGTCATCAATTAATTGGAAAGTGCTACTTTTTATTTCTTGTTTTCTTAACTAATGTCTTAAGGGCATTTGTTTGCATGAGTGCATGATCCTAATATCAAATCATTAGCAGAATTTTAAAGCCAGTGCATGATTAAGTGATGGAAGTAACTGTGATAACCTTTTTCTCAAGGATAGCAGCCCCTTGATAAGTTGTTATCTGTGATTGGTTCTGGTATTACATATGGTTGTGTACAGAACCATTAAATGTGTTCCATCTTCCATGTAAGGTGAAACCAAATGTACTGTAACACTAGAAGCAAAAACACTATACAGTGACCTTTTCTCTGTGTGTGGTTGTATGTGGACAAAATTTGGGTGAAATTTCTTAGGTTCCTGCAGTGATGTTCTCTAAATATAAACGAAAGATGAGTATTACTTTTAATTTCCCAACTATGATAACTTTAGAAAAGAAAATGAAGTGGCTAAACAAAATTAATATATTATATACTTTTTTATTCAAAACAGTTCTATAAGCACCTACAGAACTTAACCTAGTTTTATGACACACACTCCTGAAGTTAATAACTAATATTACGCAATACAGGTCCTAACAGCATCAATTCCGAAAGAAGTGAAGCAAATTCAATCCTTGCTAAACAACTTCAGCCATGGAATGGGGGTTGA

Protein sequence

>MS.gene21036.t1

MQRVVRKLTKTFSHKQVLGFRSYSSHKVSVDVGQPTPASHPQLLKDGEITPGISSEEYILRREKLLELLPEKSLAIIAAAPVKMMTDVVPYTFRQDADYSYITGCQQPGGVAILGHDIGLCMFMPEAKPYDVIWQGHVAGVDAALNTFKANEAYPMRKLNEILPDMIRGSSKLYHNVQTATSAYTELEAFKKLAYCNNVKDLSVYTHRLRWIKSPSELKLMKESASIACQALLSTMMHSKTYPFEGMLAAKVEYECKMRGAQRMGFNPVVGGGPNGSVIHYSRNDQKIKDGDLVLMDIGCELHGYLSDLTRTWPPCGSFSSAQEELYELILETNKHCVELCKPGASIRQIHNHSVEMLQKGLKEVGILKDVGSSSYHKLNPTSIGHYLGMDIHDCSAISFDCPLKPGVVITIEPGVYIPSFFNCPERYQGIGIRIEDEILITETGYEVVISFSKSSAIETSLFVTDRGPNSINSERSEANSILAKQLQPWNGG