GeneMark.hmm (Version 2.2a)
Sequence name: Hvrn.contig8
Sequence length: 50124 bp
G+C content: 44.82%
Matrices file: /home/software/analysis/gene-prediction/genemark/matdir/osativa.mtx (Oryza sativa)
Thu Mar 22 10:25:00 2001
Predicted genes/exons
Gene Exon Strand Exon Exon Range Exon Start/End
# # Type Length Frame
1 1 - Initial 1805 2176 372 3 1
2 5 - Terminal 3108 3229 122 3 2
2 4 - Internal 3869 4501 633 1 2
2 3 - Internal 4820 4888 69 1 2
2 2 - Internal 4981 5061 81 1 2
2 1 - Initial 5296 5656 361 1 1
3 2 - Terminal 7171 7288 118 3 3
3 1 - Initial 7540 7787 248 2 1
4 1 + Single 15431 15757 327 1 3
5 1 + Initial 17526 17696 171 1 3
5 2 + Internal 17772 17887 116 1 2
5 3 + Internal 18005 18074 70 3 3
5 4 + Internal 18456 18539 84 1 3
5 5 + Internal 18628 18714 87 1 3
5 6 + Internal 18807 18870 64 1 1
5 7 + Internal 19944 20038 95 2 3
5 8 + Internal 20139 20293 155 1 2
5 9 + Terminal 20779 20788 10 3 3
6 5 - Terminal 23000 23061 62 3 2
6 4 - Internal 23397 24101 705 1 2
6 3 - Internal 24708 24821 114 1 2
6 2 - Internal 25079 25356 278 1 3
6 1 - Initial 26970 26977 8 2 1
7 3 - Terminal 34218 34310 93 3 1
7 2 - Internal 35900 36301 402 3 1
7 1 - Initial 36392 36448 57 3 1
8 1 + Initial 36531 37064 534 1 3
8 2 + Terminal 37153 37161 9 1 3
9 3 - Terminal 37880 37917 38 3 2
9 2 - Internal 38938 39006 69 1 2
9 1 - Initial 39080 40214 1135 1 1
10 2 - Terminal 41091 41554 464 3 2
10 1 - Initial 41635 41713 79 1 1
11 1 - Single 41744 42061 318 3 1
12 1 + Initial 42171 42212 42 1 3
12 2 + Terminal 42432 42824 393 1 3
13 7 - Terminal 43798 43932 135 3 1
13 6 - Internal 44220 44297 78 3 1
13 5 - Internal 47595 47685 91 3 3
13 4 - Internal 48393 48526 134 2 1
13 3 - Internal 48643 49024 382 3 3
13 2 - Internal 49118 49149 32 2 1
13 1 - Initial 49457 49507 51 3 1
Predicted gene sequence(s):
>Hvrn.contig8|GeneMark.hmm|gene 1|124_aa
MEVAVKGYADASFDTDPDDSKSQTGYVFILNGGAVSWCSSKQSVVADSRCEAEYMAALEA
AKEGVWMKQFMTDLGVVSSALDPLTLLCDNTRAIALAKEPRFHNKTRHIKRRFNLIRDYV
EGED
>Hvrn.contig8|GeneMark.hmm|gene 2|421_aa
MAHAKVTLNFNTFLEKAKLKDDGSNFVDWARNLKLLLQAGKKDYVLNVALGDEPPAAADQ
DAKNAWLACKEDYSVVQCAVLYGLEPGLQRCFERHGAYEMFQELKFIFQKNARIERYETS
ESELRKEHQVLMVNKATSFKRSGKGKKGYGSLEAQLSKYLAGKKAAKEKSENNGCSISMS
NIFYGHAPNVRGLFILNLDSDNTHIHNIETKRVRVNNDSAMFLWHCRLGHIGVKRMKKLH
TDGLLESLDFDSLDTCEPCLMGKMTKTPFSGTMERASDLLEIIHTDVCGPMSAEARGGYR
YFLTFIDDLSRYGYVYLMKHKSETFEKFKQFQSEVENHRNKKIKFLRSDHGGEYLSFEFG
AHLRQCGIVSQLTPLGTPQRNEAMVGPDSNKWLEAMKSEIGSMYGNKVWTLEVLPEGRKA
I
>Hvrn.contig8|GeneMark.hmm|gene 3|121_aa
MVRRQRLIYRMTSFDYRKVFGHYRECTESDEWVPNVHREGPTHPGKPIGPRGGAPALGGL
VGQPKRALCAKDRKSKRKKKRKRSRYFTTTGAPSRCRRTHLLIRLACWIKKAEIIIELYV
C
>Hvrn.contig8|GeneMark.hmm|gene 4|108_aa
MFTTPKAGGGMYLCLSVGWGIVGRRRVMSGCGQGSEMGLVGLRTRRHWAKTGRGGAAGGA
ASIGDGPRRAADKATLGEDGPGRGVGRGGVGRRRVASGGGDREEDEWS
>Hvrn.contig8|GeneMark.hmm|gene 5|283_aa
MDAAVQEAKLLRQVNALIVAHLRDQNLTQAAAAVAAATMTPKADASLPNHLLRLVAKGLA
AEREEAARGGGAPPAFDSAGGGGLARPLGTSAVDFSVQNVRGPSKTFPKHETRHISDHKN
VARCAKFSPDGKHFATGSGDTSIKFFEVSKIKQTMLGDSKEGPGRPVVRTFYDHVQLLTQ
LLVHSTDKVSSFVTNIPGTDHPVAHLYDVNTFTCFLSANPQDSSAAINQVRYSGTGSMYV
TASKDGSLRIWDGVSAECVRPIIGAHGSVEATSAIFTKDESGF
>Hvrn.contig8|GeneMark.hmm|gene 6|388_aa
MGSVVFLEGSEGNLQALKDTLQAYQVASAQKVNLQKSSILDGKGCRDEDKGTLKQTIGID
SEALSERYSGLPTVVGRLKDGSFEYVRERSKGKVSGSVGKASVALQFPSSLCARVLKARY
FKECTIMNTTCPNAMFWKVLSSEKWVPVAIPPVSEGPHGELASWLLRWFAEVGDPERELM
VHAVYGLWLARNEARDGKRIVDPRVVEENVYQHIIEWNAIHMKKPRSTTPTLAVRWSPPE
QGWLKANSDGALAKLRDRGGGGVVLRDHDGAYRGGACYVFRDVSDPEVVEILACRKAVHL
AVQTGATRVHVEVDSKGMAAMLNDQAKNLSAAGPIVEEIKLLGRTLQGFIVSRVRRSGNH
GAHLLAREVRSVYTHVILKQPLFDTCRL
>Hvrn.contig8|GeneMark.hmm|gene 7|183_aa
MVLTEKEAKGFVFSGPVEEAWGLHHDAQFRDLGNNLFLVHFGGEGDWKHSRNNGPWQFDF
MILKGYDGKTRPSEMVFDSVEAWVRVEDLPLDRRTREFGEALGNWLGEVVKVDVERDGFA
KGKYLRVRAKIFVYEPVVRYFNLKESVDDEVETAEGQAGPLEAEAEARRGASVSAHSFGR
WGK
>Hvrn.contig8|GeneMark.hmm|gene 8|180_aa
MASTVSPWSETPQDILGLVIDRLHSSPDHEEPRLSAAWSRFLLAVPVAAANRRGFQRARR
TRHSAAADRARFRAVCRSWHLAMRQHVSTPRVLPWIILSDGYFFTPSDNGCRAPRRLPSL
PKNARCIGSTDGWLALDCTDARNVHTYLLHNPFSDTTVPLPELDPIIANVSEFFAVRKAA
>Hvrn.contig8|GeneMark.hmm|gene 9|413_aa
MPLKFWDETFSTAVYLINRVPSRVIHNQTPLERLFGLTPNYTFLRIFGCAVWPNLRPFNK
HKLEYRSKQCVFIGYNYLHKGYKCLDVSTGRVYVSQDVIFDEHIFPFASLHPNAGAQLRA
ELVLLPPTLLNLSSPLTPSAAPNDPMAISTIYAPTSANSVQDSAGISHDFMQPNVSTDLV
ATENPGLHASESATAAPGAGDPPLQASGSAAAAPGSSPGFVHQPAASVGRSPASTSDPAR
QPDASAARPPVSDPVRPTTVATALFPASDLVRSPQEIRLQRRAPPTAPWIGRGLPRVVGP
PCLLPWTREISLDVVTRYRLLRLRPMQRRRCPMQRPPRLLFLLVCHLIRYLLTLRCPVVS
STICNPCNQHLHPLGLILGEPENLKEAIADPKWKAAMDEEFDWAGCPDDRRST
>Hvrn.contig8|GeneMark.hmm|gene 10|180_aa
MAAAGKPLDDDELVSYILQGLDSDYNPEARIDAQNGSNTNSFSINLASKGGSRNNNDTRP
SGPGGGNPAAYRGAGGGFFPNTLVAPPPSGGRDETCQICKRQGHATWHCFKRYDKNFNPP
PKRQGGGGGNNSGGGGNSSGGNTKSANTVPAAYDVDTNWYLDTGAMDHVTGELEKLAMHD
>Hvrn.contig8|GeneMark.hmm|gene 11|105_aa
MGYLDGTMAEPPAVLTTETDVAGKKEISSTPNPAHVLWYTQDQQVLTFLLASLSRDVLLQ
VHSLASATGVWTAIQQMFASHSRARHIQLRGQLGNTKKGDSPVAI
>Hvrn.contig8|GeneMark.hmm|gene 12|144_aa
MVELEEEDDMSMEEVALMTNNSNYLIILIRPGKGVWLPKPDTAPFNLFIDIVFLQGKLYG
ITQAEDLASVSIDFDDCGMPTVTTVERLIKHPPLESCEFDVWSDAGEKLEADGDMGDEDQ
VENGGEDHDEALNEVDARIQKENR
>Hvrn.contig8|GeneMark.hmm|gene 13|300_aa
MSTATSLWDKAALMMREELAVAAVVAGCLDMTKLYVVGAGMFSCVTVALYPVSVIKTRMQ
VASGEAMRRNALATFKNILKVDGVPGLYRGFGTVITGAIPARIIFLTALEKTKATSLKLV
EPLQLSESMEAALANGLGGLTASLCSQAVFVPIDVVSQKLMVQGYSGHVRYKGGIDVVQK
IMKADGPRGLYRGFGLSVMTALGRLDDKEDTPSQLKIVGVQATGGMVAGATSLEDNPLSD
NVPQFAETSSAGSPLEKERVRQRASATISVTRDCQCSRRPTIGGVRQLGRSLPMRRDGAT