Bioinformatics Analysis of Human Myoglobin Gene Exon 1
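Nucleic acid analysis: the human myoglobin nucleotide sequence was first retrieved from the PubMed (NCBI) nucleotide database and then analyzed for basic sequence properties, restriction enzyme sites (restriction mapping), and the chromosomal location of the gene.
The specific procedures followed the "Help" files of each software package mentioned above; the chromosomal localization followed the instructions given on the web page.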
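The restriction mapping step can be illustrated with a minimal sketch. The source does not say which software or enzyme set was used, so the four enzymes below (with their standard recognition sequences) and the helper name `find_sites` are assumptions chosen for illustration. The two fragments are bases 1-60 and 1501-1560 of the M10090 sequence listed in the Results.

```python
# Recognition sequences of a few common enzymes (illustrative selection,
# not the enzyme set used in the original analysis).
SITES = {
    "RsaI": "GTAC",
    "HindIII": "AAGCTT",
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
}


def find_sites(seq, sites=SITES, offset=1):
    """Return {enzyme: [start positions]} for every recognition site found.

    Positions are numbered from `offset`, so a fragment cut out of a longer
    record can keep the record's own coordinates.
    """
    seq = seq.upper()
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start + offset)
            start = seq.find(motif, start + 1)  # allow overlapping matches
        if positions:
            hits[enzyme] = positions
    return hits


# Bases 1-60 and 1501-1560 of the M10090 record.
fragment_1 = "gtactgtattttcattcctcttagttatctccctaaaaagactctgagttccttgaacac"
fragment_1501 = "agccggggcacctggtggccaagcttagaaacatgacaggtcctcttgggagggctgacc"

print(find_sites(fragment_1))
print(find_sites(fragment_1501, offset=1501))
```

All four recognition sequences are palindromic, so scanning one strand is sufficient here; a non-palindromic site would also need a scan of the reverse complement.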
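Protein analysis: the sequences of human myoglobin and of the complete precursor myoglobin protein were retrieved from the PubMed (NCBI) protein database and analyzed for basic properties (amino acid composition, molecular weight, isoelectric point), hydrophilicity/hydrophobicity, protease cleavage sites, signal peptide sequence, secondary and three-dimensional structure, and homologous sequences and evolution.
Secondary structure analysis covered structural and functional domains, motifs, and conserved sites; the specific procedures followed the "Help" files of each software package mentioned above and the instructions given on the web pages.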
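As an illustration of the hydrophilicity/hydrophobicity analysis, the sketch below computes a sliding-window hydropathy profile. The source does not name the scale or window size; the Kyte-Doolittle scale and a window of 9 residues are assumptions, and `hydropathy_profile` is a hypothetical helper name. The input is the first 31 residues of the annotated myoglobin translation.

```python
# Kyte-Doolittle hydropathy values (positive = hydrophobic).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(protein, window=9):
    """Mean Kyte-Doolittle score of each window of `window` residues."""
    scores = [KD[aa] for aa in protein.upper()]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(scores) - window + 1)
    ]


# First 31 residues of the myoglobin translation in the GenBank record.
n_terminal = "MGLSDGEWQLVLNVWGKVEADIPGHGQEVLI"
profile = hydropathy_profile(n_terminal)

for position, score in enumerate(profile, start=1):
    print(f"window starting at residue {position}: {score:+.2f}")
```

Windows with clearly positive means point to hydrophobic stretches; the window size trades resolution against noise and would need tuning for a real analysis.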
Results:
1. The following information on the myoglobin molecule was obtained from NCBI:
Human myoglobin gene, exon 1
GenBank:
LOCUS       HUMMGI1      2552 bp    DNA     linear   PRI 07-JAN-1995
DEFINITION  Human myoglobin gene, exon 1.
ACCESSION   M10090
VERSION     GI:187580
KEYWORDS    Fok family repetitive sequence; direct repeat; myoglobin;
            repeat region; tandem repeat.
SEGMENT     1 of
SOURCE      Homo sapiens (human)
  ORGANISM
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini;
            Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 2552)
  AUTHORS   Akaboshi,E.
  TITLE     Cloning of the human myoglobin gene
  JOURNAL   Gene 33 (3), 241-249 (1985)
  PUBMED
COMMENT     Original source text: Human: fetal liver DNA (library of Lawn et
            al.), clone lambda-HMB151; delta-thalassemia leukocyte DNA (library
            of Kimura et al.), clone lambda-HMB159.
            Draft entry and computer-readable copy of sequence [1] kindly
            provided by, 22-OCT-1985.
            A tandem repeat sequence is found at position 142-794. Position of
            the mRNA cap site was deduced from that of the seal myoglobin gene
            (Blanchetot et al. 1983).
FEATURES             Location/Qualifiers
     source          1..2552
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:"
                     /map="
                     1100..1750
                     /note="Fok family repetitive element"
                     1895..>2552
                     myoglobin mRNA and intron"
                     join(1965..2552,:1..2671,:1..501)
                     /gene="MB"
                     join(1965..2059,:1415..1637,:355..501)
                     myoglobin"
                     /codon_start=1
                     /protein_id="GI:386872"
                     /translation="MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKF
                     DKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPV
                     KYLEFISECIIQVLQSKHPGDFGADAEGAMNKALELFRKDMASNYKELGFQG"
                     <1965..2059
                     myoglobin; G00-119-378"
                     /number=1
                     2060..>
                     myoglobin intron A"
ORIGIN      2 bp upstream of RsaI site.
1gtactgtattttcattcctcttagttatctccctaaaaagactctgagttccttgaacac
61aggaaggtgttttatttgattttgttatcctcagcatgtagcagtgtctgacacacagta
121ggtgctctatcactgtgagagggatggatggatgggtggagttacagatggatagaagga
181tagatggagggatgggtggatgatggatggatagatggatggaggggggatgatgaatgg
241agggataatgagtggatgaatgagggaatgggtggatggatggatggagggatggaggaa
301cagatagatagatggagggatgggtgggtgatggatggatagatggatggagggagggat
361gatgaatggagggataatgaatggatgaatgaggggatgggtggatggatgaatggaggg
421atgatgggtggatgaatgaattgagggatggatggatgaacacatggatggatggataga
481tggatagatggaggaactggtggattttggatggatgggtggatggatagatgaatgaat
541gcctggatagacaaagagatgatggatagatgaatagatgaattaagggatgtcggatag
601atggagggattgatagatgttggatggatgggtggtggatggatagatgagtgaatgcat
661ggatagacaaagagatgatggatggatgaattaagggatgacagatggatggatggatga
721gtaactggatggacaagtggataaatggatagatggttgaatacctgaatggattgaagg
781aggatgcatggatgtaagataaggctaatcatcctccactctctttctttgcaaaaccat
841ccacccatttactcaataaacatttattcagttcaaacttggcacaaagcaccatgtgag
901gcccaagagatacgtgggttaataaaacagagctcctgccctcctgaaaactgcaaagaa
961aggggcgtggcttcctgagttcaaatcccaactctgccagcgactagctgtacatcagtg
1021atgtttccctactttctctcaattaaatagggataatgtcagtacctatcacattgggag
1081gtcttgcggggattaaatgagttaccaaatgccaagtgtttgggacagggcctggcaccc
1141agcaaagtctcttgtgagtgctggctgctattatcctaatggagaagatggcatgaaaac
1201caggaaataggatgccctttgggaagcaatgcaacaggaacttacacaaagaaaggaaag
1261gaggaagcaattagtggtgtctcaaaggagtatgtcaagaaaaacttttcagagggaaac
1321ctttgagcagggtcatgaaaacaggagttctctaagagattgtggacttgcctgggacca
1381cctggctataagcacaaaaccatccggttcctttctgtcacttctggcgggtgaggggtc
1441tctggcaaaggggcagaaggtgcgtgagaggttgcgaatggccaggactgtcctggggcc
1501agccggggcacctggtggccaagcttagaaacatgacaggtcctcttgggagggctgacc
1561gcagggagcgttgggtttcaggctgctggcgtcggcttctgtggtgccctttctgtcggc
1621tatgagagtccagacagtgcccaacctcctccccttctttccacacgcacaaccacccca
1681ccccctgtggcctgagctgtcctgcctcgccacaatggcacctgccctaaaatagcttcc
1741catgtgagggctagagaaaggaaaagattagaccctccctggatgagagagagaaagtga
1801aggagggcaggggagggggacagcgagccattgagcgatctttgtcaagcatcccagaag
1861gtataaaaacgcccttgggaccaggcagcctcaaaccccagctgttggggccaggacacc
1921cagtgagcccatacttgctctttttgtcttcttcagactgcgccatggggctcagcgacg
1981gggaatggcagttggtgctgaacgtctgggggaaggtggaggctgacatcccaggccatg
2041ggcaggaagtcctcatcaggtaaaaggaagagattccattgcccctgccacccacaccct
2101aagatcaagggtgttcagctgcaaggtggaaagtttgcacgtggggtaggtcagttggct
2161gcattagttaagggtgttagaacggtcacttgctttttctttgcttttaagtgtcaggga
2221ttggactcaggagagggaaaggagccatttcaggctgatgtcagcagctggaggaagcat
2281gagaatcaaacctaggatgctcagagtccaccaggaagaattttagaattatagacagtc
2341agagttaacaagggtcctgagagattttgtacagccacctctcttacaggatgaggacaa
2401aaagcgactgagaaggggaggacatttccagagtcacagctcattaaatgctcttaaagt
2461gtcaaggttaagacatgctcttcaaggggagacagatctggttctagacttggctctgcc
2521actgagccactgggtgacctttgggaaggtac
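The coding part of exon 1 (positions 1965..2059 in the record above) can be checked against the annotated translation with a short sketch. The function name `translate` and the way the codon table is built are illustrative choices; the table itself is the standard genetic code. The 95 bases give 31 complete codons, and the last two bases (ag) start a codon that is completed in the next exon.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons from the first base; ignore trailing bases."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Positions 1965..2059 of the M10090 sequence, copied from the ORIGIN block.
EXON1_CDS = (
    "atggggctcagcgacg"                                              # 1965-1980
    "gggaatggcagttggtgctgaacgtctgggggaaggtggaggctgacatcccaggccatg"  # 1981-2040
    "ggcaggaagtcctcatcag"                                           # 2041-2059
)

# Start of the /translation qualifier in the record.
ANNOTATED_START = "MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKF"

peptide = translate(EXON1_CDS)
print(len(EXON1_CDS), peptide)
print(ANNOTATED_START.startswith(peptide))
```

Agreement between the translated exon and the start of the annotated protein is a quick sanity check that the coordinates were read correctly.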
Official Symbol: MB
Official Full Name: myoglobin
Gene type: protein coding
RefSeq status: REVIEWED
Lineage: Homo
Also known as: PVALB
Sequence: Chromosome 22; (.., complement)
2. Primers for the myoglobin nucleotide sequence were designed with the Primer software, with the following results:
Sense primer: 5' TACTCGTGCCTTTCGTACTAGGCTC 3'
Anti-sense primer: 5' ATGAGCACGGAAAGCATGATCCGAG 3'
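Basic properties of the two listed primers can be checked with a small sketch. The source does not report which criteria the Primer software applied, so the quantities computed here (length, GC fraction, complement and reverse complement) and the helper names are assumptions for illustration only.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_content(primer):
    """Fraction of G and C bases in the primer."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def complement(seq):
    """Base-by-base complement, read in the same direction as the input."""
    return seq.upper().translate(_COMPLEMENT)


def reverse_complement(seq):
    """Sequence of the opposite strand, written 5' to 3'."""
    return complement(seq)[::-1]


sense = "TACTCGTGCCTTTCGTACTAGGCTC"
antisense = "ATGAGCACGGAAAGCATGATCCGAG"

for name, primer in (("sense", sense), ("anti-sense", antisense)):
    print(f"{name}: length {len(primer)}, GC {gc_content(primer):.2f}")

print("complement of sense:        ", complement(sense))
print("reverse complement of sense:", reverse_complement(sense))
```

For the two sequences as listed, `complement(sense)` equals the anti-sense string, so the pair as printed are base-by-base complements of each other; whether that reflects the intended primer orientation is not stated in the source.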
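3. Protein sequence analysis
The ExPASy software suite was used to analyze the amino acid composition, molecular weight, isoelectric point, and phosphorylation sites of the gene product.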
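The composition and molecular weight part of this step can be approximated with a minimal sketch. This is not the ExPASy implementation: it simply counts residues and sums standard average residue masses plus one water molecule, and the helper names are hypothetical. The isoelectric point and phosphorylation analysis are not covered. The input is the /translation sequence from the GenBank record.

```python
from collections import Counter

# Average residue masses in Da (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594,
    "N": 114.1038, "D": 115.0886, "Q": 128.1307, "K": 128.1741,
    "E": 129.1155, "M": 131.1926, "H": 137.1411, "F": 147.1766,
    "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.01528


def composition(protein):
    """Count of each amino acid in the sequence."""
    return Counter(protein.upper())


def molecular_weight(protein):
    """Approximate average molecular weight of the unmodified chain in Da."""
    return sum(RESIDUE_MASS[aa] for aa in protein.upper()) + WATER


# /translation qualifier of the M10090 record.
MYOGLOBIN = (
    "MGLSDGEWQLVLNVWGKVEADIPGHGQEVLIRLFKGHPETLEKF"
    "DKFKHLKSEDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPV"
    "KYLEFISECIIQVLQSKHPGDFGADAEGAMNKALELFRKDMASNYKELGFQG"
)

counts = composition(MYOGLOBIN)
for aa, n in sorted(counts.items(), key=lambda item: -item[1]):
    print(f"{aa}: {n} ({100 * n / len(MYOGLOBIN):.1f}%)")
print(f"length: {len(MYOGLOBIN)} residues")
print(f"approximate molecular weight: {molecular_weight(MYOGLOBIN):.1f} Da")
```

The result is for the unmodified chain including the initiator methionine, so it may differ slightly from values reported by dedicated tools.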