2024年3月14日发(作者:勇嘉澍)
细胞色素c序列查找和分析
1 登陆NCBI网站,查找关于细胞色素C相关的蛋白的序列,选
取了human,rat,yeast,drosophila等14个物种的细胞色素C蛋白序
列,制订成表格,如下:
NO.
1
ACC NO Organism
AAA28437 fruit fly
Protein sequences
2 AAA21711 Rattus
norvegicus
3 Homo sapiens Homo sapiens
4 P00006 Bos taurus
5 CAA25046 Gallus gallus
6 S11172 yeast
7 AAC80552 Tigriopus
californicus
8 CCSF s tarfish
9 CCCA common carp
1 mgvpagdvek gkklfvqrca qchtveaggk
hkvgpnlhgl igrktgqaag faytdankak
gitwnedtlf eylenpkkyi pgtkmifagl
kkpnergdli aylksatk
1 mgdvekgkki fvqkcaqcht vekggkhktg
pnlhglfgrk tgqaagfsyt danknkgitw
gedtlmeyle npkkyipgtk mifagikkkg
eradliaylk katne
1 mgdvekgkki fimkcsqcht vekggkhktg
pnlhglfgrk tgqapgysyt aanknkgiiw
gedtlmeyle npkkyipgtk mifvgikkke
eradliaylk katne
1 gdvekgkkif vqkcaqchtv ekggkhktgp
nlhglfgrkt gqapgfsytd anknkgitwg
eetlmeylen pkkyipgtkm ifagikkkge
redliaylkk atne
1 mgdiekgkki fvqkcsqcht vekggkhktg
pnlhglfgrk tgqaegfsyt danknkgitw
gedtlmeyle npkkyipgtk mifagikkks
ervdliaylk datsk
1 mpyapgdekk gaslfktrca qchtvekgga
nkvgpnlhgv fgrktgqaeg fsyteanrdk
gitwdeetlf aylenpkkyi pgtkmafagf
kkpadrnnvi tylkkatse
1 mgdidkgkki fvqkctqcht ieaggkhkvg
pnlhgmygrq tgkaagysyt dankskgvtw
neetldiylt npkkyipgtk mvfaglkkkg
dredliaylk sasss
1 gqvekgkkif vqrcaqchtv ekagkhktgp
nlngilgrkt gqaagfsytd anrnkgitwk
netlfeylen pkkyipgtkm vfaglkkqke
rqdliaylea atk
1 gdvekgkkvf vqkcaqchtv zbggkhkvgp
nlwglfgrkt gqapgfsytb abkskgivwb
zztlmeylzb pkkyipgtkm ifagikkkge
10 CCHOZ common zebra
11 AAL67777 Actinobacillus
lignieresii
12 CCHOD donkey
13 AAB86817 Pichia stipitis
14 CAA25899 Mus musculus
radliaylks ats
1 gdvekgkkif vqkcaqchtv ekggkhktgp
nlhglfgrkt gqapgfsytd anknkgitwk
eetlmeylen pkkyipgtkm ifagikkkte
redliaylkk atne
1 mtkllqkiaf ilplvfslva xaemvdtfqf
qnetdrvrav alakslrcpq cqnqnlvesn
attayklrle vyemvnqgkt deeiikimte
rfghfvnykp pfna
1 gdvekgkkif vqkcaqchtv ekggkhktgp
nlhglfgrkt gqapgfsytd anknkgitwk
eetlmeylen pkkyipgtkm ifagikkkte
redliaylkk atne
1 mpapfekgse kkgatlfktr clqchtveeg
gphkvgpnlh gimgrksgqa vgysytdank
kkgvewseqtmsdylenpkkyipgtkmafg
glkkpkdrnd lvtylasatk
1 mgdvekgkki fvqkcaqcht vekggkhktg
pnlhglfgrk tgqaagfsyt danknkgitw
gedtlmeyle npkkyipgtk mifagikkkg
eradliaylk katne
2 将所查找的序列作成fasta格式的文本文档。
3 选取第二条序列(
AAA21711
)为代表,进行蛋白质一级,二级,
三级结构的预测
a.一级结构用的是
/tools/,结果如下:
User-provided sequence:
1 11 21 31 41 51
| | | | | |
1 MGDVEKGKKI FIMKCSQCHT VEKGGKHKTG PNLHGLFGRK TGQAPGYSYT AANKNKGIIW
60
61 GEDTLMEYLE NPKKYIPGTK MIFVGIKKKE ERADLIAYLK KATNE
References and documentation are available.
2024年3月14日发(作者:勇嘉澍)
细胞色素c序列查找和分析
1 登陆NCBI网站,查找关于细胞色素C相关的蛋白的序列,选
取了human,rat,yeast,drosophila等14个物种的细胞色素C蛋白序
列,制订成表格,如下:
NO.
1
ACC NO Organism
AAA28437 fruit fly
Protein sequences
2 AAA21711 Rattus
norvegicus
3 Homo sapiens Homo sapiens
4 P00006 Bos taurus
5 CAA25046 Gallus gallus
6 S11172 yeast
7 AAC80552 Tigriopus
californicus
8 CCSF s tarfish
9 CCCA common carp
1 mgvpagdvek gkklfvqrca qchtveaggk
hkvgpnlhgl igrktgqaag faytdankak
gitwnedtlf eylenpkkyi pgtkmifagl
kkpnergdli aylksatk
1 mgdvekgkki fvqkcaqcht vekggkhktg
pnlhglfgrk tgqaagfsyt danknkgitw
gedtlmeyle npkkyipgtk mifagikkkg
eradliaylk katne
1 mgdvekgkki fimkcsqcht vekggkhktg
pnlhglfgrk tgqapgysyt aanknkgiiw
gedtlmeyle npkkyipgtk mifvgikkke
eradliaylk katne
1 gdvekgkkif vqkcaqchtv ekggkhktgp
nlhglfgrkt gqapgfsytd anknkgitwg
eetlmeylen pkkyipgtkm ifagikkkge
redliaylkk atne
1 mgdiekgkki fvqkcsqcht vekggkhktg
pnlhglfgrk tgqaegfsyt danknkgitw
gedtlmeyle npkkyipgtk mifagikkks
ervdliaylk datsk
1 mpyapgdekk gaslfktrca qchtvekgga
nkvgpnlhgv fgrktgqaeg fsyteanrdk
gitwdeetlf aylenpkkyi pgtkmafagf
kkpadrnnvi tylkkatse
1 mgdidkgkki fvqkctqcht ieaggkhkvg
pnlhgmygrq tgkaagysyt dankskgvtw
neetldiylt npkkyipgtk mvfaglkkkg
dredliaylk sasss
1 gqvekgkkif vqrcaqchtv ekagkhktgp
nlngilgrkt gqaagfsytd anrnkgitwk
netlfeylen pkkyipgtkm vfaglkkqke
rqdliaylea atk
1 gdvekgkkvf vqkcaqchtv zbggkhkvgp
nlwglfgrkt gqapgfsytb abkskgivwb
zztlmeylzb pkkyipgtkm ifagikkkge
10 CCHOZ common zebra
11 AAL67777 Actinobacillus
lignieresii
12 CCHOD donkey
13 AAB86817 Pichia stipitis
14 CAA25899 Mus musculus
radliaylks ats
1 gdvekgkkif vqkcaqchtv ekggkhktgp
nlhglfgrkt gqapgfsytd anknkgitwk
eetlmeylen pkkyipgtkm ifagikkkte
redliaylkk atne
1 mtkllqkiaf ilplvfslva xaemvdtfqf
qnetdrvrav alakslrcpq cqnqnlvesn
attayklrle vyemvnqgkt deeiikimte
rfghfvnykp pfna
1 gdvekgkkif vqkcaqchtv ekggkhktgp
nlhglfgrkt gqapgfsytd anknkgitwk
eetlmeylen pkkyipgtkm ifagikkkte
redliaylkk atne
1 mpapfekgse kkgatlfktr clqchtveeg
gphkvgpnlh gimgrksgqa vgysytdank
kkgvewseqtmsdylenpkkyipgtkmafg
glkkpkdrnd lvtylasatk
1 mgdvekgkki fvqkcaqcht vekggkhktg
pnlhglfgrk tgqaagfsyt danknkgitw
gedtlmeyle npkkyipgtk mifagikkkg
eradliaylk katne
2 将所查找的序列作成fasta格式的文本文档。
3 选取第二条序列(
AAA21711
)为代表,进行蛋白质一级,二级,
三级结构的预测
a.一级结构用的是
/tools/,结果如下:
User-provided sequence:
1 11 21 31 41 51
| | | | | |
1 MGDVEKGKKI FIMKCSQCHT VEKGGKHKTG PNLHGLFGRK TGQAPGYSYT AANKNKGIIW
60
61 GEDTLMEYLE NPKKYIPGTK MIFVGIKKKE ERADLIAYLK KATNE
References and documentation are available.