Amino acid dipeptide frequency for Wenzhou Crab Virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 would therefore appear with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.

All values are given in per mille (‰), so they must be multiplied by 10⁻³ to obtain fractions.
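The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names are hypothetical, sequences are assumed to be given in one-letter codes (the table uses three-letter codes), and dipeptides are assumed to be counted as overlapping adjacent pairs within each protein. The database's exact counting convention is not stated on this page.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping adjacent pairs are counted within each
    protein, and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map any non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    symbols = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(symbols, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"   # shown in red in the table
    if freq_permille < expected:
        return "under"  # shown in blue in the table
    return "expected"


if __name__ == "__main__":
    # Toy input, not real Wenzhou Crab Virus 1 sequences.
    freqs = dipeptide_permille(["AAAC", "ACB"])
    print(freqs["AA"], classify(freqs["AA"]))
```

With this convention a value from the table such as 5.593‰ corresponds to a fraction of 0.005593, which is above the uniform expectation of 2.5‰.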

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (frequency ± error), grouped by first residue. Second residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.593 ± 1.804
AlaCys: 0.746 ± 0.409
AlaAsp: 2.237 ± 1.226
AlaGlu: 3.356 ± 1.235
AlaPhe: 2.61 ± 0.657
AlaGly: 2.237 ± 0.72
AlaHis: 4.474 ± 0.613
AlaIle: 5.966 ± 0.593
AlaLys: 1.864 ± 0.571
AlaLeu: 6.339 ± 0.487
AlaMet: 1.864 ± 0.27
AlaAsn: 1.491 ± 0.46
AlaPro: 2.983 ± 0.749
AlaGln: 2.983 ± 2.137
AlaArg: 2.61 ± 1.155
AlaSer: 8.203 ± 1.523
AlaThr: 2.61 ± 0.88
AlaVal: 3.729 ± 1.127
AlaTrp: 0.746 ± 0.368
AlaTyr: 3.729 ± 1.085
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.864 ± 0.571
CysCys: 0.746 ± 0.409
CysAsp: 1.491 ± 0.736
CysGlu: 1.491 ± 0.736
CysPhe: 0.373 ± 0.204
CysGly: 1.864 ± 1.022
CysHis: 0.373 ± 0.204
CysIle: 1.119 ± 0.613
CysLys: 1.864 ± 0.27
CysLeu: 1.491 ± 0.466
CysMet: 1.119 ± 0.864
CysAsn: 1.491 ± 0.375
CysPro: 2.237 ± 0.692
CysGln: 1.119 ± 0.864
CysArg: 0.746 ± 1.017
CysSer: 1.864 ± 0.649
CysThr: 0.746 ± 0.5
CysVal: 1.119 ± 0.31
CysTrp: 0.373 ± 0.508
CysTyr: 1.491 ± 0.375
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.237 ± 0.667
AspCys: 1.491 ± 0.466
AspAsp: 3.729 ± 0.794
AspGlu: 1.864 ± 0.27
AspPhe: 2.61 ± 0.212
AspGly: 2.237 ± 0.692
AspHis: 0.746 ± 0.409
AspIle: 2.983 ± 0.92
AspLys: 1.864 ± 0.571
AspLeu: 8.949 ± 2.011
AspMet: 1.491 ± 0.817
AspAsn: 3.356 ± 1.501
AspPro: 2.983 ± 1.073
AspGln: 0.746 ± 1.252
AspArg: 3.729 ± 0.655
AspSer: 4.474 ± 1.385
AspThr: 4.101 ± 1.024
AspVal: 4.474 ± 0.779
AspTrp: 0.746 ± 0.368
AspTyr: 1.119 ± 0.31
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.356 ± 0.805
GluCys: 1.864 ± 0.518
GluAsp: 2.237 ± 0.876
GluGlu: 5.593 ± 1.553
GluPhe: 2.61 ± 0.88
GluGly: 4.101 ± 1.158
GluHis: 1.491 ± 0.375
GluIle: 2.61 ± 0.741
GluLys: 3.356 ± 1.264
GluLeu: 2.61 ± 0.474
GluMet: 0.373 ± 0.626
GluAsn: 3.356 ± 1.501
GluPro: 1.864 ± 0.571
GluGln: 1.491 ± 0.375
GluArg: 2.61 ± 1.118
GluSer: 4.101 ± 0.996
GluThr: 5.22 ± 1.102
GluVal: 2.237 ± 0.692
GluTrp: 1.491 ± 0.375
GluTyr: 2.237 ± 0.692
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.491 ± 0.817
PheCys: 2.237 ± 0.667
PheAsp: 1.491 ± 1.231
PheGlu: 1.864 ± 0.918
PhePhe: 1.119 ± 0.31
PheGly: 1.864 ± 1.875
PheHis: 0.746 ± 0.409
PheIle: 0.0 ± 0.0
PheLys: 3.729 ± 0.539
PheLeu: 5.593 ± 1.948
PheMet: 0.373 ± 0.204
PheAsn: 0.746 ± 0.409
PhePro: 1.491 ± 0.466
PheGln: 2.237 ± 1.226
PheArg: 1.491 ± 0.817
PheSer: 3.356 ± 0.593
PheThr: 2.61 ± 1.155
PheVal: 2.983 ± 0.92
PheTrp: 0.746 ± 0.368
PheTyr: 0.746 ± 0.409
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 5.966 ± 0.791
GlyCys: 2.237 ± 0.62
GlyAsp: 4.474 ± 1.199
GlyGlu: 4.474 ± 0.495
GlyPhe: 4.474 ± 1.664
GlyGly: 5.22 ± 1.561
GlyHis: 1.491 ± 0.736
GlyIle: 4.101 ± 0.655
GlyLys: 2.237 ± 0.72
GlyLeu: 7.457 ± 3.298
GlyMet: 1.491 ± 1.029
GlyAsn: 0.746 ± 0.409
GlyPro: 4.101 ± 1.284
GlyGln: 1.119 ± 0.613
GlyArg: 5.22 ± 0.272
GlySer: 3.729 ± 1.036
GlyThr: 2.983 ± 1.473
GlyVal: 4.474 ± 0.613
GlyTrp: 0.373 ± 0.508
GlyTyr: 2.61 ± 0.212
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.119 ± 0.31
HisCys: 0.0 ± 0.0
HisAsp: 1.491 ± 0.817
HisGlu: 1.491 ± 0.736
HisPhe: 1.119 ± 0.613
HisGly: 2.237 ± 0.667
HisHis: 0.373 ± 0.508
HisIle: 1.864 ± 0.518
HisLys: 1.119 ± 0.438
HisLeu: 2.61 ± 0.212
HisMet: 0.746 ± 0.409
HisAsn: 0.373 ± 0.204
HisPro: 1.119 ± 0.659
HisGln: 0.373 ± 0.204
HisArg: 2.61 ± 1.43
HisSer: 1.864 ± 1.534
HisThr: 2.61 ± 1.697
HisVal: 1.491 ± 0.736
HisTrp: 0.373 ± 0.204
HisTyr: 0.746 ± 0.409
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.101 ± 0.996
IleCys: 2.61 ± 0.88
IleAsp: 3.356 ± 2.606
IleGlu: 3.729 ± 0.655
IlePhe: 1.119 ± 0.613
IleGly: 3.729 ± 1.841
IleHis: 0.746 ± 0.5
IleIle: 4.474 ± 1.241
IleLys: 5.22 ± 1.011
IleLeu: 4.474 ± 1.851
IleMet: 2.61 ± 0.892
IleAsn: 1.119 ± 0.31
IlePro: 2.983 ± 0.296
IleGln: 2.237 ± 0.131
IleArg: 5.966 ± 0.936
IleSer: 2.61 ± 1.43
IleThr: 3.356 ± 1.991
IleVal: 1.864 ± 1.054
IleTrp: 0.373 ± 0.508
IleTyr: 2.983 ± 0.749
IleXaa: 0.0 ± 0.0

Lys
LysAla: 2.61 ± 1.118
LysCys: 0.373 ± 0.204
LysAsp: 4.101 ± 0.996
LysGlu: 4.101 ± 0.371
LysPhe: 1.119 ± 0.864
LysGly: 3.729 ± 0.539
LysHis: 0.746 ± 0.368
LysIle: 2.237 ± 0.692
LysLys: 1.864 ± 0.918
LysLeu: 6.711 ± 1.853
LysMet: 1.491 ± 0.462
LysAsn: 3.729 ± 1.127
LysPro: 1.491 ± 0.375
LysGln: 1.119 ± 0.438
LysArg: 4.101 ± 0.655
LysSer: 1.864 ± 0.918
LysThr: 2.237 ± 0.62
LysVal: 4.101 ± 1.284
LysTrp: 1.864 ± 0.649
LysTyr: 1.864 ± 0.571
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 7.457 ± 1.144
LeuCys: 1.119 ± 0.31
LeuAsp: 7.83 ± 2.467
LeuGlu: 5.966 ± 1.469
LeuPhe: 1.864 ± 0.918
LeuGly: 8.949 ± 0.881
LeuHis: 4.101 ± 0.371
LeuIle: 6.711 ± 0.392
LeuLys: 7.457 ± 0.885
LeuLeu: 11.186 ± 2.323
LeuMet: 1.491 ± 0.757
LeuAsn: 3.729 ± 1.312
LeuPro: 4.474 ± 0.613
LeuGln: 2.983 ± 0.628
LeuArg: 2.237 ± 0.692
LeuSer: 10.44 ± 0.848
LeuThr: 5.22 ± 1.528
LeuVal: 2.61 ± 1.009
LeuTrp: 2.237 ± 1.226
LeuTyr: 4.847 ± 1.623
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.61 ± 0.741
MetCys: 0.746 ± 0.409
MetAsp: 1.119 ± 0.613
MetGlu: 1.491 ± 0.375
MetPhe: 2.237 ± 0.692
MetGly: 2.237 ± 0.62
MetHis: 0.0 ± 0.0
MetIle: 1.864 ± 0.518
MetLys: 2.237 ± 0.131
MetLeu: 2.61 ± 0.212
MetMet: 0.373 ± 0.626
MetAsn: 0.373 ± 0.204
MetPro: 0.746 ± 0.409
MetGln: 0.373 ± 0.204
MetArg: 1.119 ± 0.864
MetSer: 2.61 ± 0.212
MetThr: 2.983 ± 2.137
MetVal: 1.491 ± 1.0
MetTrp: 0.0 ± 0.0
MetTyr: 1.119 ± 0.613
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.119 ± 0.659
AsnCys: 1.119 ± 0.438
AsnAsp: 1.864 ± 0.649
AsnGlu: 1.119 ± 0.31
AsnPhe: 1.491 ± 1.721
AsnGly: 2.983 ± 0.949
AsnHis: 0.373 ± 0.204
AsnIle: 1.491 ± 0.46
AsnLys: 1.491 ± 0.817
AsnLeu: 3.356 ± 1.235
AsnMet: 1.119 ± 0.613
AsnAsn: 0.746 ± 0.368
AsnPro: 2.983 ± 0.395
AsnGln: 1.491 ± 0.466
AsnArg: 1.491 ± 0.375
AsnSer: 1.864 ± 0.27
AsnThr: 2.237 ± 0.131
AsnVal: 2.61 ± 0.741
AsnTrp: 1.491 ± 1.0
AsnTyr: 0.746 ± 0.409
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.237 ± 0.62
ProCys: 1.491 ± 1.369
ProAsp: 2.983 ± 1.073
ProGlu: 2.237 ± 0.72
ProPhe: 0.746 ± 0.5
ProGly: 2.237 ± 0.888
ProHis: 0.746 ± 0.86
ProIle: 5.593 ± 1.212
ProLys: 0.373 ± 0.204
ProLeu: 5.966 ± 0.154
ProMet: 0.746 ± 0.409
ProAsn: 1.864 ± 0.649
ProPro: 3.729 ± 1.323
ProGln: 3.356 ± 0.593
ProArg: 3.729 ± 1.841
ProSer: 3.729 ± 0.578
ProThr: 3.356 ± 0.593
ProVal: 4.101 ± 1.284
ProTrp: 0.746 ± 0.409
ProTyr: 1.491 ± 0.817
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.864 ± 0.27
GlnCys: 0.746 ± 0.368
GlnAsp: 2.61 ± 0.657
GlnGlu: 2.237 ± 1.5
GlnPhe: 2.61 ± 0.892
GlnGly: 2.61 ± 0.474
GlnHis: 0.746 ± 0.409
GlnIle: 0.746 ± 0.5
GlnLys: 2.983 ± 0.933
GlnLeu: 2.237 ± 0.72
GlnMet: 1.491 ± 0.466
GlnAsn: 1.119 ± 0.613
GlnPro: 1.491 ± 1.231
GlnGln: 0.746 ± 0.5
GlnArg: 2.237 ± 1.226
GlnSer: 2.237 ± 0.131
GlnThr: 2.983 ± 0.749
GlnVal: 1.119 ± 1.114
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.491 ± 0.375
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.847 ± 0.861
ArgCys: 0.373 ± 0.508
ArgAsp: 1.864 ± 0.27
ArgGlu: 3.729 ± 1.469
ArgPhe: 1.119 ± 0.864
ArgGly: 5.22 ± 1.606
ArgHis: 1.864 ± 0.649
ArgIle: 3.356 ± 0.186
ArgLys: 1.119 ± 0.864
ArgLeu: 4.847 ± 1.25
ArgMet: 1.119 ± 1.114
ArgAsn: 2.237 ± 0.888
ArgPro: 2.61 ± 0.212
ArgGln: 2.61 ± 0.882
ArgArg: 4.847 ± 1.261
ArgSer: 5.22 ± 1.314
ArgThr: 1.864 ± 0.518
ArgVal: 4.101 ± 1.158
ArgTrp: 0.373 ± 0.508
ArgTyr: 1.119 ± 1.114
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.847 ± 0.932
SerCys: 1.864 ± 0.649
SerAsp: 2.983 ± 1.074
SerGlu: 3.356 ± 0.593
SerPhe: 1.864 ± 0.649
SerGly: 5.966 ± 0.154
SerHis: 2.983 ± 1.473
SerIle: 4.847 ± 1.402
SerLys: 2.237 ± 0.131
SerLeu: 10.44 ± 1.404
SerMet: 3.356 ± 0.881
SerAsn: 2.983 ± 0.949
SerPro: 5.22 ± 0.272
SerGln: 3.356 ± 0.186
SerArg: 2.983 ± 1.349
SerSer: 6.339 ± 1.617
SerThr: 4.474 ± 1.334
SerVal: 4.847 ± 1.25
SerTrp: 1.491 ± 0.817
SerTyr: 3.729 ± 0.255
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.593 ± 2.377
ThrCys: 1.491 ± 1.369
ThrAsp: 4.101 ± 1.578
ThrGlu: 1.119 ± 0.613
ThrPhe: 2.983 ± 1.366
ThrGly: 3.356 ± 0.881
ThrHis: 1.491 ± 0.375
ThrIle: 4.847 ± 1.003
ThrLys: 4.101 ± 0.881
ThrLeu: 7.83 ± 1.453
ThrMet: 2.61 ± 1.155
ThrAsn: 1.119 ± 0.31
ThrPro: 1.864 ± 1.227
ThrGln: 1.491 ± 0.466
ThrArg: 2.983 ± 1.349
ThrSer: 4.474 ± 0.613
ThrThr: 1.864 ± 1.534
ThrVal: 2.983 ± 0.296
ThrTrp: 0.373 ± 0.204
ThrTyr: 1.864 ± 1.534
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.983 ± 0.974
ValCys: 1.119 ± 0.31
ValAsp: 3.356 ± 0.567
ValGlu: 3.356 ± 0.186
ValPhe: 2.983 ± 0.749
ValGly: 4.847 ± 1.003
ValHis: 0.373 ± 0.204
ValIle: 1.491 ± 0.375
ValLys: 2.61 ± 0.882
ValLeu: 4.847 ± 2.969
ValMet: 1.864 ± 0.571
ValAsn: 1.864 ± 0.571
ValPro: 5.593 ± 0.77
ValGln: 2.237 ± 0.692
ValArg: 1.491 ± 0.375
ValSer: 4.847 ± 0.527
ValThr: 4.474 ± 0.997
ValVal: 3.729 ± 0.578
ValTrp: 0.746 ± 0.409
ValTyr: 2.237 ± 1.342
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.864 ± 0.518
TrpCys: 0.746 ± 0.409
TrpAsp: 0.373 ± 0.204
TrpGlu: 1.119 ± 0.864
TrpPhe: 0.373 ± 0.204
TrpGly: 0.373 ± 0.204
TrpHis: 0.373 ± 0.204
TrpIle: 0.373 ± 0.204
TrpLys: 1.491 ± 0.736
TrpLeu: 1.119 ± 0.659
TrpMet: 1.119 ± 0.864
TrpAsn: 0.0 ± 0.0
TrpPro: 0.746 ± 0.409
TrpGln: 0.373 ± 0.508
TrpArg: 0.373 ± 0.204
TrpSer: 2.237 ± 1.104
TrpThr: 0.746 ± 0.5
TrpVal: 0.746 ± 0.409
TrpTrp: 0.373 ± 0.204
TrpTyr: 0.746 ± 0.409
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.61 ± 1.118
TyrCys: 1.864 ± 0.571
TyrAsp: 2.237 ± 0.72
TyrGlu: 1.119 ± 0.613
TyrPhe: 1.491 ± 0.46
TyrGly: 3.356 ± 0.567
TyrHis: 1.491 ± 1.0
TyrIle: 2.983 ± 0.296
TyrLys: 2.237 ± 0.62
TyrLeu: 2.237 ± 0.692
TyrMet: 1.119 ± 0.31
TyrAsn: 0.746 ± 0.368
TyrPro: 0.746 ± 0.368
TyrGln: 1.864 ± 1.022
TyrArg: 1.864 ± 0.865
TyrSer: 4.101 ± 0.423
TyrThr: 1.864 ± 0.518
TyrVal: 2.237 ± 0.131
TyrTrp: 0.746 ± 1.017
TyrTyr: 2.237 ± 2.065
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3 proteins (2683 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
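One way to read the note above is that whole proteins are resampled with replacement 100 times and the spread of the recomputed frequency is reported as the error. The sketch below follows that reading. It is an assumption, not the documented Proteome-pI procedure: the function names are hypothetical, the error is taken to be the sample standard deviation across replicates, and sequences are assumed to be in one-letter codes.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins):
    """Frequencies (per mille) of overlapping adjacent pairs within proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of a dipeptide frequency over bootstrap replicates.

    Each replicate draws len(proteins) whole proteins with replacement,
    so resampling happens at the protein level, not the residue level.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample).get(dipeptide, 0.0))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    # Toy input, not real Wenzhou Crab Virus 1 sequences.
    toy = ["MALLSAL", "MKLLGSS", "MSLAALL"]
    print(round(bootstrap_error(toy, "LL"), 3))
```

With only 3 proteins, as in this proteome, each replicate can contain the same protein several times, which may explain why some errors in the table are comparable in size to the frequencies themselves.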

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski