Amino acid dipeptide frequency for Chimpanzee associated porprismacovirus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
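The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_frequencies`, the one-letter alphabet constant, and the toy sequences are hypothetical and not taken from Proteome-pI, and dipeptides are counted as overlapping pairs within each sequence, which may differ from the procedure used for the table.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code (assumption: no Xaa handling here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs.

    Overlapping adjacent pairs are counted within each sequence separately.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(AMINO_ACIDS, repeat=2)}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


# Expected frequency if all 400 dipeptides were equally likely: 2.5 per mille.
expected_uniform = 1000.0 / len(AMINO_ACIDS) ** 2

# Hypothetical toy sequences, not real proteins from this proteome.
toy = ["MKTAYIAKQR", "GAVLIAK"]
freqs = dipeptide_frequencies(toy)
over_represented = sorted(dp for dp, f in freqs.items() if f > expected_uniform)
```

With real data, `over_represented` would correspond to the entries marked in red, and the pairs below `expected_uniform` to those marked in blue.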

All values are presented in per mille (‰); to obtain fractions they need to be multiplied by 10^-3.
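As a small worked example of the unit conversion, the snippet below turns a per mille value into a fraction and a percentage, and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the value 8.371‰ is the GlyLeu entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1%)."""
    return value * 0.1


gly_leu = 8.371                       # GlyLeu frequency from the table, in per mille
uniform = 1000.0 / 400                # 2.5 per mille if all 400 dipeptides were equally likely

fraction = per_mille_to_fraction(gly_leu)   # about 0.008371
percent = per_mille_to_percent(gly_leu)     # about 0.8371 %
enrichment = gly_leu / uniform              # ratio to the uniform expectation
```

An `enrichment` above 1 means the dipeptide is more frequent than the uniform expectation, and a value below 1 means it is underrepresented.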

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 1.288 ± 0.995
AlaCys: 0.644 ± 0.873
AlaAsp: 0.0 ± 0.0
AlaGlu: 1.288 ± 0.682
AlaPhe: 1.932 ± 1.104
AlaGly: 5.151 ± 2.081
AlaHis: 1.288 ± 0.808
AlaIle: 1.932 ± 1.197
AlaLys: 1.288 ± 0.712
AlaLeu: 5.151 ± 2.38
AlaMet: 2.576 ± 1.013
AlaAsn: 1.288 ± 1.052
AlaPro: 1.288 ± 1.188
AlaGln: 0.644 ± 0.497
AlaArg: 3.863 ± 0.966
AlaSer: 5.151 ± 1.538
AlaThr: 1.288 ± 0.575
AlaVal: 4.507 ± 2.029
AlaTrp: 0.0 ± 0.0
AlaTyr: 4.507 ± 1.868
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.288 ± 1.747
CysCys: 0.0 ± 0.0
CysAsp: 0.644 ± 0.873
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.644 ± 0.618
CysHis: 0.644 ± 0.497
CysIle: 1.932 ± 1.806
CysLys: 0.644 ± 0.511
CysLeu: 0.644 ± 0.873
CysMet: 0.0 ± 0.0
CysAsn: 0.644 ± 0.618
CysPro: 1.288 ± 1.052
CysGln: 0.644 ± 0.724
CysArg: 0.0 ± 0.0
CysSer: 3.22 ± 0.89
CysThr: 0.0 ± 0.0
CysVal: 0.644 ± 0.724
CysTrp: 0.0 ± 0.0
CysTyr: 0.644 ± 0.497
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.863 ± 1.217
AspCys: 0.0 ± 0.0
AspAsp: 0.0 ± 0.0
AspGlu: 2.576 ± 1.37
AspPhe: 3.22 ± 1.11
AspGly: 3.22 ± 1.313
AspHis: 0.644 ± 0.511
AspIle: 3.863 ± 2.045
AspLys: 2.576 ± 1.079
AspLeu: 3.22 ± 1.313
AspMet: 1.288 ± 0.995
AspAsn: 2.576 ± 0.848
AspPro: 3.22 ± 1.657
AspGln: 1.288 ± 1.188
AspArg: 3.863 ± 2.036
AspSer: 4.507 ± 1.691
AspThr: 3.863 ± 0.884
AspVal: 1.288 ± 1.022
AspTrp: 1.288 ± 0.679
AspTyr: 1.932 ± 1.017
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.288 ± 0.679
GluCys: 0.644 ± 0.618
GluAsp: 1.288 ± 0.682
GluGlu: 1.932 ± 1.2
GluPhe: 0.644 ± 0.497
GluGly: 2.576 ± 1.783
GluHis: 1.932 ± 1.482
GluIle: 2.576 ± 1.038
GluLys: 0.644 ± 0.511
GluLeu: 2.576 ± 1.363
GluMet: 1.932 ± 0.991
GluAsn: 2.576 ± 0.792
GluPro: 3.863 ± 1.84
GluGln: 2.576 ± 0.872
GluArg: 3.863 ± 1.617
GluSer: 2.576 ± 1.149
GluThr: 3.863 ± 0.966
GluVal: 2.576 ± 1.079
GluTrp: 0.0 ± 0.0
GluTyr: 0.644 ± 0.725
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.932 ± 1.492
PheCys: 0.644 ± 0.873
PheAsp: 1.288 ± 0.772
PheGlu: 1.288 ± 0.876
PhePhe: 0.644 ± 0.873
PheGly: 2.576 ± 0.872
PheHis: 2.576 ± 1.623
PheIle: 0.0 ± 0.0
PheLys: 1.288 ± 0.679
PheLeu: 1.932 ± 0.967
PheMet: 1.932 ± 0.944
PheAsn: 0.0 ± 0.0
PhePro: 1.288 ± 1.022
PheGln: 1.288 ± 0.575
PheArg: 3.22 ± 0.64
PheSer: 3.22 ± 1.505
PheThr: 1.288 ± 0.682
PheVal: 0.644 ± 0.725
PheTrp: 0.644 ± 0.497
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.932 ± 1.854
GlyCys: 1.932 ± 0.991
GlyAsp: 2.576 ± 1.149
GlyGlu: 0.644 ± 0.497
GlyPhe: 1.932 ± 0.945
GlyGly: 3.863 ± 1.738
GlyHis: 3.22 ± 1.621
GlyIle: 5.795 ± 1.412
GlyLys: 3.22 ± 1.849
GlyLeu: 8.371 ± 1.181
GlyMet: 1.288 ± 0.712
GlyAsn: 4.507 ± 1.711
GlyPro: 2.576 ± 1.089
GlyGln: 3.22 ± 2.017
GlyArg: 5.151 ± 2.786
GlySer: 4.507 ± 1.963
GlyThr: 1.288 ± 0.575
GlyVal: 2.576 ± 1.397
GlyTrp: 1.288 ± 0.995
GlyTyr: 4.507 ± 1.841
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 2.576 ± 1.193
HisGlu: 2.576 ± 1.519
HisPhe: 2.576 ± 1.053
HisGly: 5.151 ± 0.805
HisHis: 1.932 ± 0.967
HisIle: 3.22 ± 1.987
HisLys: 1.288 ± 0.971
HisLeu: 5.151 ± 2.149
HisMet: 0.0 ± 0.0
HisAsn: 0.644 ± 0.725
HisPro: 1.932 ± 0.967
HisGln: 1.288 ± 1.747
HisArg: 3.22 ± 1.937
HisSer: 1.288 ± 1.22
HisThr: 5.151 ± 1.786
HisVal: 1.288 ± 0.995
HisTrp: 0.644 ± 0.618
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.22 ± 1.619
IleCys: 1.288 ± 1.747
IleAsp: 2.576 ± 0.872
IleGlu: 2.576 ± 1.038
IlePhe: 2.576 ± 0.667
IleGly: 5.795 ± 2.619
IleHis: 4.507 ± 1.942
IleIle: 4.507 ± 2.965
IleLys: 3.22 ± 1.621
IleLeu: 6.439 ± 1.768
IleMet: 3.22 ± 1.293
IleAsn: 3.22 ± 1.506
IlePro: 6.439 ± 1.471
IleGln: 2.576 ± 1.363
IleArg: 9.659 ± 3.826
IleSer: 6.439 ± 1.995
IleThr: 2.576 ± 2.044
IleVal: 5.795 ± 2.034
IleTrp: 0.644 ± 0.618
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.288 ± 0.995
LysCys: 0.644 ± 0.724
LysAsp: 1.288 ± 0.894
LysGlu: 1.932 ± 0.604
LysPhe: 0.0 ± 0.0
LysGly: 5.795 ± 0.983
LysHis: 3.863 ± 0.898
LysIle: 4.507 ± 0.969
LysLys: 1.288 ± 0.995
LysLeu: 3.22 ± 1.335
LysMet: 1.288 ± 0.679
LysAsn: 0.644 ± 0.873
LysPro: 3.22 ± 1.452
LysGln: 0.644 ± 0.497
LysArg: 1.288 ± 1.022
LysSer: 2.576 ± 0.89
LysThr: 1.932 ± 0.967
LysVal: 1.288 ± 0.679
LysTrp: 1.932 ± 1.374
LysTyr: 1.288 ± 0.679
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 1.932 ± 0.985
LeuCys: 2.576 ± 1.038
LeuAsp: 3.863 ± 2.191
LeuGlu: 5.151 ± 1.344
LeuPhe: 2.576 ± 1.079
LeuGly: 3.22 ± 1.313
LeuHis: 3.22 ± 2.088
LeuIle: 5.795 ± 1.445
LeuLys: 3.22 ± 1.927
LeuLeu: 8.371 ± 2.711
LeuMet: 0.644 ± 0.873
LeuAsn: 5.151 ± 1.911
LeuPro: 5.151 ± 1.221
LeuGln: 3.22 ± 1.836
LeuArg: 7.727 ± 0.941
LeuSer: 5.795 ± 2.013
LeuThr: 2.576 ± 1.38
LeuVal: 8.371 ± 1.236
LeuTrp: 1.288 ± 1.447
LeuTyr: 3.863 ± 1.451
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.932 ± 0.593
MetCys: 0.0 ± 0.0
MetAsp: 1.288 ± 1.236
MetGlu: 1.288 ± 0.712
MetPhe: 1.288 ± 0.741
MetGly: 4.507 ± 1.235
MetHis: 0.644 ± 0.618
MetIle: 1.932 ± 0.604
MetLys: 0.0 ± 0.0
MetLeu: 3.22 ± 1.285
MetMet: 1.932 ± 1.017
MetAsn: 1.932 ± 1.492
MetPro: 0.0 ± 0.0
MetGln: 1.288 ± 0.894
MetArg: 1.932 ± 1.017
MetSer: 1.932 ± 1.482
MetThr: 5.151 ± 1.234
MetVal: 1.288 ± 0.995
MetTrp: 0.0 ± 0.0
MetTyr: 0.644 ± 0.724
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.576 ± 0.969
AsnCys: 0.0 ± 0.0
AsnAsp: 3.863 ± 0.966
AsnGlu: 0.644 ± 0.725
AsnPhe: 0.644 ± 0.497
AsnGly: 1.932 ± 0.945
AsnHis: 3.863 ± 1.934
AsnIle: 1.932 ± 1.2
AsnLys: 0.644 ± 0.724
AsnLeu: 3.863 ± 1.485
AsnMet: 3.22 ± 1.005
AsnAsn: 3.22 ± 1.849
AsnPro: 1.932 ± 1.112
AsnGln: 3.22 ± 0.85
AsnArg: 5.151 ± 2.145
AsnSer: 3.863 ± 1.724
AsnThr: 6.439 ± 2.528
AsnVal: 0.644 ± 0.497
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.576 ± 1.989
ProCys: 1.288 ± 0.924
ProAsp: 1.932 ± 0.945
ProGlu: 0.0 ± 0.0
ProPhe: 1.288 ± 0.924
ProGly: 1.932 ± 0.593
ProHis: 1.932 ± 1.172
ProIle: 2.576 ± 0.529
ProLys: 5.795 ± 1.893
ProLeu: 5.151 ± 3.056
ProMet: 3.863 ± 1.103
ProAsn: 0.644 ± 0.511
ProPro: 3.22 ± 1.052
ProGln: 1.932 ± 1.348
ProArg: 7.083 ± 3.258
ProSer: 5.795 ± 2.3
ProThr: 2.576 ± 1.637
ProVal: 3.22 ± 1.285
ProTrp: 0.0 ± 0.0
ProTyr: 1.932 ± 0.854
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.22 ± 1.073
GlnCys: 0.644 ± 0.724
GlnAsp: 0.644 ± 0.618
GlnGlu: 2.576 ± 0.744
GlnPhe: 0.644 ± 0.497
GlnGly: 1.932 ± 0.783
GlnHis: 0.644 ± 0.725
GlnIle: 3.22 ± 0.898
GlnLys: 0.0 ± 0.0
GlnLeu: 2.576 ± 1.389
GlnMet: 1.288 ± 1.45
GlnAsn: 2.576 ± 0.93
GlnPro: 0.644 ± 0.873
GlnGln: 1.932 ± 1.017
GlnArg: 5.151 ± 2.512
GlnSer: 4.507 ± 1.662
GlnThr: 1.932 ± 1.034
GlnVal: 3.863 ± 1.806
GlnTrp: 0.644 ± 0.873
GlnTyr: 1.932 ± 0.991
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.151 ± 3.36
ArgCys: 1.288 ± 0.679
ArgAsp: 7.727 ± 1.93
ArgGlu: 2.576 ± 1.006
ArgPhe: 2.576 ± 0.872
ArgGly: 3.863 ± 1.186
ArgHis: 0.644 ± 0.511
ArgIle: 9.659 ± 4.285
ArgLys: 4.507 ± 1.935
ArgLeu: 6.439 ± 1.774
ArgMet: 1.288 ± 0.697
ArgAsn: 1.932 ± 0.967
ArgPro: 6.439 ± 1.751
ArgGln: 3.22 ± 2.891
ArgArg: 5.795 ± 3.543
ArgSer: 7.083 ± 2.178
ArgThr: 6.439 ± 1.583
ArgVal: 3.863 ± 1.792
ArgTrp: 1.932 ± 1.854
ArgTyr: 3.22 ± 1.293
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.576 ± 2.827
SerCys: 0.0 ± 0.0
SerAsp: 4.507 ± 2.217
SerGlu: 4.507 ± 1.096
SerPhe: 1.932 ± 0.945
SerGly: 5.795 ± 2.342
SerHis: 2.576 ± 1.312
SerIle: 9.659 ± 3.989
SerLys: 1.288 ± 1.022
SerLeu: 7.083 ± 2.322
SerMet: 2.576 ± 0.835
SerAsn: 1.932 ± 1.104
SerPro: 1.932 ± 1.656
SerGln: 1.932 ± 0.945
SerArg: 8.371 ± 2.341
SerSer: 10.303 ± 3.521
SerThr: 7.083 ± 1.631
SerVal: 5.151 ± 1.442
SerTrp: 3.863 ± 2.354
SerTyr: 0.644 ± 0.497
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.151 ± 2.23
ThrCys: 1.288 ± 1.22
ThrAsp: 3.863 ± 1.217
ThrGlu: 6.439 ± 1.525
ThrPhe: 1.932 ± 1.696
ThrGly: 1.932 ± 1.034
ThrHis: 2.576 ± 1.038
ThrIle: 3.22 ± 1.927
ThrLys: 2.576 ± 1.218
ThrLeu: 3.22 ± 1.268
ThrMet: 2.576 ± 1.248
ThrAsn: 4.507 ± 1.886
ThrPro: 1.932 ± 1.492
ThrGln: 1.288 ± 0.808
ThrArg: 5.795 ± 2.157
ThrSer: 5.795 ± 1.797
ThrThr: 9.015 ± 2.67
ThrVal: 3.863 ± 1.312
ThrTrp: 0.644 ± 0.497
ThrTyr: 1.288 ± 0.679
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.932 ± 0.783
ValCys: 0.644 ± 0.724
ValAsp: 3.22 ± 1.285
ValGlu: 1.288 ± 0.679
ValPhe: 0.644 ± 0.618
ValGly: 1.288 ± 0.995
ValHis: 2.576 ± 1.193
ValIle: 9.015 ± 2.251
ValLys: 1.932 ± 0.848
ValLeu: 3.863 ± 2.263
ValMet: 0.0 ± 0.0
ValAsn: 7.727 ± 2.037
ValPro: 3.863 ± 1.934
ValGln: 5.795 ± 1.157
ValArg: 3.863 ± 1.2
ValSer: 3.863 ± 1.742
ValThr: 1.288 ± 0.772
ValVal: 4.507 ± 1.93
ValTrp: 0.644 ± 0.618
ValTyr: 0.644 ± 0.725
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.644 ± 0.618
TrpCys: 0.0 ± 0.0
TrpAsp: 1.932 ± 0.854
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.644 ± 0.618
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.644 ± 0.724
TrpLys: 1.288 ± 0.679
TrpLeu: 2.576 ± 1.358
TrpMet: 0.0 ± 0.0
TrpAsn: 1.932 ± 1.017
TrpPro: 0.0 ± 0.0
TrpGln: 0.644 ± 0.618
TrpArg: 0.0 ± 0.0
TrpSer: 0.644 ± 0.618
TrpThr: 2.576 ± 1.529
TrpVal: 1.288 ± 1.236
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.288 ± 0.971
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.288 ± 1.052
TyrCys: 0.0 ± 0.0
TyrAsp: 3.22 ± 1.274
TyrGlu: 1.288 ± 0.971
TyrPhe: 0.0 ± 0.0
TyrGly: 3.22 ± 1.274
TyrHis: 0.644 ± 0.497
TyrIle: 1.288 ± 0.971
TyrLys: 3.22 ± 1.701
TyrLeu: 1.288 ± 0.876
TyrMet: 0.644 ± 0.618
TyrAsn: 0.0 ± 0.0
TyrPro: 4.507 ± 1.607
TyrGln: 1.932 ± 1.112
TyrArg: 0.644 ± 0.497
TyrSer: 0.644 ± 0.497
TyrThr: 2.576 ± 1.358
TyrVal: 1.932 ± 1.492
TyrTrp: 0.644 ± 0.618
TyrTyr: 2.576 ± 1.45
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1554 amino acids)
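
As a rough sanity check on the granularity of the table, every non-zero value appears to be an integer multiple of about 0.644‰. That would be consistent with one occurrence among roughly 1553 dipeptide positions (1554 amino acids minus one). The denominator used below is an inference from the numbers, not something stated on this page, so treat the sketch as an assumption.

```python
# Assumed denominator: 1554 residues give 1553 adjacent pairs if counted as one run.
TOTAL_POSITIONS = 1554 - 1


def count_to_per_mille(count, total=TOTAL_POSITIONS):
    """Convert a raw dipeptide count to a per mille frequency, rounded to 3 places."""
    return round(1000.0 * count / total, 3)


one_hit = count_to_per_mille(1)       # smallest non-zero value in the table
two_hits = count_to_per_mille(2)
sixteen_hits = count_to_per_mille(16) # matches the largest entry, SerSer
```

Under this assumption, a single observed dipeptide contributes about 0.644‰, which also explains why so many entries share the same few values.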

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
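
A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, assuming that whole proteins are resampled with replacement and that the reported error is the standard deviation of the resampled frequencies; the page does not spell out the exact procedure, and the function names and toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def dipeptide_per_mille(sequences, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(resample, dipeptide))
    return stdev(estimates)


# Hypothetical toy sequences, not the six proteins behind the table.
toy = ["MKTAYIAKQR", "GAVLIAK", "AKAKAK"]
value = dipeptide_per_mille(toy, "AK")
error = bootstrap_error(toy, "AK")
```

A dipeptide that never occurs stays at zero in every resample, which is why absent pairs are listed as 0.0 ± 0.0.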

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski