Amino acid dipepetide frequency for Phlox virus S

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.426AlaAla: 6.426 ± 2.224
2.142AlaCys: 2.142 ± 0.64
2.142AlaAsp: 2.142 ± 0.431
5.355AlaGlu: 5.355 ± 1.802
4.641AlaPhe: 4.641 ± 1.062
4.284AlaGly: 4.284 ± 2.178
1.428AlaHis: 1.428 ± 0.469
3.927AlaIle: 3.927 ± 2.697
5.712AlaLys: 5.712 ± 2.382
4.641AlaLeu: 4.641 ± 1.883
1.785AlaMet: 1.785 ± 0.719
3.213AlaAsn: 3.213 ± 0.67
3.57AlaPro: 3.57 ± 0.824
1.071AlaGln: 1.071 ± 0.579
3.57AlaArg: 3.57 ± 1.093
4.998AlaSer: 4.998 ± 1.58
3.927AlaThr: 3.927 ± 0.955
4.998AlaVal: 4.998 ± 1.545
0.0AlaTrp: 0.0 ± 0.0
1.785AlaTyr: 1.785 ± 1.33
0.0AlaXaa: 0.0 ± 0.0
Cys
2.142CysAla: 2.142 ± 2.365
0.714CysCys: 0.714 ± 0.386
1.428CysAsp: 1.428 ± 0.772
1.071CysGlu: 1.071 ± 0.964
3.213CysPhe: 3.213 ± 0.948
2.499CysGly: 2.499 ± 1.386
0.357CysHis: 0.357 ± 0.708
1.785CysIle: 1.785 ± 0.541
1.428CysLys: 1.428 ± 0.5
3.57CysLeu: 3.57 ± 2.382
0.0CysMet: 0.0 ± 0.0
1.785CysAsn: 1.785 ± 1.176
1.428CysPro: 1.428 ± 0.5
0.714CysGln: 0.714 ± 0.386
2.142CysArg: 2.142 ± 0.431
1.428CysSer: 1.428 ± 1.303
1.785CysThr: 1.785 ± 0.541
2.142CysVal: 2.142 ± 2.001
0.0CysTrp: 0.0 ± 0.0
0.714CysTyr: 0.714 ± 0.615
0.0CysXaa: 0.0 ± 0.0
Asp
2.499AspAla: 2.499 ± 0.397
1.785AspCys: 1.785 ± 0.579
2.142AspAsp: 2.142 ± 0.725
5.712AspGlu: 5.712 ± 2.382
2.856AspPhe: 2.856 ± 1.406
2.499AspGly: 2.499 ± 0.714
1.071AspHis: 1.071 ± 1.331
2.142AspIle: 2.142 ± 1.061
1.428AspLys: 1.428 ± 0.469
6.069AspLeu: 6.069 ± 1.992
0.714AspMet: 0.714 ± 0.462
2.499AspAsn: 2.499 ± 1.11
2.856AspPro: 2.856 ± 2.197
0.357AspGln: 0.357 ± 0.193
2.856AspArg: 2.856 ± 0.451
2.142AspSer: 2.142 ± 1.868
1.428AspThr: 1.428 ± 0.909
4.284AspVal: 4.284 ± 1.407
1.428AspTrp: 1.428 ± 0.469
2.142AspTyr: 2.142 ± 0.725
0.0AspXaa: 0.0 ± 0.0
Glu
4.641GluAla: 4.641 ± 1.82
0.714GluCys: 0.714 ± 0.462
4.641GluAsp: 4.641 ± 1.312
7.854GluGlu: 7.854 ± 1.423
3.927GluPhe: 3.927 ± 2.08
3.57GluGly: 3.57 ± 0.721
2.499GluHis: 2.499 ± 1.351
3.213GluIle: 3.213 ± 1.024
4.284GluLys: 4.284 ± 2.315
5.355GluLeu: 5.355 ± 1.616
1.785GluMet: 1.785 ± 0.579
3.213GluAsn: 3.213 ± 1.036
1.785GluPro: 1.785 ± 0.965
3.213GluGln: 3.213 ± 1.241
3.57GluArg: 3.57 ± 2.125
3.57GluSer: 3.57 ± 0.916
1.785GluThr: 1.785 ± 0.579
8.568GluVal: 8.568 ± 1.24
0.714GluTrp: 0.714 ± 0.386
1.428GluTyr: 1.428 ± 0.683
0.0GluXaa: 0.0 ± 0.0
Phe
3.927PheAla: 3.927 ± 1.132
2.142PheCys: 2.142 ± 0.431
3.57PheAsp: 3.57 ± 0.651
4.641PheGlu: 4.641 ± 0.98
2.142PhePhe: 2.142 ± 0.64
3.213PheGly: 3.213 ± 1.023
0.357PheHis: 0.357 ± 0.754
3.57PheIle: 3.57 ± 1.169
2.856PheLys: 2.856 ± 0.713
7.14PheLeu: 7.14 ± 2.826
1.785PheMet: 1.785 ± 0.947
3.57PheAsn: 3.57 ± 0.889
0.0PhePro: 0.0 ± 0.0
3.57PheGln: 3.57 ± 0.973
2.499PheArg: 2.499 ± 1.351
4.284PheSer: 4.284 ± 2.315
2.856PheThr: 2.856 ± 1.495
2.499PheVal: 2.499 ± 1.083
0.714PheTrp: 0.714 ± 0.615
1.428PheTyr: 1.428 ± 0.772
0.0PheXaa: 0.0 ± 0.0
Gly
4.284GlyAla: 4.284 ± 0.704
1.428GlyCys: 1.428 ± 2.279
2.856GlyAsp: 2.856 ± 0.87
4.284GlyGlu: 4.284 ± 1.125
4.998GlyPhe: 4.998 ± 0.708
4.284GlyGly: 4.284 ± 1.171
1.071GlyHis: 1.071 ± 0.579
2.499GlyIle: 2.499 ± 0.85
4.998GlyLys: 4.998 ± 1.558
5.712GlyLeu: 5.712 ± 1.468
0.714GlyMet: 0.714 ± 0.386
2.499GlyAsn: 2.499 ± 0.941
1.785GlyPro: 1.785 ± 0.537
1.071GlyGln: 1.071 ± 0.579
3.213GlyArg: 3.213 ± 1.141
2.856GlySer: 2.856 ± 1.088
3.57GlyThr: 3.57 ± 1.012
7.14GlyVal: 7.14 ± 2.125
1.071GlyTrp: 1.071 ± 0.579
1.785GlyTyr: 1.785 ± 1.045
0.0GlyXaa: 0.0 ± 0.0
His
1.071HisAla: 1.071 ± 1.331
1.071HisCys: 1.071 ± 0.53
1.071HisAsp: 1.071 ± 0.579
2.499HisGlu: 2.499 ± 0.775
1.071HisPhe: 1.071 ± 0.579
0.714HisGly: 0.714 ± 0.898
1.071HisHis: 1.071 ± 0.579
1.785HisIle: 1.785 ± 0.537
1.428HisLys: 1.428 ± 0.469
2.499HisLeu: 2.499 ± 0.888
0.0HisMet: 0.0 ± 0.0
1.071HisAsn: 1.071 ± 0.579
1.428HisPro: 1.428 ± 0.909
0.357HisGln: 0.357 ± 0.193
1.428HisArg: 1.428 ± 0.909
2.142HisSer: 2.142 ± 0.872
1.428HisThr: 1.428 ± 0.932
1.071HisVal: 1.071 ± 0.579
0.357HisTrp: 0.357 ± 0.193
1.071HisTyr: 1.071 ± 0.579
0.0HisXaa: 0.0 ± 0.0
Ile
3.57IleAla: 3.57 ± 1.578
1.785IleCys: 1.785 ± 0.541
2.856IleAsp: 2.856 ± 1.062
3.57IleGlu: 3.57 ± 0.651
3.213IlePhe: 3.213 ± 1.269
2.856IleGly: 2.856 ± 1.544
0.714IleHis: 0.714 ± 1.217
1.785IleIle: 1.785 ± 0.797
6.069IleLys: 6.069 ± 1.436
4.998IleLeu: 4.998 ± 2.938
0.714IleMet: 0.714 ± 0.462
0.357IleAsn: 0.357 ± 0.193
1.785IlePro: 1.785 ± 0.678
1.428IleGln: 1.428 ± 1.523
1.785IleArg: 1.785 ± 0.537
1.785IleSer: 1.785 ± 1.711
1.785IleThr: 1.785 ± 0.579
4.641IleVal: 4.641 ± 1.179
0.0IleTrp: 0.0 ± 0.0
3.213IleTyr: 3.213 ± 2.275
0.0IleXaa: 0.0 ± 0.0
Lys
4.284LysAla: 4.284 ± 1.064
1.785LysCys: 1.785 ± 1.54
1.785LysAsp: 1.785 ± 0.821
2.856LysGlu: 2.856 ± 1.544
4.284LysPhe: 4.284 ± 1.799
4.998LysGly: 4.998 ± 2.007
2.499LysHis: 2.499 ± 0.775
2.499LysIle: 2.499 ± 1.182
5.355LysLys: 5.355 ± 1.836
6.069LysLeu: 6.069 ± 1.992
1.071LysMet: 1.071 ± 0.579
2.499LysAsn: 2.499 ± 1.351
5.712LysPro: 5.712 ± 1.735
2.499LysGln: 2.499 ± 1.044
3.213LysArg: 3.213 ± 1.024
5.712LysSer: 5.712 ± 1.29
2.499LysThr: 2.499 ± 0.714
2.856LysVal: 2.856 ± 1.101
1.071LysTrp: 1.071 ± 0.579
2.499LysTyr: 2.499 ± 1.321
0.0LysXaa: 0.0 ± 0.0
Leu
6.069LeuAla: 6.069 ± 1.992
4.641LeuCys: 4.641 ± 2.803
3.927LeuAsp: 3.927 ± 1.453
6.069LeuGlu: 6.069 ± 1.376
3.213LeuPhe: 3.213 ± 1.598
6.783LeuGly: 6.783 ± 1.307
2.499LeuHis: 2.499 ± 1.351
4.284LeuIle: 4.284 ± 1.972
6.783LeuLys: 6.783 ± 2.403
9.996LeuLeu: 9.996 ± 3.891
1.785LeuMet: 1.785 ± 0.579
7.14LeuAsn: 7.14 ± 1.375
3.57LeuPro: 3.57 ± 2.125
2.856LeuGln: 2.856 ± 0.74
3.57LeuArg: 3.57 ± 0.538
7.14LeuSer: 7.14 ± 1.587
5.355LeuThr: 5.355 ± 1.71
5.712LeuVal: 5.712 ± 1.401
0.714LeuTrp: 0.714 ± 1.019
2.856LeuTyr: 2.856 ± 0.451
0.0LeuXaa: 0.0 ± 0.0
Met
3.213MetAla: 3.213 ± 1.036
0.714MetCys: 0.714 ± 0.386
1.428MetAsp: 1.428 ± 0.5
1.785MetGlu: 1.785 ± 0.965
0.0MetPhe: 0.0 ± 0.0
1.785MetGly: 1.785 ± 0.579
0.357MetHis: 0.357 ± 0.567
1.428MetIle: 1.428 ± 0.772
1.071MetLys: 1.071 ± 0.878
2.142MetLeu: 2.142 ± 1.081
0.357MetMet: 0.357 ± 0.193
0.357MetAsn: 0.357 ± 0.193
2.142MetPro: 2.142 ± 0.872
1.071MetGln: 1.071 ± 0.579
2.499MetArg: 2.499 ± 1.007
1.071MetSer: 1.071 ± 0.579
0.357MetThr: 0.357 ± 0.193
1.785MetVal: 1.785 ± 0.579
0.0MetTrp: 0.0 ± 0.0
0.357MetTyr: 0.357 ± 0.193
0.0MetXaa: 0.0 ± 0.0
Asn
2.142AsnAla: 2.142 ± 1.385
2.856AsnCys: 2.856 ± 0.931
0.714AsnAsp: 0.714 ± 0.386
3.57AsnGlu: 3.57 ± 1.93
2.499AsnPhe: 2.499 ± 1.16
2.142AsnGly: 2.142 ± 0.645
0.714AsnHis: 0.714 ± 0.386
2.499AsnIle: 2.499 ± 1.034
1.428AsnLys: 1.428 ± 0.76
4.998AsnLeu: 4.998 ± 0.923
1.428AsnMet: 1.428 ± 0.469
3.927AsnAsn: 3.927 ± 2.472
1.428AsnPro: 1.428 ± 0.469
1.071AsnGln: 1.071 ± 0.576
3.213AsnArg: 3.213 ± 1.173
3.213AsnSer: 3.213 ± 2.052
2.142AsnThr: 2.142 ± 1.385
3.213AsnVal: 3.213 ± 0.57
0.714AsnTrp: 0.714 ± 0.386
2.142AsnTyr: 2.142 ± 0.725
0.0AsnXaa: 0.0 ± 0.0
Pro
3.213ProAla: 3.213 ± 0.67
0.357ProCys: 0.357 ± 0.193
3.927ProAsp: 3.927 ± 1.184
2.856ProGlu: 2.856 ± 1.062
1.071ProPhe: 1.071 ± 0.576
3.213ProGly: 3.213 ± 1.191
1.428ProHis: 1.428 ± 1.14
2.499ProIle: 2.499 ± 1.568
2.142ProLys: 2.142 ± 0.725
3.213ProLeu: 3.213 ± 0.857
1.071ProMet: 1.071 ± 0.435
0.357ProAsn: 0.357 ± 1.002
2.856ProPro: 2.856 ± 2.45
2.142ProGln: 2.142 ± 1.659
4.284ProArg: 4.284 ± 0.773
2.142ProSer: 2.142 ± 2.031
3.213ProThr: 3.213 ± 2.391
1.785ProVal: 1.785 ± 0.965
1.428ProTrp: 1.428 ± 0.797
1.785ProTyr: 1.785 ± 1.007
0.0ProXaa: 0.0 ± 0.0
Gln
1.785GlnAla: 1.785 ± 0.541
1.071GlnCys: 1.071 ± 1.284
0.714GlnAsp: 0.714 ± 0.386
1.785GlnGlu: 1.785 ± 0.579
1.785GlnPhe: 1.785 ± 0.579
2.142GlnGly: 2.142 ± 0.725
1.428GlnHis: 1.428 ± 0.469
1.071GlnIle: 1.071 ± 0.53
1.428GlnLys: 1.428 ± 0.76
4.284GlnLeu: 4.284 ± 1.449
1.428GlnMet: 1.428 ± 0.454
1.785GlnAsn: 1.785 ± 1.375
1.071GlnPro: 1.071 ± 1.066
0.357GlnGln: 0.357 ± 0.193
0.714GlnArg: 0.714 ± 0.462
2.856GlnSer: 2.856 ± 1.544
0.357GlnThr: 0.357 ± 0.193
2.499GlnVal: 2.499 ± 1.613
0.0GlnTrp: 0.0 ± 0.0
0.357GlnTyr: 0.357 ± 0.193
0.0GlnXaa: 0.0 ± 0.0
Arg
5.712ArgAla: 5.712 ± 1.375
1.428ArgCys: 1.428 ± 1.231
3.213ArgAsp: 3.213 ± 1.254
3.213ArgGlu: 3.213 ± 1.099
5.712ArgPhe: 5.712 ± 1.28
2.142ArgGly: 2.142 ± 2.37
1.428ArgHis: 1.428 ± 0.5
2.856ArgIle: 2.856 ± 1.017
1.428ArgLys: 1.428 ± 0.683
4.284ArgLeu: 4.284 ± 1.266
1.071ArgMet: 1.071 ± 0.579
1.071ArgAsn: 1.071 ± 0.423
2.856ArgPro: 2.856 ± 2.762
1.071ArgGln: 1.071 ± 0.423
4.998ArgArg: 4.998 ± 1.798
2.499ArgSer: 2.499 ± 1.11
1.785ArgThr: 1.785 ± 0.797
2.856ArgVal: 2.856 ± 1.607
1.071ArgTrp: 1.071 ± 0.579
2.499ArgTyr: 2.499 ± 0.888
0.0ArgXaa: 0.0 ± 0.0
Ser
3.927SerAla: 3.927 ± 1.001
1.428SerCys: 1.428 ± 1.196
3.213SerAsp: 3.213 ± 1.023
4.641SerGlu: 4.641 ± 1.555
2.499SerPhe: 2.499 ± 0.775
3.57SerGly: 3.57 ± 1.255
1.785SerHis: 1.785 ± 0.541
3.213SerIle: 3.213 ± 1.121
6.069SerLys: 6.069 ± 1.557
4.641SerLeu: 4.641 ± 1.901
1.785SerMet: 1.785 ± 0.965
2.856SerAsn: 2.856 ± 1.297
2.856SerPro: 2.856 ± 1.088
2.499SerGln: 2.499 ± 1.351
3.927SerArg: 3.927 ± 1.096
6.069SerSer: 6.069 ± 1.06
3.927SerThr: 3.927 ± 1.044
4.998SerVal: 4.998 ± 2.662
0.357SerTrp: 0.357 ± 0.193
2.142SerTyr: 2.142 ± 0.645
0.0SerXaa: 0.0 ± 0.0
Thr
3.213ThrAla: 3.213 ± 1.835
1.071ThrCys: 1.071 ± 1.229
1.785ThrAsp: 1.785 ± 1.078
2.142ThrGlu: 2.142 ± 0.944
5.355ThrPhe: 5.355 ± 1.135
3.57ThrGly: 3.57 ± 1.289
1.071ThrHis: 1.071 ± 0.579
2.142ThrIle: 2.142 ± 1.158
2.499ThrLys: 2.499 ± 1.862
4.998ThrLeu: 4.998 ± 3.498
0.714ThrMet: 0.714 ± 0.386
2.499ThrAsn: 2.499 ± 0.873
2.142ThrPro: 2.142 ± 1.385
0.357ThrGln: 0.357 ± 0.193
1.071ThrArg: 1.071 ± 0.847
4.284ThrSer: 4.284 ± 0.704
0.714ThrThr: 0.714 ± 0.386
3.213ThrVal: 3.213 ± 0.573
0.714ThrTrp: 0.714 ± 1.133
1.785ThrTyr: 1.785 ± 0.537
0.0ThrXaa: 0.0 ± 0.0
Val
4.284ValAla: 4.284 ± 1.183
2.499ValCys: 2.499 ± 1.277
5.355ValAsp: 5.355 ± 1.582
4.998ValGlu: 4.998 ± 1.43
1.785ValPhe: 1.785 ± 0.579
5.355ValGly: 5.355 ± 2.443
1.428ValHis: 1.428 ± 0.797
3.927ValIle: 3.927 ± 1.126
5.712ValLys: 5.712 ± 2.017
6.783ValLeu: 6.783 ± 2.115
2.499ValMet: 2.499 ± 1.351
1.428ValAsn: 1.428 ± 1.196
2.856ValPro: 2.856 ± 1.175
2.142ValGln: 2.142 ± 0.431
3.213ValArg: 3.213 ± 0.785
4.641ValSer: 4.641 ± 1.246
5.355ValThr: 5.355 ± 1.925
4.284ValVal: 4.284 ± 2.892
0.357ValTrp: 0.357 ± 0.193
3.213ValTyr: 3.213 ± 1.003
0.0ValXaa: 0.0 ± 0.0
Trp
0.357TrpAla: 0.357 ± 1.002
0.357TrpCys: 0.357 ± 0.193
0.714TrpAsp: 0.714 ± 0.462
0.0TrpGlu: 0.0 ± 0.0
1.785TrpPhe: 1.785 ± 0.678
0.357TrpGly: 0.357 ± 0.708
0.357TrpHis: 0.357 ± 0.193
0.0TrpIle: 0.0 ± 0.0
0.357TrpLys: 0.357 ± 0.193
1.785TrpLeu: 1.785 ± 0.965
0.357TrpMet: 0.357 ± 0.193
1.428TrpAsn: 1.428 ± 0.923
0.714TrpPro: 0.714 ± 0.386
0.357TrpGln: 0.357 ± 0.193
0.0TrpArg: 0.0 ± 0.0
0.714TrpSer: 0.714 ± 0.462
0.0TrpThr: 0.0 ± 0.0
1.071TrpVal: 1.071 ± 0.579
0.0TrpTrp: 0.0 ± 0.0
0.357TrpTyr: 0.357 ± 0.193
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.856TyrAla: 2.856 ± 0.948
0.357TyrCys: 0.357 ± 1.069
1.428TyrAsp: 1.428 ± 0.599
1.428TyrGlu: 1.428 ± 0.772
1.071TyrPhe: 1.071 ± 0.579
2.142TyrGly: 2.142 ± 0.986
1.071TyrHis: 1.071 ± 0.53
1.785TyrIle: 1.785 ± 0.821
3.57TyrLys: 3.57 ± 0.673
1.785TyrLeu: 1.785 ± 0.965
2.499TyrMet: 2.499 ± 0.668
2.499TyrAsn: 2.499 ± 1.224
2.142TyrPro: 2.142 ± 0.797
0.714TyrGln: 0.714 ± 0.462
1.428TyrArg: 1.428 ± 0.599
2.856TyrSer: 2.856 ± 0.87
1.071TyrThr: 1.071 ± 1.369
2.499TyrVal: 2.499 ± 0.714
0.357TyrTrp: 0.357 ± 0.193
1.785TyrTyr: 1.785 ± 0.813
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2802 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski