Amino acid dipepetide frequency for Chagres virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.094AlaAla: 4.094 ± 2.056
1.535AlaCys: 1.535 ± 0.222
1.535AlaAsp: 1.535 ± 0.419
4.094AlaGlu: 4.094 ± 0.57
0.768AlaPhe: 0.768 ± 0.359
3.582AlaGly: 3.582 ± 1.479
2.047AlaHis: 2.047 ± 1.208
4.606AlaIle: 4.606 ± 0.82
2.559AlaLys: 2.559 ± 0.891
4.606AlaLeu: 4.606 ± 0.737
1.791AlaMet: 1.791 ± 0.578
1.791AlaAsn: 1.791 ± 0.636
1.791AlaPro: 1.791 ± 0.399
3.071AlaGln: 3.071 ± 0.872
3.071AlaArg: 3.071 ± 0.556
4.862AlaSer: 4.862 ± 1.739
2.559AlaThr: 2.559 ± 1.219
3.327AlaVal: 3.327 ± 0.712
0.0AlaTrp: 0.0 ± 0.0
1.791AlaTyr: 1.791 ± 0.912
0.0AlaXaa: 0.0 ± 0.0
Cys
0.512CysAla: 0.512 ± 0.432
0.512CysCys: 0.512 ± 0.409
0.768CysAsp: 0.768 ± 0.178
1.535CysGlu: 1.535 ± 0.419
1.535CysPhe: 1.535 ± 0.517
1.791CysGly: 1.791 ± 0.254
1.279CysHis: 1.279 ± 0.494
1.791CysIle: 1.791 ± 0.967
1.791CysLys: 1.791 ± 0.967
2.559CysLeu: 2.559 ± 0.699
0.512CysMet: 0.512 ± 0.14
0.768CysAsn: 0.768 ± 0.178
1.279CysPro: 1.279 ± 0.494
1.535CysGln: 1.535 ± 0.784
1.535CysArg: 1.535 ± 0.45
5.629CysSer: 5.629 ± 2.393
0.768CysThr: 0.768 ± 0.178
0.768CysVal: 0.768 ± 0.364
0.256CysTrp: 0.256 ± 0.42
1.024CysTyr: 1.024 ± 0.279
0.0CysXaa: 0.0 ± 0.0
Asp
2.559AspAla: 2.559 ± 0.684
1.535AspCys: 1.535 ± 0.617
4.35AspAsp: 4.35 ± 0.966
4.862AspGlu: 4.862 ± 0.635
2.559AspPhe: 2.559 ± 1.311
2.303AspGly: 2.303 ± 0.449
1.279AspHis: 1.279 ± 0.276
6.141AspIle: 6.141 ± 0.809
3.327AspLys: 3.327 ± 0.452
6.141AspLeu: 6.141 ± 0.805
1.024AspMet: 1.024 ± 0.312
2.815AspAsn: 2.815 ± 0.288
2.303AspPro: 2.303 ± 0.809
0.768AspGln: 0.768 ± 0.364
2.559AspArg: 2.559 ± 0.688
3.838AspSer: 3.838 ± 0.753
2.303AspThr: 2.303 ± 0.531
2.303AspVal: 2.303 ± 0.502
0.768AspTrp: 0.768 ± 0.178
0.768AspTyr: 0.768 ± 0.817
0.0AspXaa: 0.0 ± 0.0
Glu
5.118GluAla: 5.118 ± 1.35
1.535GluCys: 1.535 ± 0.222
4.606GluAsp: 4.606 ± 0.383
5.374GluGlu: 5.374 ± 1.332
4.606GluPhe: 4.606 ± 1.609
2.815GluGly: 2.815 ± 0.615
0.768GluHis: 0.768 ± 0.485
4.862GluIle: 4.862 ± 1.035
4.094GluLys: 4.094 ± 1.238
6.141GluLeu: 6.141 ± 1.216
1.279GluMet: 1.279 ± 0.461
2.815GluAsn: 2.815 ± 0.493
2.047GluPro: 2.047 ± 1.126
2.303GluGln: 2.303 ± 0.765
3.838GluArg: 3.838 ± 0.441
4.606GluSer: 4.606 ± 0.671
2.815GluThr: 2.815 ± 0.457
2.559GluVal: 2.559 ± 0.551
0.512GluTrp: 0.512 ± 0.14
1.279GluTyr: 1.279 ± 0.276
0.0GluXaa: 0.0 ± 0.0
Phe
2.559PheAla: 2.559 ± 1.089
1.024PheCys: 1.024 ± 0.604
3.838PheAsp: 3.838 ± 1.887
1.024PheGlu: 1.024 ± 0.309
1.791PhePhe: 1.791 ± 0.254
1.024PheGly: 1.024 ± 0.646
2.047PheHis: 2.047 ± 1.274
1.279PheIle: 1.279 ± 0.461
3.582PheLys: 3.582 ± 0.507
4.862PheLeu: 4.862 ± 0.758
1.791PheMet: 1.791 ± 0.254
2.559PheAsn: 2.559 ± 0.321
1.791PhePro: 1.791 ± 0.952
1.279PheGln: 1.279 ± 0.276
2.047PheArg: 2.047 ± 0.721
3.582PheSer: 3.582 ± 0.65
3.838PheThr: 3.838 ± 1.671
3.327PheVal: 3.327 ± 0.715
0.768PheTrp: 0.768 ± 0.334
1.279PheTyr: 1.279 ± 0.674
0.0PheXaa: 0.0 ± 0.0
Gly
3.327GlyAla: 3.327 ± 0.647
1.024GlyCys: 1.024 ± 0.279
2.303GlyAsp: 2.303 ± 0.458
2.815GlyGlu: 2.815 ± 0.494
4.606GlyPhe: 4.606 ± 0.644
3.582GlyGly: 3.582 ± 0.587
1.791GlyHis: 1.791 ± 0.776
4.606GlyIle: 4.606 ± 1.552
3.838GlyLys: 3.838 ± 0.713
2.559GlyLeu: 2.559 ± 0.984
2.047GlyMet: 2.047 ± 0.354
2.047GlyAsn: 2.047 ± 1.086
1.791GlyPro: 1.791 ± 0.662
1.791GlyGln: 1.791 ± 1.334
1.535GlyArg: 1.535 ± 0.543
7.165GlySer: 7.165 ± 0.856
2.559GlyThr: 2.559 ± 0.352
4.606GlyVal: 4.606 ± 0.82
0.512GlyTrp: 0.512 ± 0.474
1.535GlyTyr: 1.535 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
0.768HisAla: 0.768 ± 0.364
0.512HisCys: 0.512 ± 0.14
2.047HisAsp: 2.047 ± 0.619
1.024HisGlu: 1.024 ± 0.279
1.791HisPhe: 1.791 ± 0.254
2.303HisGly: 2.303 ± 0.768
0.256HisHis: 0.256 ± 0.162
2.303HisIle: 2.303 ± 0.531
2.047HisLys: 2.047 ± 0.935
0.512HisLeu: 0.512 ± 0.323
0.768HisMet: 0.768 ± 0.334
1.279HisAsn: 1.279 ± 0.706
1.279HisPro: 1.279 ± 0.615
0.512HisGln: 0.512 ± 0.323
1.279HisArg: 1.279 ± 0.344
3.582HisSer: 3.582 ± 0.962
0.256HisThr: 0.256 ± 0.162
1.535HisVal: 1.535 ± 0.419
0.0HisTrp: 0.0 ± 0.0
1.791HisTyr: 1.791 ± 0.628
0.0HisXaa: 0.0 ± 0.0
Ile
4.094IleAla: 4.094 ± 0.717
3.071IleCys: 3.071 ± 0.721
4.862IleAsp: 4.862 ± 0.635
5.885IleGlu: 5.885 ± 1.655
1.791IlePhe: 1.791 ± 0.578
5.374IleGly: 5.374 ± 1.157
0.768IleHis: 0.768 ± 0.334
4.35IleIle: 4.35 ± 0.872
2.815IleLys: 2.815 ± 0.493
5.374IleLeu: 5.374 ± 0.895
2.047IleMet: 2.047 ± 0.891
4.094IleAsn: 4.094 ± 1.179
2.303IlePro: 2.303 ± 0.836
1.791IleGln: 1.791 ± 0.952
5.629IleArg: 5.629 ± 0.239
6.909IleSer: 6.909 ± 1.813
2.815IleThr: 2.815 ± 0.902
4.606IleVal: 4.606 ± 0.364
0.512IleTrp: 0.512 ± 0.323
1.535IleTyr: 1.535 ± 0.97
0.0IleXaa: 0.0 ± 0.0
Lys
2.303LysAla: 2.303 ± 0.231
1.535LysCys: 1.535 ± 0.813
3.327LysAsp: 3.327 ± 0.684
3.582LysGlu: 3.582 ± 0.53
1.279LysPhe: 1.279 ± 0.344
3.838LysGly: 3.838 ± 0.489
1.024LysHis: 1.024 ± 0.298
4.606LysIle: 4.606 ± 0.99
5.118LysLys: 5.118 ± 0.853
4.606LysLeu: 4.606 ± 0.445
4.094LysMet: 4.094 ± 0.799
1.791LysAsn: 1.791 ± 0.399
3.582LysPro: 3.582 ± 0.175
2.303LysGln: 2.303 ± 0.502
2.303LysArg: 2.303 ± 0.372
5.118LysSer: 5.118 ± 0.872
4.35LysThr: 4.35 ± 0.501
5.374LysVal: 5.374 ± 1.695
1.279LysTrp: 1.279 ± 0.461
2.047LysTyr: 2.047 ± 0.22
0.0LysXaa: 0.0 ± 0.0
Leu
3.838LeuAla: 3.838 ± 0.637
1.791LeuCys: 1.791 ± 1.005
3.838LeuAsp: 3.838 ± 1.094
5.629LeuGlu: 5.629 ± 0.239
4.606LeuPhe: 4.606 ± 1.536
3.838LeuGly: 3.838 ± 0.023
1.791LeuHis: 1.791 ± 0.776
6.909LeuIle: 6.909 ± 1.467
6.141LeuLys: 6.141 ± 0.931
8.188LeuLeu: 8.188 ± 1.603
2.815LeuMet: 2.815 ± 0.302
4.862LeuAsn: 4.862 ± 1.217
3.071LeuPro: 3.071 ± 1.307
2.559LeuGln: 2.559 ± 0.291
6.653LeuArg: 6.653 ± 1.592
7.677LeuSer: 7.677 ± 0.882
5.374LeuThr: 5.374 ± 0.96
3.838LeuVal: 3.838 ± 0.496
0.768LeuTrp: 0.768 ± 0.364
2.303LeuTyr: 2.303 ± 0.809
0.0LeuXaa: 0.0 ± 0.0
Met
1.279MetAla: 1.279 ± 0.342
0.768MetCys: 0.768 ± 0.364
1.791MetAsp: 1.791 ± 0.889
2.303MetGlu: 2.303 ± 0.939
1.535MetPhe: 1.535 ± 0.222
1.535MetGly: 1.535 ± 0.222
1.279MetHis: 1.279 ± 0.615
2.559MetIle: 2.559 ± 0.497
1.024MetLys: 1.024 ± 0.419
2.047MetLeu: 2.047 ± 0.269
2.303MetMet: 2.303 ± 0.58
1.024MetAsn: 1.024 ± 0.554
0.256MetPro: 0.256 ± 0.244
1.279MetGln: 1.279 ± 0.308
1.024MetArg: 1.024 ± 0.298
4.35MetSer: 4.35 ± 1.262
1.791MetThr: 1.791 ± 0.458
2.303MetVal: 2.303 ± 0.533
0.0MetTrp: 0.0 ± 0.0
1.791MetTyr: 1.791 ± 0.628
0.0MetXaa: 0.0 ± 0.0
Asn
1.279AsnAla: 1.279 ± 0.276
1.535AsnCys: 1.535 ± 0.728
3.582AsnAsp: 3.582 ± 0.54
2.559AsnGlu: 2.559 ± 0.997
2.303AsnPhe: 2.303 ± 0.871
2.815AsnGly: 2.815 ± 0.902
1.791AsnHis: 1.791 ± 0.735
3.071AsnIle: 3.071 ± 1.086
2.303AsnLys: 2.303 ± 0.799
5.629AsnLeu: 5.629 ± 1.143
0.512AsnMet: 0.512 ± 0.323
1.279AsnAsn: 1.279 ± 0.276
3.327AsnPro: 3.327 ± 0.45
2.047AsnGln: 2.047 ± 0.519
1.535AsnArg: 1.535 ± 0.617
4.094AsnSer: 4.094 ± 0.717
1.279AsnThr: 1.279 ± 0.276
2.047AsnVal: 2.047 ± 0.963
1.279AsnTrp: 1.279 ± 0.308
2.047AsnTyr: 2.047 ± 0.916
0.0AsnXaa: 0.0 ± 0.0
Pro
1.279ProAla: 1.279 ± 0.615
0.512ProCys: 0.512 ± 0.474
1.791ProAsp: 1.791 ± 0.478
2.303ProGlu: 2.303 ± 1.454
2.559ProPhe: 2.559 ± 0.506
3.582ProGly: 3.582 ± 0.536
1.791ProHis: 1.791 ± 0.717
2.303ProIle: 2.303 ± 0.819
2.047ProLys: 2.047 ± 0.269
2.303ProLeu: 2.303 ± 0.747
1.279ProMet: 1.279 ± 0.791
1.279ProAsn: 1.279 ± 0.599
0.768ProPro: 0.768 ± 0.178
1.279ProGln: 1.279 ± 0.276
2.047ProArg: 2.047 ± 0.319
4.094ProSer: 4.094 ± 1.173
1.791ProThr: 1.791 ± 0.567
2.559ProVal: 2.559 ± 0.785
1.024ProTrp: 1.024 ± 0.554
1.791ProTyr: 1.791 ± 0.315
0.0ProXaa: 0.0 ± 0.0
Gln
1.791GlnAla: 1.791 ± 1.373
1.791GlnCys: 1.791 ± 1.334
1.791GlnAsp: 1.791 ± 0.938
1.791GlnGlu: 1.791 ± 0.399
1.535GlnPhe: 1.535 ± 0.939
2.303GlnGly: 2.303 ± 0.458
1.535GlnHis: 1.535 ± 0.97
3.582GlnIle: 3.582 ± 0.53
2.303GlnLys: 2.303 ± 0.768
2.303GlnLeu: 2.303 ± 1.223
0.512GlnMet: 0.512 ± 0.14
1.791GlnAsn: 1.791 ± 0.234
1.791GlnPro: 1.791 ± 0.52
1.279GlnGln: 1.279 ± 0.846
0.768GlnArg: 0.768 ± 0.418
2.559GlnSer: 2.559 ± 0.699
1.279GlnThr: 1.279 ± 0.461
2.559GlnVal: 2.559 ± 0.896
0.0GlnTrp: 0.0 ± 0.0
0.512GlnTyr: 0.512 ± 0.14
0.0GlnXaa: 0.0 ± 0.0
Arg
3.838ArgAla: 3.838 ± 0.951
2.047ArgCys: 2.047 ± 0.442
2.815ArgAsp: 2.815 ± 1.179
4.862ArgGlu: 4.862 ± 0.689
1.024ArgPhe: 1.024 ± 0.309
2.815ArgGly: 2.815 ± 1.137
0.0ArgHis: 0.0 ± 0.0
3.071ArgIle: 3.071 ± 0.365
2.815ArgLys: 2.815 ± 0.615
4.862ArgLeu: 4.862 ± 0.528
1.535ArgMet: 1.535 ± 0.543
2.559ArgAsn: 2.559 ± 0.683
2.303ArgPro: 2.303 ± 0.982
2.559ArgGln: 2.559 ± 0.522
1.791ArgArg: 1.791 ± 1.823
5.885ArgSer: 5.885 ± 1.766
1.279ArgThr: 1.279 ± 1.023
2.303ArgVal: 2.303 ± 0.798
0.768ArgTrp: 0.768 ± 0.178
1.791ArgTyr: 1.791 ± 0.234
0.0ArgXaa: 0.0 ± 0.0
Ser
5.885SerAla: 5.885 ± 1.578
3.838SerCys: 3.838 ± 2.539
4.862SerAsp: 4.862 ± 0.528
5.885SerGlu: 5.885 ± 1.209
4.094SerPhe: 4.094 ± 0.585
3.838SerGly: 3.838 ± 1.395
2.815SerHis: 2.815 ± 0.615
4.606SerIle: 4.606 ± 1.257
7.932SerLys: 7.932 ± 0.58
11.771SerLeu: 11.771 ± 0.954
3.071SerMet: 3.071 ± 0.405
4.35SerAsn: 4.35 ± 1.045
3.327SerPro: 3.327 ± 0.41
2.303SerGln: 2.303 ± 0.794
4.35SerArg: 4.35 ± 0.312
10.747SerSer: 10.747 ± 0.716
5.374SerThr: 5.374 ± 1.513
5.374SerVal: 5.374 ± 0.883
2.047SerTrp: 2.047 ± 0.363
2.047SerTyr: 2.047 ± 0.596
0.0SerXaa: 0.0 ± 0.0
Thr
2.047ThrAla: 2.047 ± 0.319
1.279ThrCys: 1.279 ± 0.342
3.071ThrAsp: 3.071 ± 0.894
4.35ThrGlu: 4.35 ± 0.694
1.791ThrPhe: 1.791 ± 0.458
3.838ThrGly: 3.838 ± 1.045
0.256ThrHis: 0.256 ± 0.162
4.094ThrIle: 4.094 ± 0.537
4.35ThrLys: 4.35 ± 1.078
5.374ThrLeu: 5.374 ± 1.121
0.768ThrMet: 0.768 ± 0.364
2.047ThrAsn: 2.047 ± 0.619
1.535ThrPro: 1.535 ± 0.355
1.279ThrGln: 1.279 ± 0.344
3.071ThrArg: 3.071 ± 0.708
4.094ThrSer: 4.094 ± 0.456
2.559ThrThr: 2.559 ± 0.651
3.071ThrVal: 3.071 ± 1.393
0.256ThrTrp: 0.256 ± 0.425
1.279ThrTyr: 1.279 ± 0.293
0.0ThrXaa: 0.0 ± 0.0
Val
4.606ValAla: 4.606 ± 1.101
1.535ValCys: 1.535 ± 0.728
1.791ValAsp: 1.791 ± 0.917
2.815ValGlu: 2.815 ± 0.691
3.838ValPhe: 3.838 ± 0.532
2.559ValGly: 2.559 ± 1.33
2.303ValHis: 2.303 ± 0.533
2.815ValIle: 2.815 ± 0.493
3.071ValLys: 3.071 ± 0.961
3.071ValLeu: 3.071 ± 0.464
2.047ValMet: 2.047 ± 0.67
3.582ValAsn: 3.582 ± 0.536
1.535ValPro: 1.535 ± 0.222
2.815ValGln: 2.815 ± 0.414
3.838ValArg: 3.838 ± 0.489
6.909ValSer: 6.909 ± 1.087
4.094ValThr: 4.094 ± 0.151
4.606ValVal: 4.606 ± 1.696
0.768ValTrp: 0.768 ± 0.178
1.535ValTyr: 1.535 ± 0.355
0.0ValXaa: 0.0 ± 0.0
Trp
0.768TrpAla: 0.768 ± 0.178
0.0TrpCys: 0.0 ± 0.0
0.256TrpAsp: 0.256 ± 0.162
0.0TrpGlu: 0.0 ± 0.0
0.256TrpPhe: 0.256 ± 0.162
1.024TrpGly: 1.024 ± 0.746
0.256TrpHis: 0.256 ± 0.162
0.512TrpIle: 0.512 ± 0.14
1.024TrpLys: 1.024 ± 0.279
1.535TrpLeu: 1.535 ± 0.617
0.512TrpMet: 0.512 ± 0.432
1.024TrpAsn: 1.024 ± 0.279
0.512TrpPro: 0.512 ± 0.409
0.0TrpGln: 0.0 ± 0.0
0.512TrpArg: 0.512 ± 0.389
1.024TrpSer: 1.024 ± 0.309
1.279TrpThr: 1.279 ± 0.308
1.279TrpVal: 1.279 ± 0.342
0.256TrpTrp: 0.256 ± 0.162
0.256TrpTyr: 0.256 ± 0.162
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.535TyrAla: 1.535 ± 0.809
0.768TyrCys: 0.768 ± 0.733
1.535TyrAsp: 1.535 ± 0.45
1.279TyrGlu: 1.279 ± 0.344
1.024TyrPhe: 1.024 ± 0.309
1.279TyrGly: 1.279 ± 0.344
0.768TyrHis: 0.768 ± 0.178
2.559TyrIle: 2.559 ± 0.612
1.279TyrLys: 1.279 ± 0.276
2.047TyrLeu: 2.047 ± 0.442
1.279TyrMet: 1.279 ± 0.308
2.559TyrAsn: 2.559 ± 0.845
1.791TyrPro: 1.791 ± 1.067
1.024TyrGln: 1.024 ± 0.774
1.535TyrArg: 1.535 ± 0.617
1.791TyrSer: 1.791 ± 0.717
2.047TyrThr: 2.047 ± 0.363
1.791TyrVal: 1.791 ± 1.131
0.512TyrTrp: 0.512 ± 0.14
0.768TyrTyr: 0.768 ± 0.459
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3909 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski