Amino acid dipepetide frequency for Japanese macaque simian foamy virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.031AlaAla: 5.031 ± 2.047
1.118AlaCys: 1.118 ± 0.364
2.795AlaAsp: 2.795 ± 0.609
3.074AlaGlu: 3.074 ± 1.07
1.677AlaPhe: 1.677 ± 0.424
3.074AlaGly: 3.074 ± 1.451
2.236AlaHis: 2.236 ± 1.189
1.956AlaIle: 1.956 ± 0.687
2.515AlaLys: 2.515 ± 1.203
6.708AlaLeu: 6.708 ± 0.713
1.677AlaMet: 1.677 ± 0.606
3.633AlaAsn: 3.633 ± 0.405
4.751AlaPro: 4.751 ± 2.371
3.354AlaGln: 3.354 ± 0.807
3.633AlaArg: 3.633 ± 1.301
5.59AlaSer: 5.59 ± 2.685
3.633AlaThr: 3.633 ± 0.468
3.633AlaVal: 3.633 ± 0.981
0.0AlaTrp: 0.0 ± 0.0
2.515AlaTyr: 2.515 ± 0.688
0.0AlaXaa: 0.0 ± 0.0
Cys
0.279CysAla: 0.279 ± 0.234
0.279CysCys: 0.279 ± 0.234
0.838CysAsp: 0.838 ± 0.62
0.559CysGlu: 0.559 ± 0.693
1.118CysPhe: 1.118 ± 0.631
0.559CysGly: 0.559 ± 0.346
0.0CysHis: 0.0 ± 0.0
1.677CysIle: 1.677 ± 0.305
1.118CysLys: 1.118 ± 0.401
1.677CysLeu: 1.677 ± 0.384
1.118CysMet: 1.118 ± 0.717
1.397CysAsn: 1.397 ± 0.544
0.559CysPro: 0.559 ± 0.468
0.279CysGln: 0.279 ± 0.347
1.397CysArg: 1.397 ± 0.554
0.838CysSer: 0.838 ± 0.702
0.0CysThr: 0.0 ± 0.0
0.279CysVal: 0.279 ± 0.234
0.559CysTrp: 0.559 ± 0.315
1.118CysTyr: 1.118 ± 0.631
0.0CysXaa: 0.0 ± 0.0
Asp
1.397AspAla: 1.397 ± 0.619
0.838AspCys: 0.838 ± 0.702
2.236AspAsp: 2.236 ± 0.765
3.074AspGlu: 3.074 ± 0.913
1.118AspPhe: 1.118 ± 0.7
1.677AspGly: 1.677 ± 0.754
1.677AspHis: 1.677 ± 0.88
3.913AspIle: 3.913 ± 0.937
1.956AspLys: 1.956 ± 0.476
5.031AspLeu: 5.031 ± 1.578
0.838AspMet: 0.838 ± 0.632
2.236AspAsn: 2.236 ± 0.827
5.031AspPro: 5.031 ± 1.987
5.031AspGln: 5.031 ± 1.255
1.118AspArg: 1.118 ± 0.409
1.956AspSer: 1.956 ± 0.679
0.559AspThr: 0.559 ± 0.266
1.956AspVal: 1.956 ± 0.424
2.795AspTrp: 2.795 ± 1.035
2.795AspTyr: 2.795 ± 0.648
0.0AspXaa: 0.0 ± 0.0
Glu
2.795GluAla: 2.795 ± 0.663
1.397GluCys: 1.397 ± 0.892
4.751GluAsp: 4.751 ± 2.62
4.192GluGlu: 4.192 ± 1.023
1.956GluPhe: 1.956 ± 0.666
5.31GluGly: 5.31 ± 1.575
0.838GluHis: 0.838 ± 0.311
5.59GluIle: 5.59 ± 0.945
3.074GluLys: 3.074 ± 1.755
5.031GluLeu: 5.031 ± 1.707
1.677GluMet: 1.677 ± 0.802
2.795GluAsn: 2.795 ± 0.743
1.956GluPro: 1.956 ± 0.876
2.236GluGln: 2.236 ± 0.538
3.074GluArg: 3.074 ± 0.88
1.677GluSer: 1.677 ± 0.66
2.236GluThr: 2.236 ± 1.044
3.913GluVal: 3.913 ± 0.83
0.279GluTrp: 0.279 ± 0.234
1.677GluTyr: 1.677 ± 0.609
0.0GluXaa: 0.0 ± 0.0
Phe
2.236PheAla: 2.236 ± 0.753
0.279PheCys: 0.279 ± 0.234
1.397PheAsp: 1.397 ± 0.554
0.838PheGlu: 0.838 ± 0.62
0.279PhePhe: 0.279 ± 0.209
2.795PheGly: 2.795 ± 0.949
0.559PheHis: 0.559 ± 0.315
1.956PheIle: 1.956 ± 0.306
1.118PheLys: 1.118 ± 0.359
3.354PheLeu: 3.354 ± 0.918
0.0PheMet: 0.0 ± 0.0
0.559PheAsn: 0.559 ± 0.266
1.677PhePro: 1.677 ± 0.687
0.838PheGln: 0.838 ± 0.266
0.559PheArg: 0.559 ± 0.346
1.397PheSer: 1.397 ± 0.807
1.677PheThr: 1.677 ± 0.663
0.838PheVal: 0.838 ± 0.448
0.838PheTrp: 0.838 ± 0.311
1.397PheTyr: 1.397 ± 0.792
0.0PheXaa: 0.0 ± 0.0
Gly
2.515GlyAla: 2.515 ± 0.787
0.559GlyCys: 0.559 ± 0.346
3.354GlyAsp: 3.354 ± 1.183
1.397GlyGlu: 1.397 ± 0.996
2.236GlyPhe: 2.236 ± 0.951
3.913GlyGly: 3.913 ± 3.059
1.956GlyHis: 1.956 ± 0.596
3.354GlyIle: 3.354 ± 0.92
2.236GlyLys: 2.236 ± 1.189
3.633GlyLeu: 3.633 ± 0.705
1.677GlyMet: 1.677 ± 0.375
4.192GlyAsn: 4.192 ± 1.966
4.751GlyPro: 4.751 ± 1.947
4.751GlyGln: 4.751 ± 2.602
4.751GlyArg: 4.751 ± 2.889
4.472GlySer: 4.472 ± 0.934
3.354GlyThr: 3.354 ± 0.953
3.354GlyVal: 3.354 ± 1.372
0.559GlyTrp: 0.559 ± 0.589
3.074GlyTyr: 3.074 ± 1.182
0.0GlyXaa: 0.0 ± 0.0
His
1.118HisAla: 1.118 ± 0.954
0.838HisCys: 0.838 ± 0.24
0.838HisAsp: 0.838 ± 0.266
1.118HisGlu: 1.118 ± 0.401
0.559HisPhe: 0.559 ± 0.502
0.838HisGly: 0.838 ± 0.371
0.559HisHis: 0.559 ± 0.502
0.838HisIle: 0.838 ± 0.481
0.838HisLys: 0.838 ± 0.266
3.913HisLeu: 3.913 ± 1.124
0.0HisMet: 0.0 ± 0.0
0.838HisAsn: 0.838 ± 0.626
3.074HisPro: 3.074 ± 0.379
2.236HisGln: 2.236 ± 0.845
1.677HisArg: 1.677 ± 0.742
1.677HisSer: 1.677 ± 0.514
1.397HisThr: 1.397 ± 0.558
2.515HisVal: 2.515 ± 0.855
0.838HisTrp: 0.838 ± 0.449
0.559HisTyr: 0.559 ± 0.502
0.0HisXaa: 0.0 ± 0.0
Ile
4.472IleAla: 4.472 ± 1.254
1.397IleCys: 1.397 ± 0.721
2.236IleAsp: 2.236 ± 0.588
1.397IleGlu: 1.397 ± 0.785
0.559IlePhe: 0.559 ± 0.346
1.956IleGly: 1.956 ± 0.306
1.956IleHis: 1.956 ± 0.347
4.472IleIle: 4.472 ± 0.403
4.472IleLys: 4.472 ± 1.124
5.869IleLeu: 5.869 ± 1.122
1.118IleMet: 1.118 ± 0.443
3.913IleAsn: 3.913 ± 1.326
5.31IlePro: 5.31 ± 1.051
5.31IleGln: 5.31 ± 0.977
3.913IleArg: 3.913 ± 0.8
3.633IleSer: 3.633 ± 0.963
2.795IleThr: 2.795 ± 0.95
3.354IleVal: 3.354 ± 1.247
0.559IleTrp: 0.559 ± 0.323
1.118IleTyr: 1.118 ± 0.666
0.0IleXaa: 0.0 ± 0.0
Lys
3.633LysAla: 3.633 ± 1.641
1.677LysCys: 1.677 ± 0.946
3.913LysAsp: 3.913 ± 1.036
3.074LysGlu: 3.074 ± 0.742
1.118LysPhe: 1.118 ± 0.595
1.397LysGly: 1.397 ± 0.807
2.236LysHis: 2.236 ± 0.578
2.795LysIle: 2.795 ± 1.304
2.795LysLys: 2.795 ± 0.809
4.751LysLeu: 4.751 ± 1.27
1.956LysMet: 1.956 ± 1.411
1.956LysAsn: 1.956 ± 1.352
4.472LysPro: 4.472 ± 1.557
3.354LysGln: 3.354 ± 1.362
2.515LysArg: 2.515 ± 0.647
3.633LysSer: 3.633 ± 0.993
2.795LysThr: 2.795 ± 1.072
3.913LysVal: 3.913 ± 1.41
1.397LysTrp: 1.397 ± 0.524
2.236LysTyr: 2.236 ± 1.057
0.0LysXaa: 0.0 ± 0.0
Leu
6.428LeuAla: 6.428 ± 0.859
0.838LeuCys: 0.838 ± 0.62
4.192LeuAsp: 4.192 ± 0.883
5.869LeuGlu: 5.869 ± 1.177
1.956LeuPhe: 1.956 ± 0.636
6.428LeuGly: 6.428 ± 1.881
3.074LeuHis: 3.074 ± 0.742
4.751LeuIle: 4.751 ± 0.592
7.826LeuLys: 7.826 ± 2.71
11.179LeuLeu: 11.179 ± 0.9
0.838LeuMet: 0.838 ± 0.448
5.31LeuAsn: 5.31 ± 0.945
6.428LeuPro: 6.428 ± 0.985
6.149LeuGln: 6.149 ± 1.386
6.708LeuArg: 6.708 ± 1.943
5.31LeuSer: 5.31 ± 1.51
6.428LeuThr: 6.428 ± 1.254
4.192LeuVal: 4.192 ± 0.994
1.397LeuTrp: 1.397 ± 0.395
3.074LeuTyr: 3.074 ± 0.971
0.0LeuXaa: 0.0 ± 0.0
Met
2.236MetAla: 2.236 ± 0.889
0.279MetCys: 0.279 ± 0.315
1.397MetAsp: 1.397 ± 0.394
1.677MetGlu: 1.677 ± 0.763
0.559MetPhe: 0.559 ± 0.442
1.118MetGly: 1.118 ± 0.519
0.279MetHis: 0.279 ± 0.209
1.118MetIle: 1.118 ± 0.621
1.397MetLys: 1.397 ± 0.545
1.397MetLeu: 1.397 ± 0.404
0.279MetMet: 0.279 ± 0.295
0.838MetAsn: 0.838 ± 0.448
0.559MetPro: 0.559 ± 0.346
1.397MetGln: 1.397 ± 1.473
1.118MetArg: 1.118 ± 1.004
1.956MetSer: 1.956 ± 0.774
2.236MetThr: 2.236 ± 0.627
0.838MetVal: 0.838 ± 0.46
0.279MetTrp: 0.279 ± 0.295
0.559MetTyr: 0.559 ± 0.327
0.0MetXaa: 0.0 ± 0.0
Asn
2.795AsnAla: 2.795 ± 1.155
0.559AsnCys: 0.559 ± 0.315
0.838AsnAsp: 0.838 ± 0.24
2.795AsnGlu: 2.795 ± 1.194
1.118AsnPhe: 1.118 ± 0.519
3.074AsnGly: 3.074 ± 0.627
0.559AsnHis: 0.559 ± 0.323
2.236AsnIle: 2.236 ± 0.402
2.795AsnLys: 2.795 ± 0.809
5.59AsnLeu: 5.59 ± 1.28
1.397AsnMet: 1.397 ± 0.269
2.795AsnAsn: 2.795 ± 0.512
3.633AsnPro: 3.633 ± 1.182
3.074AsnGln: 3.074 ± 0.496
2.515AsnArg: 2.515 ± 1.142
3.354AsnSer: 3.354 ± 1.128
2.795AsnThr: 2.795 ± 0.792
1.956AsnVal: 1.956 ± 0.853
0.279AsnTrp: 0.279 ± 0.234
1.397AsnTyr: 1.397 ± 0.404
0.0AsnXaa: 0.0 ± 0.0
Pro
6.708ProAla: 6.708 ± 3.427
0.559ProCys: 0.559 ± 0.346
2.236ProAsp: 2.236 ± 0.213
4.472ProGlu: 4.472 ± 1.365
1.397ProPhe: 1.397 ± 0.623
4.751ProGly: 4.751 ± 1.974
2.515ProHis: 2.515 ± 0.658
3.913ProIle: 3.913 ± 0.768
4.472ProLys: 4.472 ± 1.17
8.105ProLeu: 8.105 ± 1.089
1.956ProMet: 1.956 ± 0.451
1.397ProAsn: 1.397 ± 0.576
7.826ProPro: 7.826 ± 2.968
2.515ProGln: 2.515 ± 0.509
3.354ProArg: 3.354 ± 1.142
7.826ProSer: 7.826 ± 2.29
2.795ProThr: 2.795 ± 0.511
4.192ProVal: 4.192 ± 0.731
1.397ProTrp: 1.397 ± 0.996
2.795ProTyr: 2.795 ± 0.988
0.0ProXaa: 0.0 ± 0.0
Gln
2.795GlnAla: 2.795 ± 0.676
1.118GlnCys: 1.118 ± 0.359
2.236GlnAsp: 2.236 ± 1.008
4.751GlnGlu: 4.751 ± 1.077
1.397GlnPhe: 1.397 ± 0.394
4.751GlnGly: 4.751 ± 1.373
2.236GlnHis: 2.236 ± 0.757
2.236GlnIle: 2.236 ± 0.785
2.795GlnLys: 2.795 ± 0.88
5.869GlnLeu: 5.869 ± 1.91
1.118GlnMet: 1.118 ± 0.59
2.515GlnAsn: 2.515 ± 0.564
5.59GlnPro: 5.59 ± 2.117
6.428GlnGln: 6.428 ± 1.673
2.236GlnArg: 2.236 ± 1.303
4.192GlnSer: 4.192 ± 2.301
1.956GlnThr: 1.956 ± 0.636
3.354GlnVal: 3.354 ± 0.888
0.838GlnTrp: 0.838 ± 0.448
3.074GlnTyr: 3.074 ± 0.379
0.0GlnXaa: 0.0 ± 0.0
Arg
1.956ArgAla: 1.956 ± 0.961
1.118ArgCys: 1.118 ± 0.48
2.515ArgAsp: 2.515 ± 0.714
3.633ArgGlu: 3.633 ± 0.412
0.559ArgPhe: 0.559 ± 0.323
4.751ArgGly: 4.751 ± 3.363
0.838ArgHis: 0.838 ± 0.371
3.074ArgIle: 3.074 ± 0.343
1.956ArgLys: 1.956 ± 0.931
5.31ArgLeu: 5.31 ± 1.079
1.677ArgMet: 1.677 ± 0.518
1.677ArgAsn: 1.677 ± 0.375
4.192ArgPro: 4.192 ± 1.15
2.236ArgGln: 2.236 ± 0.648
3.354ArgArg: 3.354 ± 0.828
3.354ArgSer: 3.354 ± 1.116
2.515ArgThr: 2.515 ± 0.907
2.515ArgVal: 2.515 ± 0.603
1.118ArgTrp: 1.118 ± 0.533
1.397ArgTyr: 1.397 ± 0.493
0.0ArgXaa: 0.0 ± 0.0
Ser
5.31SerAla: 5.31 ± 1.436
1.397SerCys: 1.397 ± 0.702
5.31SerAsp: 5.31 ± 1.087
3.633SerGlu: 3.633 ± 0.826
2.515SerPhe: 2.515 ± 0.515
5.59SerGly: 5.59 ± 1.774
1.397SerHis: 1.397 ± 0.763
4.751SerIle: 4.751 ± 1.285
3.354SerLys: 3.354 ± 1.162
4.751SerLeu: 4.751 ± 1.277
0.838SerMet: 0.838 ± 0.395
1.956SerAsn: 1.956 ± 0.905
5.031SerPro: 5.031 ± 0.844
3.074SerGln: 3.074 ± 1.433
2.795SerArg: 2.795 ± 0.729
7.546SerSer: 7.546 ± 1.732
5.031SerThr: 5.031 ± 0.985
2.515SerVal: 2.515 ± 0.517
1.956SerTrp: 1.956 ± 0.5
2.236SerTyr: 2.236 ± 0.746
0.0SerXaa: 0.0 ± 0.0
Thr
4.192ThrAla: 4.192 ± 1.213
0.838ThrCys: 0.838 ± 0.467
1.397ThrAsp: 1.397 ± 0.296
2.795ThrGlu: 2.795 ± 0.809
1.677ThrPhe: 1.677 ± 0.817
2.515ThrGly: 2.515 ± 0.348
1.118ThrHis: 1.118 ± 0.684
2.236ThrIle: 2.236 ± 0.845
3.633ThrLys: 3.633 ± 1.368
4.472ThrLeu: 4.472 ± 0.547
1.956ThrMet: 1.956 ± 0.666
1.677ThrAsn: 1.677 ± 0.521
4.192ThrPro: 4.192 ± 1.619
2.515ThrGln: 2.515 ± 0.815
1.956ThrArg: 1.956 ± 0.688
5.59ThrSer: 5.59 ± 0.835
2.795ThrThr: 2.795 ± 1.019
4.192ThrVal: 4.192 ± 0.313
1.677ThrTrp: 1.677 ± 0.722
1.956ThrTyr: 1.956 ± 1.048
0.0ThrXaa: 0.0 ± 0.0
Val
4.472ValAla: 4.472 ± 1.56
0.0ValCys: 0.0 ± 0.0
2.515ValAsp: 2.515 ± 0.682
3.074ValGlu: 3.074 ± 1.204
1.397ValPhe: 1.397 ± 0.475
2.515ValGly: 2.515 ± 0.515
1.118ValHis: 1.118 ± 0.288
5.031ValIle: 5.031 ± 1.412
3.074ValLys: 3.074 ± 0.918
6.708ValLeu: 6.708 ± 1.285
0.559ValMet: 0.559 ± 0.442
2.515ValAsn: 2.515 ± 0.389
3.354ValPro: 3.354 ± 0.515
2.795ValGln: 2.795 ± 0.916
1.118ValArg: 1.118 ± 0.222
3.074ValSer: 3.074 ± 0.766
5.59ValThr: 5.59 ± 1.137
4.472ValVal: 4.472 ± 1.215
0.838ValTrp: 0.838 ± 0.266
2.795ValTyr: 2.795 ± 0.443
0.0ValXaa: 0.0 ± 0.0
Trp
0.838TrpAla: 0.838 ± 0.377
0.279TrpCys: 0.279 ± 0.347
0.559TrpAsp: 0.559 ± 0.26
2.236TrpGlu: 2.236 ± 1.036
0.559TrpPhe: 0.559 ± 0.442
0.559TrpGly: 0.559 ± 0.589
0.838TrpHis: 0.838 ± 0.467
1.118TrpIle: 1.118 ± 0.392
1.956TrpLys: 1.956 ± 0.586
1.956TrpLeu: 1.956 ± 0.613
0.279TrpMet: 0.279 ± 0.209
1.118TrpAsn: 1.118 ± 0.684
0.838TrpPro: 0.838 ± 0.266
1.397TrpGln: 1.397 ± 0.53
1.397TrpArg: 1.397 ± 0.219
0.559TrpSer: 0.559 ± 0.417
0.559TrpThr: 0.559 ± 0.26
1.118TrpVal: 1.118 ± 1.004
0.559TrpTrp: 0.559 ± 0.266
0.559TrpTyr: 0.559 ± 0.26
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.397TyrAla: 1.397 ± 0.833
0.279TyrCys: 0.279 ± 0.209
1.956TyrAsp: 1.956 ± 0.931
2.795TyrGlu: 2.795 ± 1.071
0.838TyrPhe: 0.838 ± 0.444
2.515TyrGly: 2.515 ± 1.615
0.279TyrHis: 0.279 ± 0.209
2.795TyrIle: 2.795 ± 0.884
2.236TyrLys: 2.236 ± 0.587
3.074TyrLeu: 3.074 ± 1.776
0.279TyrMet: 0.279 ± 0.234
2.236TyrAsn: 2.236 ± 1.072
1.956TyrPro: 1.956 ± 0.554
2.515TyrGln: 2.515 ± 0.657
0.559TyrArg: 0.559 ± 0.35
3.354TyrSer: 3.354 ± 0.596
2.236TyrThr: 2.236 ± 0.82
3.913TyrVal: 3.913 ± 1.08
1.118TyrTrp: 1.118 ± 0.623
2.795TyrTyr: 2.795 ± 1.161
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3579 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski