Amino acid dipepetide frequency for Gouleako virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.122AlaAla: 2.122 ± 2.653
1.212AlaCys: 1.212 ± 0.909
2.122AlaAsp: 2.122 ± 0.608
3.031AlaGlu: 3.031 ± 1.274
2.425AlaPhe: 2.425 ± 1.585
2.425AlaGly: 2.425 ± 2.539
0.303AlaHis: 0.303 ± 0.173
3.941AlaIle: 3.941 ± 0.369
4.244AlaLys: 4.244 ± 1.139
3.941AlaLeu: 3.941 ± 1.606
1.819AlaMet: 1.819 ± 0.684
0.909AlaAsn: 0.909 ± 0.249
2.728AlaPro: 2.728 ± 1.66
1.212AlaGln: 1.212 ± 0.791
2.425AlaArg: 2.425 ± 1.015
6.062AlaSer: 6.062 ± 1.161
2.122AlaThr: 2.122 ± 0.916
2.728AlaVal: 2.728 ± 2.407
0.606AlaTrp: 0.606 ± 0.217
0.303AlaTyr: 0.303 ± 0.976
0.0AlaXaa: 0.0 ± 0.0
Cys
0.303CysAla: 0.303 ± 0.173
0.606CysCys: 0.606 ± 0.346
1.212CysAsp: 1.212 ± 0.795
0.909CysGlu: 0.909 ± 0.249
0.606CysPhe: 0.606 ± 0.217
1.212CysGly: 1.212 ± 0.795
1.212CysHis: 1.212 ± 1.231
2.425CysIle: 2.425 ± 0.92
2.122CysLys: 2.122 ± 1.698
2.728CysLeu: 2.728 ± 1.188
0.606CysMet: 0.606 ± 0.978
0.909CysAsn: 0.909 ± 0.249
1.819CysPro: 1.819 ± 1.396
0.303CysGln: 0.303 ± 0.173
0.909CysArg: 0.909 ± 0.249
4.244CysSer: 4.244 ± 2.984
0.909CysThr: 0.909 ± 0.909
1.516CysVal: 1.516 ± 1.515
0.606CysTrp: 0.606 ± 0.217
2.122CysTyr: 2.122 ± 2.12
0.0CysXaa: 0.0 ± 0.0
Asp
1.819AspAla: 1.819 ± 0.841
2.122AspCys: 2.122 ± 1.291
4.547AspAsp: 4.547 ± 0.699
3.941AspGlu: 3.941 ± 1.533
2.728AspPhe: 2.728 ± 1.188
4.85AspGly: 4.85 ± 0.285
1.212AspHis: 1.212 ± 0.371
4.547AspIle: 4.547 ± 0.987
2.122AspLys: 2.122 ± 0.917
4.85AspLeu: 4.85 ± 1.733
0.909AspMet: 0.909 ± 0.249
2.122AspAsn: 2.122 ± 0.85
1.516AspPro: 1.516 ± 0.637
0.909AspGln: 0.909 ± 0.498
2.122AspArg: 2.122 ± 0.48
3.334AspSer: 3.334 ± 0.919
2.122AspThr: 2.122 ± 0.608
4.244AspVal: 4.244 ± 1.163
0.606AspTrp: 0.606 ± 0.217
1.516AspTyr: 1.516 ± 0.705
0.0AspXaa: 0.0 ± 0.0
Glu
3.941GluAla: 3.941 ± 1.026
3.031GluCys: 3.031 ± 0.868
3.941GluAsp: 3.941 ± 1.258
5.759GluGlu: 5.759 ± 2.216
4.244GluPhe: 4.244 ± 1.7
2.425GluGly: 2.425 ± 1.018
0.909GluHis: 0.909 ± 0.821
2.425GluIle: 2.425 ± 0.741
4.85GluLys: 4.85 ± 1.256
5.759GluLeu: 5.759 ± 1.931
2.122GluMet: 2.122 ± 1.214
2.122GluAsn: 2.122 ± 0.581
2.122GluPro: 2.122 ± 0.581
1.516GluGln: 1.516 ± 0.522
2.425GluArg: 2.425 ± 1.018
5.456GluSer: 5.456 ± 1.212
4.547GluThr: 4.547 ± 0.943
5.153GluVal: 5.153 ± 1.504
2.425GluTrp: 2.425 ± 0.741
1.819GluTyr: 1.819 ± 0.841
0.0GluXaa: 0.0 ± 0.0
Phe
2.122PheAla: 2.122 ± 0.581
1.212PheCys: 1.212 ± 1.212
3.334PheAsp: 3.334 ± 1.366
2.425PheGlu: 2.425 ± 0.741
2.425PhePhe: 2.425 ± 1.018
3.334PheGly: 3.334 ± 0.152
1.516PheHis: 1.516 ± 0.434
1.819PheIle: 1.819 ± 1.039
3.031PheLys: 3.031 ± 1.044
3.334PheLeu: 3.334 ± 0.152
1.212PheMet: 1.212 ± 0.795
1.819PheAsn: 1.819 ± 0.684
2.122PhePro: 2.122 ± 1.212
0.909PheGln: 0.909 ± 0.519
3.637PheArg: 3.637 ± 1.367
3.334PheSer: 3.334 ± 0.919
1.819PheThr: 1.819 ± 0.585
2.122PheVal: 2.122 ± 0.917
0.303PheTrp: 0.303 ± 0.303
1.516PheTyr: 1.516 ± 0.522
0.0PheXaa: 0.0 ± 0.0
Gly
2.728GlyAla: 2.728 ± 0.387
1.516GlyCys: 1.516 ± 1.095
2.425GlyAsp: 2.425 ± 1.335
3.031GlyGlu: 3.031 ± 0.575
3.334GlyPhe: 3.334 ± 0.526
3.941GlyGly: 3.941 ± 0.081
1.212GlyHis: 1.212 ± 0.434
4.244GlyIle: 4.244 ± 0.97
3.031GlyLys: 3.031 ± 1.274
7.275GlyLeu: 7.275 ± 2.214
0.909GlyMet: 0.909 ± 0.519
2.122GlyAsn: 2.122 ± 0.608
1.819GlyPro: 1.819 ± 0.995
1.819GlyGln: 1.819 ± 0.65
1.516GlyArg: 1.516 ± 1.059
4.85GlySer: 4.85 ± 1.45
3.637GlyThr: 3.637 ± 2.48
5.456GlyVal: 5.456 ± 0.505
1.212GlyTrp: 1.212 ± 0.434
1.819GlyTyr: 1.819 ± 0.65
0.0GlyXaa: 0.0 ± 0.0
His
1.212HisAla: 1.212 ± 1.812
0.606HisCys: 0.606 ± 0.606
0.909HisAsp: 0.909 ± 0.519
1.212HisGlu: 1.212 ± 0.791
1.516HisPhe: 1.516 ± 1.095
1.516HisGly: 1.516 ± 0.522
0.606HisHis: 0.606 ± 0.346
3.031HisIle: 3.031 ± 0.868
0.606HisLys: 0.606 ± 0.978
1.516HisLeu: 1.516 ± 0.434
0.606HisMet: 0.606 ± 0.346
0.303HisAsn: 0.303 ± 0.303
0.606HisPro: 0.606 ± 0.346
1.516HisGln: 1.516 ± 0.798
1.212HisArg: 1.212 ± 0.692
0.303HisSer: 0.303 ± 0.303
0.606HisThr: 0.606 ± 0.217
0.606HisVal: 0.606 ± 0.217
0.606HisTrp: 0.606 ± 0.217
0.909HisTyr: 0.909 ± 0.249
0.0HisXaa: 0.0 ± 0.0
Ile
3.637IleAla: 3.637 ± 2.183
1.516IleCys: 1.516 ± 1.095
2.122IleAsp: 2.122 ± 0.85
4.85IleGlu: 4.85 ± 1.725
3.637IlePhe: 3.637 ± 0.998
3.334IleGly: 3.334 ± 1.358
0.909IleHis: 0.909 ± 0.821
2.728IleIle: 2.728 ± 0.85
3.334IleLys: 3.334 ± 1.204
4.547IleLeu: 4.547 ± 1.247
2.122IleMet: 2.122 ± 0.608
2.728IleAsn: 2.728 ± 2.431
3.637IlePro: 3.637 ± 0.223
3.334IleGln: 3.334 ± 2.292
4.547IleArg: 4.547 ± 1.493
9.094IleSer: 9.094 ± 2.141
3.941IleThr: 3.941 ± 1.026
5.759IleVal: 5.759 ± 1.714
0.606IleTrp: 0.606 ± 0.217
1.212IleTyr: 1.212 ± 0.371
0.0IleXaa: 0.0 ± 0.0
Lys
3.334LysAla: 3.334 ± 1.207
1.516LysCys: 1.516 ± 0.705
3.031LysAsp: 3.031 ± 0.829
2.728LysGlu: 2.728 ± 0.714
2.728LysPhe: 2.728 ± 1.188
2.122LysGly: 2.122 ± 0.608
0.303LysHis: 0.303 ± 0.303
3.941LysIle: 3.941 ± 0.369
7.275LysLys: 7.275 ± 1.305
6.972LysLeu: 6.972 ± 1.822
2.425LysMet: 2.425 ± 1.018
2.425LysAsn: 2.425 ± 0.39
1.819LysPro: 1.819 ± 0.585
1.516LysGln: 1.516 ± 0.522
3.637LysArg: 3.637 ± 1.104
5.456LysSer: 5.456 ± 1.104
5.153LysThr: 5.153 ± 0.643
6.366LysVal: 6.366 ± 2.751
0.909LysTrp: 0.909 ± 0.249
3.941LysTyr: 3.941 ± 0.519
0.0LysXaa: 0.0 ± 0.0
Leu
4.244LeuAla: 4.244 ± 3.044
1.516LeuCys: 1.516 ± 0.522
3.941LeuAsp: 3.941 ± 1.258
7.275LeuGlu: 7.275 ± 0.11
3.031LeuPhe: 3.031 ± 1.323
5.153LeuGly: 5.153 ± 0.982
3.031LeuHis: 3.031 ± 0.868
7.275LeuIle: 7.275 ± 1.651
6.366LeuLys: 6.366 ± 1.728
6.669LeuLeu: 6.669 ± 0.634
2.728LeuMet: 2.728 ± 0.387
2.425LeuAsn: 2.425 ± 0.867
3.334LeuPro: 3.334 ± 0.975
4.244LeuGln: 4.244 ± 2.032
4.85LeuArg: 4.85 ± 1.733
12.125LeuSer: 12.125 ± 0.949
5.153LeuThr: 5.153 ± 2.943
4.244LeuVal: 4.244 ± 0.135
2.425LeuTrp: 2.425 ± 1.018
2.425LeuTyr: 2.425 ± 1.015
0.0LeuXaa: 0.0 ± 0.0
Met
2.122MetAla: 2.122 ± 0.48
0.909MetCys: 0.909 ± 0.844
1.516MetAsp: 1.516 ± 0.522
2.425MetGlu: 2.425 ± 0.628
0.303MetPhe: 0.303 ± 0.303
1.819MetGly: 1.819 ± 0.995
1.212MetHis: 1.212 ± 0.434
1.212MetIle: 1.212 ± 0.909
2.122MetLys: 2.122 ± 0.581
3.031MetLeu: 3.031 ± 1.261
2.122MetMet: 2.122 ± 0.85
0.606MetAsn: 0.606 ± 0.217
0.606MetPro: 0.606 ± 0.217
0.303MetGln: 0.303 ± 0.173
3.031MetArg: 3.031 ± 1.374
2.728MetSer: 2.728 ± 0.387
1.212MetThr: 1.212 ± 0.434
1.212MetVal: 1.212 ± 0.692
0.0MetTrp: 0.0 ± 0.0
0.606MetTyr: 0.606 ± 0.346
0.0MetXaa: 0.0 ± 0.0
Asn
2.122AsnAla: 2.122 ± 0.48
0.606AsnCys: 0.606 ± 0.217
1.819AsnAsp: 1.819 ± 0.607
2.122AsnGlu: 2.122 ± 0.85
2.425AsnPhe: 2.425 ± 1.018
1.819AsnGly: 1.819 ± 0.499
0.0AsnHis: 0.0 ± 0.0
2.122AsnIle: 2.122 ± 0.639
3.334AsnLys: 3.334 ± 0.919
3.941AsnLeu: 3.941 ± 1.255
1.212AsnMet: 1.212 ± 0.371
1.212AsnAsn: 1.212 ± 0.371
1.212AsnPro: 1.212 ± 1.77
1.819AsnGln: 1.819 ± 0.684
0.303AsnArg: 0.303 ± 0.173
4.244AsnSer: 4.244 ± 0.135
2.425AsnThr: 2.425 ± 0.674
1.819AsnVal: 1.819 ± 0.65
1.212AsnTrp: 1.212 ± 0.371
0.606AsnTyr: 0.606 ± 0.885
0.0AsnXaa: 0.0 ± 0.0
Pro
1.212ProAla: 1.212 ± 0.909
0.606ProCys: 0.606 ± 0.606
2.728ProAsp: 2.728 ± 0.889
3.031ProGlu: 3.031 ± 1.274
1.516ProPhe: 1.516 ± 0.866
2.122ProGly: 2.122 ± 0.608
0.606ProHis: 0.606 ± 0.217
3.637ProIle: 3.637 ± 2.001
2.122ProLys: 2.122 ± 0.608
3.031ProLeu: 3.031 ± 1.044
1.212ProMet: 1.212 ± 0.434
2.425ProAsn: 2.425 ± 0.55
0.303ProPro: 0.303 ± 0.976
2.728ProGln: 2.728 ± 1.188
2.122ProArg: 2.122 ± 1.567
2.728ProSer: 2.728 ± 0.889
3.334ProThr: 3.334 ± 0.834
1.516ProVal: 1.516 ± 0.637
0.606ProTrp: 0.606 ± 0.346
1.516ProTyr: 1.516 ± 1.703
0.0ProXaa: 0.0 ± 0.0
Gln
2.122GlnAla: 2.122 ± 0.916
0.909GlnCys: 0.909 ± 0.498
1.212GlnAsp: 1.212 ± 0.371
2.425GlnGlu: 2.425 ± 1.015
1.516GlnPhe: 1.516 ± 0.522
2.122GlnGly: 2.122 ± 0.581
0.606GlnHis: 0.606 ± 0.885
1.516GlnIle: 1.516 ± 0.434
1.819GlnLys: 1.819 ± 0.841
2.728GlnLeu: 2.728 ± 0.889
0.909GlnMet: 0.909 ± 0.821
2.728GlnAsn: 2.728 ± 0.365
1.516GlnPro: 1.516 ± 0.434
0.909GlnGln: 0.909 ± 0.249
0.606GlnArg: 0.606 ± 0.217
1.212GlnSer: 1.212 ± 0.371
2.122GlnThr: 2.122 ± 0.48
3.334GlnVal: 3.334 ± 0.403
0.303GlnTrp: 0.303 ± 0.173
0.909GlnTyr: 0.909 ± 0.844
0.0GlnXaa: 0.0 ± 0.0
Arg
4.244ArgAla: 4.244 ± 0.97
0.303ArgCys: 0.303 ± 0.173
3.637ArgAsp: 3.637 ± 0.998
4.547ArgGlu: 4.547 ± 1.868
1.516ArgPhe: 1.516 ± 0.866
3.334ArgGly: 3.334 ± 1.358
0.606ArgHis: 0.606 ± 0.346
3.941ArgIle: 3.941 ± 0.812
1.516ArgLys: 1.516 ± 0.522
5.759ArgLeu: 5.759 ± 0.766
0.909ArgMet: 0.909 ± 0.519
1.212ArgAsn: 1.212 ± 0.371
3.031ArgPro: 3.031 ± 0.237
1.516ArgGln: 1.516 ± 0.522
2.425ArgArg: 2.425 ± 0.741
5.456ArgSer: 5.456 ± 0.783
3.637ArgThr: 3.637 ± 1.112
2.425ArgVal: 2.425 ± 0.628
0.909ArgTrp: 0.909 ± 0.821
1.819ArgTyr: 1.819 ± 0.684
0.0ArgXaa: 0.0 ± 0.0
Ser
3.637SerAla: 3.637 ± 1.622
3.031SerCys: 3.031 ± 3.029
6.669SerAsp: 6.669 ± 1.682
4.547SerGlu: 4.547 ± 1.868
2.728SerPhe: 2.728 ± 0.748
6.972SerGly: 6.972 ± 2.387
1.819SerHis: 1.819 ± 0.499
6.366SerIle: 6.366 ± 1.354
8.487SerLys: 8.487 ± 0.766
9.094SerLeu: 9.094 ± 0.955
2.425SerMet: 2.425 ± 0.92
2.425SerAsn: 2.425 ± 0.674
2.122SerPro: 2.122 ± 0.608
2.122SerGln: 2.122 ± 0.639
6.972SerArg: 6.972 ± 2.298
9.397SerSer: 9.397 ± 1.523
6.669SerThr: 6.669 ± 1.452
6.366SerVal: 6.366 ± 1.4
0.606SerTrp: 0.606 ± 0.346
4.244SerTyr: 4.244 ± 0.97
0.0SerXaa: 0.0 ± 0.0
Thr
2.425ThrAla: 2.425 ± 0.867
1.819ThrCys: 1.819 ± 1.396
2.728ThrAsp: 2.728 ± 1.188
4.244ThrGlu: 4.244 ± 1.409
1.819ThrPhe: 1.819 ± 0.585
4.244ThrGly: 4.244 ± 3.133
0.909ThrHis: 0.909 ± 0.519
4.85ThrIle: 4.85 ± 0.78
3.031ThrLys: 3.031 ± 0.418
6.669ThrLeu: 6.669 ± 2.096
0.606ThrMet: 0.606 ± 0.885
2.425ThrAsn: 2.425 ± 1.018
3.941ThrPro: 3.941 ± 0.081
1.516ThrGln: 1.516 ± 0.637
3.941ThrArg: 3.941 ± 1.106
6.062ThrSer: 6.062 ± 1.371
5.153ThrThr: 5.153 ± 0.643
3.941ThrVal: 3.941 ± 2.292
0.606ThrTrp: 0.606 ± 0.217
1.516ThrTyr: 1.516 ± 1.699
0.0ThrXaa: 0.0 ± 0.0
Val
1.212ValAla: 1.212 ± 0.791
2.122ValCys: 2.122 ± 1.093
2.728ValAsp: 2.728 ± 0.387
6.062ValGlu: 6.062 ± 0.618
2.728ValPhe: 2.728 ± 1.131
3.031ValGly: 3.031 ± 1.434
2.122ValHis: 2.122 ± 0.608
3.941ValIle: 3.941 ± 0.081
4.547ValLys: 4.547 ± 0.943
6.669ValLeu: 6.669 ± 0.807
2.425ValMet: 2.425 ± 0.847
2.122ValAsn: 2.122 ± 0.608
3.031ValPro: 3.031 ± 0.868
2.728ValGln: 2.728 ± 0.748
3.031ValArg: 3.031 ± 0.418
6.366ValSer: 6.366 ± 1.67
4.244ValThr: 4.244 ± 2.593
2.728ValVal: 2.728 ± 1.166
0.909ValTrp: 0.909 ± 0.249
0.606ValTyr: 0.606 ± 0.606
0.0ValXaa: 0.0 ± 0.0
Trp
0.606TrpAla: 0.606 ± 0.346
0.606TrpCys: 0.606 ± 0.217
0.909TrpAsp: 0.909 ± 0.519
1.516TrpGlu: 1.516 ± 0.434
0.909TrpPhe: 0.909 ± 0.498
0.909TrpGly: 0.909 ± 0.909
0.303TrpHis: 0.303 ± 0.173
1.212TrpIle: 1.212 ± 0.434
0.909TrpLys: 0.909 ± 0.249
1.819TrpLeu: 1.819 ± 0.585
0.606TrpMet: 0.606 ± 0.21
1.516TrpAsn: 1.516 ± 0.866
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.212TrpArg: 1.212 ± 0.692
1.212TrpSer: 1.212 ± 0.692
0.909TrpThr: 0.909 ± 0.249
0.303TrpVal: 0.303 ± 0.173
0.0TrpTrp: 0.0 ± 0.0
0.303TrpTyr: 0.303 ± 0.303
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.516TyrAla: 1.516 ± 0.753
1.516TyrCys: 1.516 ± 0.434
0.606TyrAsp: 0.606 ± 0.606
0.909TyrGlu: 0.909 ± 0.821
1.212TyrPhe: 1.212 ± 0.692
1.212TyrGly: 1.212 ± 0.795
0.606TyrHis: 0.606 ± 0.978
2.425TyrIle: 2.425 ± 0.674
2.728TyrLys: 2.728 ± 0.889
2.122TyrLeu: 2.122 ± 1.728
0.909TyrMet: 0.909 ± 1.855
1.819TyrAsn: 1.819 ± 0.684
1.819TyrPro: 1.819 ± 0.607
0.606TyrGln: 0.606 ± 0.885
1.819TyrArg: 1.819 ± 0.499
3.031TyrSer: 3.031 ± 0.418
2.728TyrThr: 2.728 ± 0.365
1.819TyrVal: 1.819 ± 0.888
0.303TyrTrp: 0.303 ± 0.173
1.212TyrTyr: 1.212 ± 0.434
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (3300 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski