Amino acid dipepetide frequency for Methanobacterium phage psiM2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.984AlaAla: 3.984 ± 0.761
0.664AlaCys: 0.664 ± 0.323
1.593AlaAsp: 1.593 ± 0.454
4.647AlaGlu: 4.647 ± 0.77
1.726AlaPhe: 1.726 ± 0.666
3.984AlaGly: 3.984 ± 0.691
0.398AlaHis: 0.398 ± 0.175
3.054AlaIle: 3.054 ± 0.565
3.054AlaLys: 3.054 ± 0.66
7.967AlaLeu: 7.967 ± 1.063
1.062AlaMet: 1.062 ± 0.33
1.461AlaAsn: 1.461 ± 0.477
1.992AlaPro: 1.992 ± 0.493
1.328AlaGln: 1.328 ± 0.425
2.788AlaArg: 2.788 ± 0.567
3.585AlaSer: 3.585 ± 0.558
3.851AlaThr: 3.851 ± 0.746
2.656AlaVal: 2.656 ± 0.683
0.797AlaTrp: 0.797 ± 0.445
2.257AlaTyr: 2.257 ± 0.618
0.0AlaXaa: 0.0 ± 0.0
Cys
0.133CysAla: 0.133 ± 0.122
0.133CysCys: 0.133 ± 0.143
0.266CysAsp: 0.266 ± 0.149
0.531CysGlu: 0.531 ± 0.318
0.0CysPhe: 0.0 ± 0.0
0.266CysGly: 0.266 ± 0.184
0.266CysHis: 0.266 ± 0.179
0.398CysIle: 0.398 ± 0.209
0.797CysLys: 0.797 ± 0.387
0.266CysLeu: 0.266 ± 0.201
0.0CysMet: 0.0 ± 0.0
0.398CysAsn: 0.398 ± 0.222
0.133CysPro: 0.133 ± 0.135
0.531CysGln: 0.531 ± 0.331
0.398CysArg: 0.398 ± 0.207
0.664CysSer: 0.664 ± 0.322
0.664CysThr: 0.664 ± 0.262
0.266CysVal: 0.266 ± 0.189
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.187AspAla: 3.187 ± 0.7
0.133AspCys: 0.133 ± 0.133
3.984AspAsp: 3.984 ± 0.712
7.038AspGlu: 7.038 ± 1.112
3.054AspPhe: 3.054 ± 0.623
4.249AspGly: 4.249 ± 0.738
1.461AspHis: 1.461 ± 0.465
4.382AspIle: 4.382 ± 0.822
3.187AspLys: 3.187 ± 0.735
4.116AspLeu: 4.116 ± 0.822
1.593AspMet: 1.593 ± 0.74
2.125AspAsn: 2.125 ± 0.496
2.921AspPro: 2.921 ± 0.665
0.929AspGln: 0.929 ± 0.279
3.187AspArg: 3.187 ± 0.598
2.656AspSer: 2.656 ± 0.525
3.851AspThr: 3.851 ± 0.651
4.116AspVal: 4.116 ± 0.862
1.593AspTrp: 1.593 ± 0.522
3.054AspTyr: 3.054 ± 0.615
0.0AspXaa: 0.0 ± 0.0
Glu
5.311GluAla: 5.311 ± 1.075
0.266GluCys: 0.266 ± 0.167
6.905GluAsp: 6.905 ± 0.907
8.1GluGlu: 8.1 ± 1.094
4.647GluPhe: 4.647 ± 1.026
6.905GluGly: 6.905 ± 0.999
1.195GluHis: 1.195 ± 0.384
4.515GluIle: 4.515 ± 0.731
5.179GluLys: 5.179 ± 1.084
7.834GluLeu: 7.834 ± 1.356
1.726GluMet: 1.726 ± 0.505
3.054GluAsn: 3.054 ± 0.595
2.656GluPro: 2.656 ± 0.954
1.859GluGln: 1.859 ± 0.451
4.116GluArg: 4.116 ± 0.835
5.311GluSer: 5.311 ± 0.632
5.179GluThr: 5.179 ± 0.899
6.108GluVal: 6.108 ± 0.723
1.062GluTrp: 1.062 ± 0.418
5.179GluTyr: 5.179 ± 1.147
0.0GluXaa: 0.0 ± 0.0
Phe
1.062PheAla: 1.062 ± 0.443
0.398PheCys: 0.398 ± 0.232
2.656PheAsp: 2.656 ± 0.663
2.257PheGlu: 2.257 ± 0.63
1.726PhePhe: 1.726 ± 0.476
1.992PheGly: 1.992 ± 0.399
1.062PheHis: 1.062 ± 0.404
3.187PheIle: 3.187 ± 0.879
3.984PheLys: 3.984 ± 1.209
3.054PheLeu: 3.054 ± 0.61
1.726PheMet: 1.726 ± 0.482
2.656PheAsn: 2.656 ± 0.581
2.257PhePro: 2.257 ± 0.961
0.398PheGln: 0.398 ± 0.239
2.921PheArg: 2.921 ± 0.749
4.515PheSer: 4.515 ± 0.897
1.859PheThr: 1.859 ± 0.555
0.929PheVal: 0.929 ± 0.439
0.531PheTrp: 0.531 ± 0.282
0.929PheTyr: 0.929 ± 0.341
0.0PheXaa: 0.0 ± 0.0
Gly
4.116GlyAla: 4.116 ± 0.759
0.266GlyCys: 0.266 ± 0.187
5.444GlyAsp: 5.444 ± 1.14
5.843GlyGlu: 5.843 ± 0.743
2.788GlyPhe: 2.788 ± 0.852
5.71GlyGly: 5.71 ± 1.236
1.461GlyHis: 1.461 ± 0.592
3.984GlyIle: 3.984 ± 0.478
3.32GlyLys: 3.32 ± 0.781
4.382GlyLeu: 4.382 ± 1.084
1.726GlyMet: 1.726 ± 0.784
1.461GlyAsn: 1.461 ± 0.372
2.921GlyPro: 2.921 ± 0.857
0.664GlyGln: 0.664 ± 0.346
4.647GlyArg: 4.647 ± 0.89
3.851GlySer: 3.851 ± 0.597
2.921GlyThr: 2.921 ± 0.704
5.975GlyVal: 5.975 ± 0.679
1.992GlyTrp: 1.992 ± 0.518
3.718GlyTyr: 3.718 ± 0.774
0.0GlyXaa: 0.0 ± 0.0
His
0.797HisAla: 0.797 ± 0.347
0.133HisCys: 0.133 ± 0.124
0.398HisAsp: 0.398 ± 0.236
1.593HisGlu: 1.593 ± 0.415
0.531HisPhe: 0.531 ± 0.327
1.062HisGly: 1.062 ± 0.296
0.133HisHis: 0.133 ± 0.146
1.461HisIle: 1.461 ± 0.422
1.328HisLys: 1.328 ± 0.496
1.195HisLeu: 1.195 ± 0.49
0.929HisMet: 0.929 ± 0.332
0.531HisAsn: 0.531 ± 0.254
0.929HisPro: 0.929 ± 0.415
0.398HisGln: 0.398 ± 0.233
1.195HisArg: 1.195 ± 0.35
1.328HisSer: 1.328 ± 0.451
0.929HisThr: 0.929 ± 0.427
1.593HisVal: 1.593 ± 0.399
0.0HisTrp: 0.0 ± 0.0
0.929HisTyr: 0.929 ± 0.382
0.0HisXaa: 0.0 ± 0.0
Ile
2.125IleAla: 2.125 ± 0.531
0.398IleCys: 0.398 ± 0.218
3.585IleAsp: 3.585 ± 0.652
6.905IleGlu: 6.905 ± 1.162
1.328IlePhe: 1.328 ± 0.41
2.921IleGly: 2.921 ± 0.587
1.195IleHis: 1.195 ± 0.472
5.046IleIle: 5.046 ± 1.075
4.249IleLys: 4.249 ± 0.663
6.108IleLeu: 6.108 ± 1.133
2.125IleMet: 2.125 ± 0.613
2.523IleAsn: 2.523 ± 0.646
5.179IlePro: 5.179 ± 1.119
2.523IleGln: 2.523 ± 0.576
5.444IleArg: 5.444 ± 1.081
3.054IleSer: 3.054 ± 0.738
4.647IleThr: 4.647 ± 0.836
3.984IleVal: 3.984 ± 0.742
0.797IleTrp: 0.797 ± 0.353
2.788IleTyr: 2.788 ± 0.557
0.0IleXaa: 0.0 ± 0.0
Lys
4.515LysAla: 4.515 ± 0.601
0.266LysCys: 0.266 ± 0.165
3.32LysAsp: 3.32 ± 0.854
5.975LysGlu: 5.975 ± 0.931
2.257LysPhe: 2.257 ± 0.566
3.187LysGly: 3.187 ± 0.685
0.797LysHis: 0.797 ± 0.363
6.108LysIle: 6.108 ± 1.416
5.311LysLys: 5.311 ± 1.086
5.179LysLeu: 5.179 ± 1.175
1.195LysMet: 1.195 ± 0.396
1.726LysAsn: 1.726 ± 0.39
3.187LysPro: 3.187 ± 0.644
0.929LysGln: 0.929 ± 0.283
4.249LysArg: 4.249 ± 0.77
3.585LysSer: 3.585 ± 0.62
2.921LysThr: 2.921 ± 0.788
5.444LysVal: 5.444 ± 0.756
0.531LysTrp: 0.531 ± 0.264
2.921LysTyr: 2.921 ± 0.518
0.0LysXaa: 0.0 ± 0.0
Leu
5.046LeuAla: 5.046 ± 1.104
0.133LeuCys: 0.133 ± 0.138
3.851LeuAsp: 3.851 ± 0.726
8.897LeuGlu: 8.897 ± 1.578
3.32LeuPhe: 3.32 ± 0.522
5.975LeuGly: 5.975 ± 1.042
2.39LeuHis: 2.39 ± 0.671
4.382LeuIle: 4.382 ± 0.862
7.834LeuLys: 7.834 ± 1.484
6.772LeuLeu: 6.772 ± 0.865
2.257LeuMet: 2.257 ± 0.601
4.382LeuAsn: 4.382 ± 0.997
4.382LeuPro: 4.382 ± 1.391
3.452LeuGln: 3.452 ± 0.819
6.241LeuArg: 6.241 ± 1.111
5.71LeuSer: 5.71 ± 0.857
5.444LeuThr: 5.444 ± 0.929
2.788LeuVal: 2.788 ± 0.499
0.398LeuTrp: 0.398 ± 0.21
2.523LeuTyr: 2.523 ± 0.45
0.0LeuXaa: 0.0 ± 0.0
Met
1.062MetAla: 1.062 ± 0.398
0.398MetCys: 0.398 ± 0.209
1.461MetAsp: 1.461 ± 0.497
2.788MetGlu: 2.788 ± 0.712
0.531MetPhe: 0.531 ± 0.195
2.523MetGly: 2.523 ± 0.56
0.266MetHis: 0.266 ± 0.17
2.656MetIle: 2.656 ± 0.686
2.125MetLys: 2.125 ± 0.525
1.461MetLeu: 1.461 ± 0.417
0.664MetMet: 0.664 ± 0.252
1.859MetAsn: 1.859 ± 0.616
1.195MetPro: 1.195 ± 0.501
0.664MetGln: 0.664 ± 0.352
1.726MetArg: 1.726 ± 0.616
1.062MetSer: 1.062 ± 0.332
1.062MetThr: 1.062 ± 0.361
2.125MetVal: 2.125 ± 0.606
0.133MetTrp: 0.133 ± 0.115
0.664MetTyr: 0.664 ± 0.291
0.0MetXaa: 0.0 ± 0.0
Asn
1.859AsnAla: 1.859 ± 0.897
0.398AsnCys: 0.398 ± 0.213
3.851AsnAsp: 3.851 ± 0.497
2.788AsnGlu: 2.788 ± 0.53
0.929AsnPhe: 0.929 ± 0.316
1.859AsnGly: 1.859 ± 0.454
0.664AsnHis: 0.664 ± 0.353
3.718AsnIle: 3.718 ± 0.867
2.656AsnLys: 2.656 ± 0.649
4.647AsnLeu: 4.647 ± 1.042
1.328AsnMet: 1.328 ± 0.36
1.461AsnAsn: 1.461 ± 0.358
3.851AsnPro: 3.851 ± 0.972
0.797AsnGln: 0.797 ± 0.36
1.992AsnArg: 1.992 ± 0.673
1.859AsnSer: 1.859 ± 0.46
1.859AsnThr: 1.859 ± 0.419
2.656AsnVal: 2.656 ± 0.763
0.398AsnTrp: 0.398 ± 0.193
1.992AsnTyr: 1.992 ± 0.652
0.0AsnXaa: 0.0 ± 0.0
Pro
1.726ProAla: 1.726 ± 0.491
0.398ProCys: 0.398 ± 0.253
2.921ProAsp: 2.921 ± 0.556
3.452ProGlu: 3.452 ± 0.648
2.257ProPhe: 2.257 ± 0.578
2.656ProGly: 2.656 ± 0.712
1.195ProHis: 1.195 ± 0.476
2.788ProIle: 2.788 ± 0.757
3.851ProLys: 3.851 ± 1.722
3.851ProLeu: 3.851 ± 0.778
0.797ProMet: 0.797 ± 0.328
2.921ProAsn: 2.921 ± 1.087
3.452ProPro: 3.452 ± 0.867
0.664ProGln: 0.664 ± 0.272
3.187ProArg: 3.187 ± 0.749
5.311ProSer: 5.311 ± 1.045
2.921ProThr: 2.921 ± 0.622
3.585ProVal: 3.585 ± 0.794
0.266ProTrp: 0.266 ± 0.175
1.726ProTyr: 1.726 ± 0.446
0.0ProXaa: 0.0 ± 0.0
Gln
1.859GlnAla: 1.859 ± 0.461
0.133GlnCys: 0.133 ± 0.152
1.859GlnAsp: 1.859 ± 0.538
1.859GlnGlu: 1.859 ± 0.475
1.062GlnPhe: 1.062 ± 0.371
2.125GlnGly: 2.125 ± 0.57
0.664GlnHis: 0.664 ± 0.319
0.929GlnIle: 0.929 ± 0.359
1.062GlnLys: 1.062 ± 0.434
1.726GlnLeu: 1.726 ± 0.455
0.398GlnMet: 0.398 ± 0.22
0.797GlnAsn: 0.797 ± 0.266
0.531GlnPro: 0.531 ± 0.266
0.664GlnGln: 0.664 ± 0.245
1.062GlnArg: 1.062 ± 0.358
1.328GlnSer: 1.328 ± 0.431
0.797GlnThr: 0.797 ± 0.333
2.257GlnVal: 2.257 ± 0.615
0.133GlnTrp: 0.133 ± 0.154
1.195GlnTyr: 1.195 ± 0.396
0.0GlnXaa: 0.0 ± 0.0
Arg
3.452ArgAla: 3.452 ± 0.808
0.531ArgCys: 0.531 ± 0.357
2.788ArgAsp: 2.788 ± 0.583
6.905ArgGlu: 6.905 ± 1.863
4.382ArgPhe: 4.382 ± 0.612
4.382ArgGly: 4.382 ± 0.714
0.266ArgHis: 0.266 ± 0.176
5.444ArgIle: 5.444 ± 0.932
3.32ArgLys: 3.32 ± 0.611
4.647ArgLeu: 4.647 ± 1.032
1.859ArgMet: 1.859 ± 0.548
2.125ArgAsn: 2.125 ± 0.589
2.39ArgPro: 2.39 ± 0.619
0.929ArgGln: 0.929 ± 0.315
5.975ArgArg: 5.975 ± 1.494
2.921ArgSer: 2.921 ± 0.528
2.125ArgThr: 2.125 ± 0.589
5.577ArgVal: 5.577 ± 0.91
1.062ArgTrp: 1.062 ± 0.346
3.452ArgTyr: 3.452 ± 0.977
0.0ArgXaa: 0.0 ± 0.0
Ser
3.585SerAla: 3.585 ± 0.678
0.531SerCys: 0.531 ± 0.29
4.647SerAsp: 4.647 ± 0.827
4.382SerGlu: 4.382 ± 0.874
2.656SerPhe: 2.656 ± 0.6
4.249SerGly: 4.249 ± 0.957
0.797SerHis: 0.797 ± 0.297
4.382SerIle: 4.382 ± 0.689
2.39SerLys: 2.39 ± 0.619
7.303SerLeu: 7.303 ± 0.867
1.726SerMet: 1.726 ± 0.487
1.992SerAsn: 1.992 ± 0.48
2.921SerPro: 2.921 ± 0.562
1.593SerGln: 1.593 ± 0.37
3.718SerArg: 3.718 ± 0.758
4.78SerSer: 4.78 ± 0.984
2.788SerThr: 2.788 ± 0.532
5.444SerVal: 5.444 ± 0.942
1.461SerTrp: 1.461 ± 0.715
2.788SerTyr: 2.788 ± 0.624
0.0SerXaa: 0.0 ± 0.0
Thr
2.921ThrAla: 2.921 ± 0.728
0.133ThrCys: 0.133 ± 0.125
2.921ThrAsp: 2.921 ± 0.693
4.249ThrGlu: 4.249 ± 0.849
1.992ThrPhe: 1.992 ± 0.728
5.444ThrGly: 5.444 ± 0.875
1.062ThrHis: 1.062 ± 0.294
4.382ThrIle: 4.382 ± 0.738
2.125ThrLys: 2.125 ± 0.579
5.975ThrLeu: 5.975 ± 0.705
1.461ThrMet: 1.461 ± 0.47
2.39ThrAsn: 2.39 ± 0.724
3.718ThrPro: 3.718 ± 0.587
1.328ThrGln: 1.328 ± 0.334
3.187ThrArg: 3.187 ± 0.589
3.054ThrSer: 3.054 ± 0.694
1.992ThrThr: 1.992 ± 0.732
5.046ThrVal: 5.046 ± 1.105
1.195ThrTrp: 1.195 ± 0.409
1.726ThrTyr: 1.726 ± 0.474
0.0ThrXaa: 0.0 ± 0.0
Val
3.585ValAla: 3.585 ± 0.696
0.266ValCys: 0.266 ± 0.175
5.444ValAsp: 5.444 ± 1.122
5.577ValGlu: 5.577 ± 1.289
2.125ValPhe: 2.125 ± 0.454
3.851ValGly: 3.851 ± 0.854
0.664ValHis: 0.664 ± 0.327
3.054ValIle: 3.054 ± 0.698
5.046ValLys: 5.046 ± 0.902
4.78ValLeu: 4.78 ± 0.856
2.125ValMet: 2.125 ± 0.456
4.647ValAsn: 4.647 ± 0.917
3.32ValPro: 3.32 ± 0.855
1.593ValGln: 1.593 ± 0.449
4.116ValArg: 4.116 ± 0.857
5.975ValSer: 5.975 ± 1.176
6.772ValThr: 6.772 ± 0.831
3.187ValVal: 3.187 ± 0.583
1.062ValTrp: 1.062 ± 0.289
2.257ValTyr: 2.257 ± 0.579
0.0ValXaa: 0.0 ± 0.0
Trp
0.133TrpAla: 0.133 ± 0.115
0.133TrpCys: 0.133 ± 0.157
1.062TrpAsp: 1.062 ± 0.369
0.531TrpGlu: 0.531 ± 0.285
0.266TrpPhe: 0.266 ± 0.169
0.664TrpGly: 0.664 ± 0.281
0.133TrpHis: 0.133 ± 0.138
1.195TrpIle: 1.195 ± 0.384
0.929TrpLys: 0.929 ± 0.308
0.797TrpLeu: 0.797 ± 0.371
0.664TrpMet: 0.664 ± 0.383
1.062TrpAsn: 1.062 ± 0.293
1.062TrpPro: 1.062 ± 0.561
0.133TrpGln: 0.133 ± 0.141
1.461TrpArg: 1.461 ± 0.406
1.062TrpSer: 1.062 ± 0.421
0.797TrpThr: 0.797 ± 0.393
1.593TrpVal: 1.593 ± 0.598
0.0TrpTrp: 0.0 ± 0.0
0.266TrpTyr: 0.266 ± 0.204
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.39TyrAla: 2.39 ± 0.619
0.398TyrCys: 0.398 ± 0.217
1.992TyrAsp: 1.992 ± 0.51
2.788TyrGlu: 2.788 ± 0.616
2.523TyrPhe: 2.523 ± 0.534
3.187TyrGly: 3.187 ± 0.754
1.328TyrHis: 1.328 ± 0.488
1.859TyrIle: 1.859 ± 0.584
1.859TyrLys: 1.859 ± 0.554
4.116TyrLeu: 4.116 ± 0.688
0.929TyrMet: 0.929 ± 0.321
1.992TyrAsn: 1.992 ± 0.742
0.929TyrPro: 0.929 ± 0.289
1.062TyrGln: 1.062 ± 0.484
2.921TyrArg: 2.921 ± 0.834
2.523TyrSer: 2.523 ± 0.62
2.921TyrThr: 2.921 ± 0.601
4.116TyrVal: 4.116 ± 0.8
0.531TyrTrp: 0.531 ± 0.25
2.656TyrTyr: 2.656 ± 0.684
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 31 proteins (7532 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski