Amino acid dipeptide frequency for Organic Lake virophage

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
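As a rough illustration of what such a calculation involves, here is a minimal Python sketch. It is not the Proteome-pI implementation. It assumes overlapping two-residue windows counted within each protein, one-letter residue codes as input, and every non-standard letter mapped to X (Xaa). The function name and the toy sequences are hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard residues becomes 'X' (Xaa).
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the virophage proteome).
freqs = dipeptide_frequencies(["MKKLA", "MKT"])
```

Here the two toy proteins yield six dipeptides in total, two of which are MK, so `freqs["MK"]` is one third of 1000‰.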

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (and all amino acids were equally common), each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
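The arithmetic behind the expected value and the unit conversion can be sketched in Python as follows. The three observed values are copied from the table; the helper names are hypothetical, and the plain comparison against the uniform expectation is an assumption, since the exact rule used for the red and blue marking is not stated here.

```python
# Uniform expectation over 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values taken from the table (per mille).
observed = {"AlaAla": 0.119, "AspGly": 11.878, "ThrSer": 13.66}
labels = {name: classify(v) for name, v in observed.items()}
```

Under this assumption AlaAla comes out as underrepresented, while AspGly and ThrSer come out as overrepresented.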

For more information, see the sequence space article on Wikipedia.

Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.119 ± 0.135
AlaCys: 0.475 ± 0.257
AlaAsp: 5.345 ± 3.036
AlaGlu: 2.613 ± 0.627
AlaPhe: 1.188 ± 0.265
AlaGly: 5.464 ± 1.377
AlaHis: 0.356 ± 0.194
AlaIle: 1.782 ± 0.526
AlaLys: 3.563 ± 0.925
AlaLeu: 8.79 ± 2.685
AlaMet: 1.069 ± 0.378
AlaAsn: 3.801 ± 0.958
AlaPro: 6.652 ± 4.34
AlaGln: 0.831 ± 0.339
AlaArg: 1.425 ± 0.334
AlaSer: 4.276 ± 0.755
AlaThr: 5.701 ± 3.195
AlaVal: 3.445 ± 0.912
AlaTrp: 0.356 ± 0.198
AlaTyr: 1.782 ± 0.553
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.238 ± 0.154
CysCys: 0.238 ± 0.165
CysAsp: 0.119 ± 0.108
CysGlu: 0.475 ± 0.259
CysPhe: 0.713 ± 0.366
CysGly: 2.613 ± 1.533
CysHis: 0.475 ± 0.208
CysIle: 1.782 ± 0.869
CysLys: 1.069 ± 0.575
CysLeu: 0.95 ± 0.319
CysMet: 0.119 ± 0.119
CysAsn: 0.594 ± 0.199
CysPro: 0.119 ± 0.099
CysGln: 0.119 ± 0.099
CysArg: 0.475 ± 0.205
CysSer: 0.594 ± 0.259
CysThr: 0.238 ± 0.17
CysVal: 0.831 ± 0.419
CysTrp: 0.119 ± 0.133
CysTyr: 0.594 ± 0.233
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.088 ± 0.734
AspCys: 0.475 ± 0.199
AspAsp: 3.682 ± 0.805
AspGlu: 2.019 ± 0.442
AspPhe: 3.088 ± 0.802
AspGly: 11.878 ± 4.708
AspHis: 0.594 ± 0.306
AspIle: 5.82 ± 1.575
AspLys: 3.445 ± 0.832
AspLeu: 4.395 ± 0.621
AspMet: 1.069 ± 0.512
AspAsn: 2.732 ± 0.714
AspPro: 1.069 ± 0.387
AspGln: 0.831 ± 0.396
AspArg: 1.425 ± 0.429
AspSer: 2.969 ± 0.755
AspThr: 4.989 ± 0.784
AspVal: 3.801 ± 0.941
AspTrp: 0.238 ± 0.153
AspTyr: 1.9 ± 0.548
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.613 ± 0.523
GluCys: 0.475 ± 0.336
GluAsp: 2.851 ± 0.714
GluGlu: 4.87 ± 1.242
GluPhe: 2.138 ± 0.491
GluGly: 1.782 ± 0.63
GluHis: 1.069 ± 0.437
GluIle: 3.563 ± 0.802
GluLys: 2.969 ± 0.788
GluLeu: 5.939 ± 0.796
GluMet: 1.663 ± 0.543
GluAsn: 2.851 ± 0.588
GluPro: 3.682 ± 0.984
GluGln: 0.95 ± 0.293
GluArg: 1.782 ± 0.423
GluSer: 4.87 ± 1.432
GluThr: 4.751 ± 1.723
GluVal: 3.088 ± 0.751
GluTrp: 0.356 ± 0.193
GluTyr: 2.969 ± 0.801
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.494 ± 1.207
PheCys: 0.475 ± 0.28
PheAsp: 2.376 ± 0.526
PheGlu: 2.138 ± 0.565
PhePhe: 1.188 ± 0.397
PheGly: 1.663 ± 0.366
PheHis: 0.238 ± 0.121
PheIle: 2.257 ± 0.623
PheLys: 3.326 ± 0.885
PheLeu: 2.969 ± 0.771
PheMet: 0.831 ± 0.217
PheAsn: 3.445 ± 0.612
PhePro: 1.425 ± 0.414
PheGln: 0.831 ± 0.315
PheArg: 0.594 ± 0.242
PheSer: 2.138 ± 0.449
PheThr: 1.9 ± 0.553
PheVal: 1.544 ± 0.467
PheTrp: 0.475 ± 0.242
PheTyr: 1.188 ± 0.365
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.997 ± 6.975
GlyCys: 0.594 ± 0.294
GlyAsp: 4.989 ± 1.661
GlyGlu: 4.989 ± 1.335
GlyPhe: 2.494 ± 0.612
GlyGly: 4.989 ± 0.997
GlyHis: 0.594 ± 0.227
GlyIle: 5.464 ± 1.982
GlyLys: 3.088 ± 0.876
GlyLeu: 3.92 ± 0.882
GlyMet: 0.475 ± 0.211
GlyAsn: 3.207 ± 0.663
GlyPro: 8.79 ± 3.926
GlyGln: 0.95 ± 0.424
GlyArg: 2.732 ± 1.016
GlySer: 5.226 ± 0.92
GlyThr: 3.326 ± 0.699
GlyVal: 4.989 ± 1.455
GlyTrp: 0.356 ± 0.214
GlyTyr: 2.019 ± 0.672
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.594 ± 0.237
HisCys: 0.119 ± 0.103
HisAsp: 0.119 ± 0.119
HisGlu: 0.356 ± 0.246
HisPhe: 0.475 ± 0.255
HisGly: 0.356 ± 0.172
HisHis: 0.356 ± 0.303
HisIle: 1.425 ± 0.312
HisLys: 1.663 ± 1.02
HisLeu: 1.188 ± 0.503
HisMet: 0.238 ± 0.164
HisAsn: 0.475 ± 0.302
HisPro: 0.356 ± 0.225
HisGln: 0.356 ± 0.224
HisArg: 0.831 ± 0.46
HisSer: 0.831 ± 0.32
HisThr: 3.207 ± 1.177
HisVal: 0.119 ± 0.134
HisTrp: 0.119 ± 0.103
HisTyr: 0.475 ± 0.242
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.088 ± 0.784
IleCys: 0.475 ± 0.241
IleAsp: 5.226 ± 0.856
IleGlu: 4.395 ± 1.281
IlePhe: 2.138 ± 0.579
IleGly: 3.801 ± 0.792
IleHis: 0.475 ± 0.314
IleIle: 3.801 ± 0.707
IleLys: 3.92 ± 0.974
IleLeu: 5.82 ± 1.461
IleMet: 1.544 ± 0.397
IleAsn: 5.583 ± 1.199
IlePro: 4.276 ± 1.622
IleGln: 3.801 ± 1.435
IleArg: 1.663 ± 0.481
IleSer: 9.384 ± 2.489
IleThr: 4.157 ± 0.685
IleVal: 3.445 ± 0.567
IleTrp: 0.119 ± 0.108
IleTyr: 3.088 ± 0.814
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.732 ± 0.733
LysCys: 0.356 ± 0.186
LysAsp: 3.563 ± 1.023
LysGlu: 5.345 ± 1.444
LysPhe: 1.782 ± 0.671
LysGly: 3.563 ± 0.661
LysHis: 1.307 ± 0.445
LysIle: 3.207 ± 0.862
LysLys: 7.483 ± 2.147
LysLeu: 4.157 ± 1.051
LysMet: 1.188 ± 0.54
LysAsn: 4.632 ± 1.509
LysPro: 2.376 ± 0.632
LysGln: 4.514 ± 0.956
LysArg: 2.851 ± 0.664
LysSer: 4.157 ± 0.878
LysThr: 5.939 ± 1.202
LysVal: 3.682 ± 0.67
LysTrp: 0.356 ± 0.2
LysTyr: 4.038 ± 0.985
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.801 ± 1.444
LeuCys: 0.831 ± 0.252
LeuAsp: 6.889 ± 1.314
LeuGlu: 5.701 ± 0.75
LeuPhe: 2.138 ± 0.557
LeuGly: 2.257 ± 0.811
LeuHis: 0.95 ± 0.268
LeuIle: 4.157 ± 0.532
LeuLys: 7.364 ± 1.749
LeuLeu: 5.82 ± 1.161
LeuMet: 1.9 ± 0.408
LeuAsn: 5.701 ± 1.299
LeuPro: 1.544 ± 0.387
LeuGln: 7.127 ± 2.674
LeuArg: 2.851 ± 0.901
LeuSer: 5.82 ± 1.073
LeuThr: 8.077 ± 1.846
LeuVal: 4.038 ± 1.003
LeuTrp: 1.069 ± 0.935
LeuTyr: 4.87 ± 1.301
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.95 ± 0.273
MetCys: 0.475 ± 0.183
MetAsp: 0.713 ± 0.263
MetGlu: 1.544 ± 0.406
MetPhe: 0.356 ± 0.191
MetGly: 0.594 ± 0.305
MetHis: 0.0 ± 0.0
MetIle: 0.831 ± 0.324
MetLys: 1.663 ± 0.667
MetLeu: 1.782 ± 0.488
MetMet: 1.188 ± 0.521
MetAsn: 1.9 ± 0.592
MetPro: 1.069 ± 0.35
MetGln: 0.238 ± 0.216
MetArg: 0.594 ± 0.221
MetSer: 2.494 ± 0.737
MetThr: 1.782 ± 0.486
MetVal: 1.425 ± 0.359
MetTrp: 0.0 ± 0.0
MetTyr: 1.069 ± 0.287
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.969 ± 0.629
AsnCys: 0.475 ± 0.275
AsnAsp: 5.226 ± 1.418
AsnGlu: 3.801 ± 0.642
AsnPhe: 1.782 ± 0.252
AsnGly: 3.326 ± 0.592
AsnHis: 1.425 ± 0.534
AsnIle: 5.226 ± 1.086
AsnLys: 4.395 ± 1.096
AsnLeu: 6.652 ± 1.14
AsnMet: 1.307 ± 0.451
AsnAsn: 4.395 ± 1.012
AsnPro: 2.257 ± 0.529
AsnGln: 2.376 ± 0.496
AsnArg: 1.663 ± 0.372
AsnSer: 4.157 ± 1.089
AsnThr: 4.632 ± 1.006
AsnVal: 4.157 ± 0.859
AsnTrp: 0.594 ± 0.279
AsnTyr: 2.732 ± 0.635
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.207 ± 0.934
ProCys: 3.088 ± 2.301
ProAsp: 1.188 ± 0.434
ProGlu: 0.95 ± 0.315
ProPhe: 1.188 ± 0.412
ProGly: 0.119 ± 0.1
ProHis: 0.475 ± 0.197
ProIle: 2.851 ± 0.897
ProLys: 2.732 ± 1.036
ProLeu: 2.494 ± 0.763
ProMet: 1.069 ± 0.487
ProAsn: 1.782 ± 0.597
ProPro: 1.425 ± 0.709
ProGln: 4.276 ± 2.04
ProArg: 0.594 ± 0.295
ProSer: 6.889 ± 3.254
ProThr: 2.969 ± 0.729
ProVal: 6.77 ± 3.841
ProTrp: 0.238 ± 0.232
ProTyr: 1.069 ± 0.461
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.069 ± 0.425
GlnCys: 0.238 ± 0.163
GlnAsp: 2.138 ± 0.614
GlnGlu: 2.257 ± 0.505
GlnPhe: 0.95 ± 0.32
GlnGly: 9.027 ± 3.538
GlnHis: 0.238 ± 0.16
GlnIle: 4.276 ± 1.492
GlnLys: 2.376 ± 0.656
GlnLeu: 3.801 ± 1.101
GlnMet: 0.713 ± 0.392
GlnAsn: 1.425 ± 0.429
GlnPro: 0.594 ± 0.221
GlnGln: 1.9 ± 0.628
GlnArg: 0.713 ± 0.342
GlnSer: 2.732 ± 0.583
GlnThr: 2.019 ± 0.698
GlnVal: 1.188 ± 0.385
GlnTrp: 0.238 ± 0.156
GlnTyr: 2.019 ± 0.631
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.594 ± 0.25
ArgCys: 0.475 ± 0.197
ArgAsp: 1.069 ± 0.307
ArgGlu: 1.425 ± 0.698
ArgPhe: 1.307 ± 0.377
ArgGly: 1.188 ± 0.415
ArgHis: 0.713 ± 0.251
ArgIle: 1.425 ± 0.376
ArgLys: 2.732 ± 1.098
ArgLeu: 3.445 ± 1.077
ArgMet: 0.475 ± 0.26
ArgAsn: 2.376 ± 0.632
ArgPro: 0.831 ± 0.316
ArgGln: 1.069 ± 0.38
ArgArg: 1.425 ± 0.47
ArgSer: 1.663 ± 0.365
ArgThr: 3.682 ± 2.033
ArgVal: 0.95 ± 0.273
ArgTrp: 1.069 ± 0.35
ArgTyr: 1.307 ± 0.315
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.483 ± 1.764
SerCys: 1.544 ± 0.561
SerAsp: 4.157 ± 0.799
SerGlu: 2.851 ± 0.731
SerPhe: 3.92 ± 1.284
SerGly: 5.107 ± 1.262
SerHis: 1.9 ± 0.946
SerIle: 6.77 ± 1.244
SerLys: 4.87 ± 0.812
SerLeu: 6.533 ± 0.816
SerMet: 1.188 ± 0.439
SerAsn: 7.127 ± 1.434
SerPro: 3.801 ± 1.509
SerGln: 3.92 ± 1.2
SerArg: 2.732 ± 1.086
SerSer: 11.522 ± 3.723
SerThr: 9.859 ± 5.276
SerVal: 3.92 ± 0.834
SerTrp: 0.238 ± 0.177
SerTyr: 2.613 ± 0.555
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.157 ± 1.286
ThrCys: 0.475 ± 0.226
ThrAsp: 3.326 ± 0.708
ThrGlu: 4.276 ± 1.036
ThrPhe: 2.969 ± 0.554
ThrGly: 10.69 ± 5.636
ThrHis: 1.425 ± 0.456
ThrIle: 8.552 ± 2.01
ThrLys: 2.494 ± 0.511
ThrLeu: 6.889 ± 1.748
ThrMet: 1.307 ± 0.417
ThrAsn: 5.226 ± 0.786
ThrPro: 2.969 ± 1.291
ThrGln: 2.732 ± 0.798
ThrArg: 0.713 ± 0.238
ThrSer: 13.66 ± 5.33
ThrThr: 4.989 ± 1.282
ThrVal: 1.425 ± 0.766
ThrTrp: 0.356 ± 0.191
ThrTyr: 1.663 ± 0.393
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.82 ± 2.509
ValCys: 0.356 ± 0.184
ValAsp: 4.514 ± 1.389
ValGlu: 2.851 ± 0.718
ValPhe: 1.9 ± 0.696
ValGly: 4.395 ± 0.844
ValHis: 0.475 ± 0.322
ValIle: 4.157 ± 0.723
ValLys: 4.038 ± 0.908
ValLeu: 3.445 ± 0.832
ValMet: 1.663 ± 0.359
ValAsn: 3.563 ± 0.691
ValPro: 1.188 ± 0.457
ValGln: 1.069 ± 0.556
ValArg: 1.9 ± 0.424
ValSer: 5.226 ± 1.241
ValThr: 2.969 ± 0.521
ValVal: 3.326 ± 0.816
ValTrp: 0.119 ± 0.12
ValTyr: 1.782 ± 0.493
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.238 ± 0.18
TrpCys: 0.119 ± 0.103
TrpAsp: 0.119 ± 0.116
TrpGlu: 0.119 ± 0.1
TrpPhe: 0.238 ± 0.153
TrpGly: 0.119 ± 0.109
TrpHis: 0.119 ± 0.099
TrpIle: 0.475 ± 0.272
TrpLys: 0.475 ± 0.274
TrpLeu: 0.238 ± 0.156
TrpMet: 0.356 ± 0.183
TrpAsn: 0.238 ± 0.163
TrpPro: 0.0 ± 0.0
TrpGln: 0.356 ± 0.191
TrpArg: 0.119 ± 0.12
TrpSer: 1.069 ± 0.414
TrpThr: 1.9 ± 1.036
TrpVal: 0.356 ± 0.202
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.782 ± 0.346
TyrCys: 0.713 ± 0.461
TyrAsp: 1.9 ± 0.631
TyrGlu: 2.257 ± 0.732
TyrPhe: 1.544 ± 0.38
TyrGly: 2.257 ± 0.412
TyrHis: 0.475 ± 0.292
TyrIle: 2.494 ± 0.606
TyrLys: 3.563 ± 0.899
TyrLeu: 3.682 ± 0.603
TyrMet: 1.188 ± 0.416
TyrAsn: 2.494 ± 0.546
TyrPro: 0.95 ± 0.312
TyrGln: 1.663 ± 0.538
TyrArg: 2.138 ± 0.563
TyrSer: 2.376 ± 0.495
TyrThr: 3.207 ± 1.203
TyrVal: 2.257 ± 0.574
TyrTrp: 0.238 ± 0.139
TyrTyr: 1.425 ± 0.477
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 24 proteins (8420 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
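One way such a protein-level bootstrap could look is sketched below in Python. It is an illustration only: the source does not say which statistic is reported after "±", so the use of the standard deviation across replicates is an assumption, as are the function names, the fixed random seed, the one-letter residue codes, and the toy sequences.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) pooled over the given proteins."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the reported error is the sample standard deviation of the
    per-replicate frequencies.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy proteins (hypothetical, not from the virophage proteome).
proteins = ["MKKLA", "MKT", "AKKA"]
error_kk = bootstrap_error(proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, which is why a dipeptide concentrated in a few proteins can show an error comparable to its mean.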

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski