Amino acid dipeptide frequency for Verrucomicrobia phage P8625

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
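As an illustration of the idea, the following minimal Python sketch counts overlapping adjacent pairs in a set of protein sequences and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the one-letter input alphabet, the choice to count pairs only within a protein, and the normalisation by the total number of counted pairs are all assumptions made for this example.

```python
from collections import Counter

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions (not confirmed by the source): pairs overlap, no pair
    spans two proteins, and the denominator is the number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input in one-letter code; not taken from the P8625 proteome.
toy_proteome = ["MAAK", "AAG"]
freqs = dipeptide_per_mille(toy_proteome)
```

In the toy input there are five pairs in total (MA, AA, AK, AA, AG), so `freqs["AA"]` is two fifths of 1000.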

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
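The uniform expectation and the colour rule can be sketched as below. This assumes a plain comparison against 1/400; the page does not state whether the estimated error is taken into account when a cell is coloured, so the helper `colour` is hypothetical. The three example values are taken from the table.

```python
STANDARD_AA = 20
# 1 / 400 expressed in per mille: 2.5, which is 0.25%.
EXPECTED_PER_MILLE = 1000.0 / (STANDARD_AA ** 2)

def colour(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Hypothetical marking rule: red above the expectation, blue below."""
    if freq_per_mille > expected:
        return "red"
    if freq_per_mille < expected:
        return "blue"
    return "none"

# Values copied from the table (per mille).
examples = {"AlaAla": 9.144, "CysCys": 0.19, "GlyPro": 0.0}
marks = {pair: colour(value) for pair, value in examples.items()}
```

Under this rule AlaAla would be marked red, while CysCys and GlyPro would be marked blue.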

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
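A small sketch of the unit conversion, using the AlaAla value from the table; the helper names are made up for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0

ala_ala = 9.144  # per mille, from the table
fraction = per_mille_to_fraction(ala_ala)  # about 0.009144
percent = per_mille_to_percent(ala_ala)    # about 0.9144 %
```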

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 9.144 ± 1.084
AlaCys: 1.238 ± 0.382
AlaAsp: 7.715 ± 0.88
AlaGlu: 8.096 ± 0.914
AlaPhe: 3.524 ± 0.537
AlaGly: 6.667 ± 0.67
AlaHis: 1.333 ± 0.298
AlaIle: 6.286 ± 0.909
AlaLys: 7.525 ± 1.031
AlaLeu: 7.525 ± 0.992
AlaMet: 2.762 ± 0.518
AlaAsn: 4.286 ± 0.793
AlaPro: 3.81 ± 0.875
AlaGln: 3.429 ± 0.614
AlaArg: 4.762 ± 0.735
AlaSer: 6.953 ± 1.611
AlaThr: 6.572 ± 1.132
AlaVal: 3.81 ± 0.582
AlaTrp: 1.143 ± 0.311
AlaTyr: 2.667 ± 0.475
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.143 ± 0.421
CysCys: 0.19 ± 0.18
CysAsp: 1.81 ± 0.435
CysGlu: 1.429 ± 0.412
CysPhe: 0.19 ± 0.126
CysGly: 1.238 ± 0.409
CysHis: 0.19 ± 0.2
CysIle: 0.476 ± 0.241
CysLys: 1.048 ± 0.414
CysLeu: 0.952 ± 0.328
CysMet: 0.19 ± 0.14
CysAsn: 0.476 ± 0.196
CysPro: 0.095 ± 0.095
CysGln: 0.571 ± 0.234
CysArg: 0.952 ± 0.329
CysSer: 0.857 ± 0.259
CysThr: 0.286 ± 0.167
CysVal: 0.667 ± 0.308
CysTrp: 0.19 ± 0.133
CysTyr: 0.476 ± 0.257
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.763 ± 0.802
AspCys: 2.0 ± 0.575
AspAsp: 4.096 ± 0.72
AspGlu: 5.239 ± 0.643
AspPhe: 3.429 ± 0.633
AspGly: 6.858 ± 0.98
AspHis: 1.048 ± 0.365
AspIle: 3.429 ± 0.475
AspLys: 3.81 ± 0.754
AspLeu: 5.048 ± 0.723
AspMet: 1.619 ± 0.376
AspAsn: 1.619 ± 0.326
AspPro: 2.286 ± 0.403
AspGln: 2.476 ± 0.432
AspArg: 2.572 ± 0.51
AspSer: 4.477 ± 0.708
AspThr: 3.334 ± 0.493
AspVal: 4.096 ± 0.643
AspTrp: 1.714 ± 0.447
AspTyr: 2.0 ± 0.406
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.715 ± 0.899
GluCys: 0.857 ± 0.298
GluAsp: 3.143 ± 0.664
GluGlu: 3.429 ± 0.586
GluPhe: 2.191 ± 0.511
GluGly: 4.381 ± 0.866
GluHis: 0.667 ± 0.227
GluIle: 5.239 ± 0.785
GluLys: 4.762 ± 0.769
GluLeu: 5.81 ± 0.752
GluMet: 2.667 ± 0.508
GluAsn: 3.81 ± 0.6
GluPro: 1.81 ± 0.504
GluGln: 3.143 ± 0.537
GluArg: 5.048 ± 0.932
GluSer: 5.143 ± 0.786
GluThr: 3.905 ± 0.636
GluVal: 3.334 ± 0.559
GluTrp: 1.619 ± 0.365
GluTyr: 2.381 ± 0.472
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.048 ± 0.4
PheCys: 0.19 ± 0.122
PheAsp: 3.905 ± 0.705
PheGlu: 2.762 ± 0.577
PhePhe: 1.429 ± 0.279
PheGly: 3.715 ± 0.63
PheHis: 0.857 ± 0.243
PheIle: 1.905 ± 0.502
PheLys: 2.0 ± 0.474
PheLeu: 2.381 ± 0.473
PheMet: 1.048 ± 0.293
PheAsn: 0.667 ± 0.234
PhePro: 1.429 ± 0.415
PheGln: 1.143 ± 0.382
PheArg: 1.619 ± 0.305
PheSer: 2.476 ± 0.563
PheThr: 2.762 ± 0.509
PheVal: 2.0 ± 0.476
PheTrp: 0.381 ± 0.193
PheTyr: 1.619 ± 0.435
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.858 ± 0.927
GlyCys: 0.667 ± 0.301
GlyAsp: 3.81 ± 0.663
GlyGlu: 5.239 ± 0.855
GlyPhe: 3.81 ± 0.69
GlyGly: 5.905 ± 0.79
GlyHis: 1.429 ± 0.339
GlyIle: 4.762 ± 0.609
GlyLys: 4.953 ± 1.03
GlyLeu: 5.239 ± 0.834
GlyMet: 2.381 ± 0.452
GlyAsn: 4.096 ± 0.743
GlyPro: 0.0 ± 0.0
GlyGln: 2.381 ± 0.429
GlyArg: 4.286 ± 0.66
GlySer: 5.62 ± 1.042
GlyThr: 5.81 ± 1.889
GlyVal: 3.81 ± 0.574
GlyTrp: 0.762 ± 0.252
GlyTyr: 3.429 ± 0.754
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.0 ± 0.579
HisCys: 0.095 ± 0.105
HisAsp: 1.143 ± 0.327
HisGlu: 0.857 ± 0.303
HisPhe: 0.571 ± 0.207
HisGly: 1.524 ± 0.486
HisHis: 0.19 ± 0.144
HisIle: 0.762 ± 0.238
HisLys: 0.667 ± 0.262
HisLeu: 1.143 ± 0.295
HisMet: 0.381 ± 0.233
HisAsn: 0.762 ± 0.296
HisPro: 0.762 ± 0.277
HisGln: 0.476 ± 0.199
HisArg: 1.524 ± 0.355
HisSer: 0.857 ± 0.26
HisThr: 0.381 ± 0.227
HisVal: 1.238 ± 0.379
HisTrp: 0.095 ± 0.095
HisTyr: 0.857 ± 0.33
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.382 ± 0.868
IleCys: 1.333 ± 0.385
IleAsp: 6.001 ± 0.689
IleGlu: 6.096 ± 0.874
IlePhe: 1.81 ± 0.348
IleGly: 4.096 ± 0.543
IleHis: 0.857 ± 0.227
IleIle: 3.619 ± 0.65
IleLys: 3.715 ± 0.635
IleLeu: 2.953 ± 0.575
IleMet: 1.429 ± 0.352
IleAsn: 2.191 ± 0.506
IlePro: 2.476 ± 0.482
IleGln: 1.81 ± 0.432
IleArg: 4.667 ± 0.633
IleSer: 3.715 ± 0.655
IleThr: 3.048 ± 0.561
IleVal: 3.81 ± 0.57
IleTrp: 1.238 ± 0.27
IleTyr: 1.714 ± 0.374
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.477 ± 1.05
LysCys: 1.048 ± 0.264
LysAsp: 4.0 ± 0.58
LysGlu: 3.048 ± 0.625
LysPhe: 1.619 ± 0.377
LysGly: 3.334 ± 0.525
LysHis: 1.524 ± 0.433
LysIle: 4.286 ± 0.635
LysLys: 4.477 ± 0.976
LysLeu: 5.239 ± 0.58
LysMet: 2.476 ± 0.472
LysAsn: 2.762 ± 0.504
LysPro: 2.667 ± 0.653
LysGln: 3.238 ± 0.606
LysArg: 3.715 ± 0.603
LysSer: 3.905 ± 0.53
LysThr: 3.81 ± 0.591
LysVal: 2.857 ± 0.496
LysTrp: 1.238 ± 0.283
LysTyr: 1.905 ± 0.371
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.429 ± 0.901
LeuCys: 1.238 ± 0.386
LeuAsp: 4.953 ± 0.646
LeuGlu: 3.81 ± 0.758
LeuPhe: 2.476 ± 0.504
LeuGly: 4.572 ± 0.53
LeuHis: 1.048 ± 0.329
LeuIle: 4.667 ± 0.697
LeuLys: 4.572 ± 0.94
LeuLeu: 4.477 ± 0.671
LeuMet: 1.619 ± 0.504
LeuAsn: 3.715 ± 0.644
LeuPro: 2.857 ± 0.531
LeuGln: 2.286 ± 0.356
LeuArg: 4.477 ± 0.591
LeuSer: 4.667 ± 0.769
LeuThr: 4.572 ± 1.021
LeuVal: 3.619 ± 0.762
LeuTrp: 0.857 ± 0.319
LeuTyr: 2.572 ± 0.44
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.096 ± 0.91
MetCys: 0.19 ± 0.114
MetAsp: 2.286 ± 0.563
MetGlu: 1.524 ± 0.368
MetPhe: 0.381 ± 0.201
MetGly: 1.143 ± 0.305
MetHis: 0.381 ± 0.294
MetIle: 1.905 ± 0.379
MetLys: 2.381 ± 0.479
MetLeu: 2.191 ± 0.482
MetMet: 0.19 ± 0.112
MetAsn: 1.429 ± 0.337
MetPro: 1.333 ± 0.307
MetGln: 1.048 ± 0.405
MetArg: 1.238 ± 0.377
MetSer: 1.333 ± 0.409
MetThr: 2.0 ± 0.428
MetVal: 1.333 ± 0.394
MetTrp: 0.286 ± 0.137
MetTyr: 0.286 ± 0.156
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.143 ± 0.891
AsnCys: 0.476 ± 0.257
AsnAsp: 2.0 ± 0.409
AsnGlu: 2.572 ± 0.458
AsnPhe: 1.714 ± 0.408
AsnGly: 5.62 ± 1.053
AsnHis: 0.857 ± 0.344
AsnIle: 1.619 ± 0.423
AsnLys: 3.048 ± 0.655
AsnLeu: 3.334 ± 0.621
AsnMet: 0.857 ± 0.329
AsnAsn: 1.714 ± 0.43
AsnPro: 2.286 ± 0.413
AsnGln: 1.333 ± 0.331
AsnArg: 3.429 ± 0.493
AsnSer: 2.857 ± 0.584
AsnThr: 1.619 ± 0.366
AsnVal: 1.619 ± 0.412
AsnTrp: 0.476 ± 0.164
AsnTyr: 1.429 ± 0.353
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.619 ± 0.772
ProCys: 0.19 ± 0.123
ProAsp: 2.476 ± 0.56
ProGlu: 2.953 ± 0.522
ProPhe: 0.952 ± 0.288
ProGly: 2.095 ± 0.396
ProHis: 0.095 ± 0.106
ProIle: 2.476 ± 0.386
ProLys: 1.619 ± 0.448
ProLeu: 2.667 ± 0.566
ProMet: 0.762 ± 0.225
ProAsn: 1.524 ± 0.396
ProPro: 0.857 ± 0.283
ProGln: 0.762 ± 0.287
ProArg: 1.238 ± 0.283
ProSer: 2.667 ± 0.552
ProThr: 2.667 ± 0.5
ProVal: 1.81 ± 0.384
ProTrp: 0.571 ± 0.227
ProTyr: 1.238 ± 0.335
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.524 ± 0.63
GlnCys: 0.476 ± 0.189
GlnAsp: 1.429 ± 0.311
GlnGlu: 1.81 ± 0.333
GlnPhe: 1.714 ± 0.359
GlnGly: 1.81 ± 0.471
GlnHis: 0.857 ± 0.238
GlnIle: 2.667 ± 0.405
GlnLys: 2.857 ± 0.42
GlnLeu: 2.762 ± 0.654
GlnMet: 1.333 ± 0.348
GlnAsn: 1.048 ± 0.362
GlnPro: 1.238 ± 0.347
GlnGln: 0.762 ± 0.245
GlnArg: 2.476 ± 0.5
GlnSer: 3.619 ± 0.538
GlnThr: 1.905 ± 0.472
GlnVal: 1.238 ± 0.268
GlnTrp: 1.048 ± 0.286
GlnTyr: 0.857 ± 0.326
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.048 ± 0.712
ArgCys: 0.381 ± 0.16
ArgAsp: 3.619 ± 0.705
ArgGlu: 4.191 ± 0.625
ArgPhe: 2.667 ± 0.579
ArgGly: 4.191 ± 0.63
ArgHis: 1.143 ± 0.347
ArgIle: 3.048 ± 0.532
ArgLys: 4.381 ± 0.889
ArgLeu: 4.191 ± 0.546
ArgMet: 1.905 ± 0.44
ArgAsn: 2.857 ± 0.655
ArgPro: 1.524 ± 0.428
ArgGln: 2.0 ± 0.39
ArgArg: 4.191 ± 0.704
ArgSer: 2.953 ± 0.542
ArgThr: 2.762 ± 0.582
ArgVal: 3.429 ± 0.568
ArgTrp: 0.762 ± 0.265
ArgTyr: 1.81 ± 0.419
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.715 ± 1.102
SerCys: 0.762 ± 0.359
SerAsp: 5.048 ± 0.761
SerGlu: 4.762 ± 0.722
SerPhe: 2.667 ± 0.423
SerGly: 6.858 ± 1.41
SerHis: 0.857 ± 0.519
SerIle: 5.524 ± 0.898
SerLys: 3.238 ± 0.599
SerLeu: 3.905 ± 0.762
SerMet: 2.0 ± 0.498
SerAsn: 2.572 ± 0.399
SerPro: 1.333 ± 0.403
SerGln: 2.381 ± 0.379
SerArg: 3.715 ± 0.521
SerSer: 4.477 ± 0.883
SerThr: 3.619 ± 0.682
SerVal: 4.572 ± 0.847
SerTrp: 1.238 ± 0.305
SerTyr: 0.762 ± 0.226
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.429 ± 1.619
ThrCys: 0.381 ± 0.181
ThrAsp: 3.524 ± 0.569
ThrGlu: 3.524 ± 0.512
ThrPhe: 2.857 ± 0.691
ThrGly: 5.334 ± 1.159
ThrHis: 1.143 ± 0.306
ThrIle: 3.619 ± 0.481
ThrLys: 3.143 ± 0.525
ThrLeu: 4.667 ± 0.529
ThrMet: 1.048 ± 0.258
ThrAsn: 2.476 ± 0.545
ThrPro: 2.191 ± 0.394
ThrGln: 1.619 ± 0.392
ThrArg: 1.905 ± 0.472
ThrSer: 2.857 ± 0.495
ThrThr: 4.381 ± 1.543
ThrVal: 3.619 ± 0.509
ThrTrp: 1.333 ± 0.333
ThrTyr: 2.476 ± 0.722
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.572 ± 0.594
ValCys: 1.143 ± 0.419
ValAsp: 2.857 ± 0.53
ValGlu: 4.762 ± 0.566
ValPhe: 1.619 ± 0.367
ValGly: 3.238 ± 0.502
ValHis: 0.571 ± 0.239
ValIle: 3.524 ± 0.478
ValLys: 3.143 ± 0.584
ValLeu: 2.572 ± 0.486
ValMet: 0.857 ± 0.28
ValAsn: 3.238 ± 0.551
ValPro: 2.0 ± 0.481
ValGln: 2.476 ± 0.728
ValArg: 2.191 ± 0.464
ValSer: 4.572 ± 0.777
ValThr: 4.096 ± 0.559
ValVal: 2.0 ± 0.475
ValTrp: 0.762 ± 0.23
ValTyr: 1.429 ± 0.329
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.857 ± 0.244
TrpCys: 0.19 ± 0.128
TrpAsp: 1.143 ± 0.42
TrpGlu: 1.143 ± 0.27
TrpPhe: 0.762 ± 0.268
TrpGly: 0.762 ± 0.204
TrpHis: 0.667 ± 0.219
TrpIle: 1.048 ± 0.235
TrpLys: 0.667 ± 0.344
TrpLeu: 1.143 ± 0.346
TrpMet: 0.667 ± 0.255
TrpAsn: 0.667 ± 0.248
TrpPro: 0.952 ± 0.292
TrpGln: 0.952 ± 0.308
TrpArg: 1.714 ± 0.403
TrpSer: 1.524 ± 0.322
TrpThr: 0.381 ± 0.195
TrpVal: 0.857 ± 0.288
TrpTrp: 0.095 ± 0.089
TrpTyr: 0.667 ± 0.245
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.286 ± 0.442
TyrCys: 0.381 ± 0.191
TyrAsp: 2.953 ± 0.535
TyrGlu: 1.81 ± 0.488
TyrPhe: 0.952 ± 0.387
TyrGly: 1.905 ± 0.49
TyrHis: 0.571 ± 0.296
TyrIle: 2.095 ± 0.409
TyrLys: 1.619 ± 0.44
TyrLeu: 2.381 ± 0.522
TyrMet: 0.667 ± 0.21
TyrAsn: 2.095 ± 0.55
TyrPro: 1.333 ± 0.331
TyrGln: 1.143 ± 0.305
TyrArg: 1.333 ± 0.433
TyrSer: 2.476 ± 0.34
TyrThr: 1.619 ± 0.42
TyrVal: 1.905 ± 0.507
TyrTrp: 1.048 ± 0.437
TyrTyr: 1.143 ± 0.487
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (10500 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
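One way such a protein-level bootstrap could look is sketched below. Several details are assumptions, since the page does not spell them out: whole proteins are resampled with replacement, 100 resamples are drawn, the error is taken as the standard deviation of the resampled per mille frequencies, and sequences are in one-letter code. The names `pair_counts` and `bootstrap_errors` are made up for this example.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent pairs within one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_errors(proteins, n_resamples=100, seed=0):
    """Return {dipeptide: std. dev. of its per mille frequency}.

    Assumption: proteins (not residues) are the resampling unit.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    all_pairs = set().union(*per_protein)
    samples = {pair: [] for pair in all_pairs}
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(per_protein, k=len(per_protein))
        pooled = Counter()
        for counts in resample:
            pooled.update(counts)
        total = sum(pooled.values())
        for pair in all_pairs:
            freq = 1000.0 * pooled[pair] / total if total else 0.0
            samples[pair].append(freq)
    return {pair: statistics.pstdev(vals) for pair, vals in samples.items()}

# Toy input; not the 52 proteins of the P8625 proteome.
errors = bootstrap_errors(["MAAK", "AAG", "MKKA"])
```

Resampling whole proteins keeps the pairs of a protein together, so the estimate reflects variation between proteins rather than between individual positions.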

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski