Amino acid dipepetide frequency for Gordonia phage GMA1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.489AlaAla: 19.489 ± 2.255
1.146AlaCys: 1.146 ± 0.342
8.789AlaAsp: 8.789 ± 0.965
8.178AlaGlu: 8.178 ± 0.894
2.751AlaPhe: 2.751 ± 0.446
8.866AlaGly: 8.866 ± 0.821
2.599AlaHis: 2.599 ± 0.524
5.503AlaIle: 5.503 ± 0.574
4.28AlaLys: 4.28 ± 0.719
10.012AlaLeu: 10.012 ± 1.051
3.134AlaMet: 3.134 ± 0.459
2.981AlaAsn: 2.981 ± 0.568
5.579AlaPro: 5.579 ± 0.611
3.439AlaGln: 3.439 ± 0.584
9.707AlaArg: 9.707 ± 0.925
8.025AlaSer: 8.025 ± 0.84
7.566AlaThr: 7.566 ± 0.806
9.095AlaVal: 9.095 ± 0.953
2.675AlaTrp: 2.675 ± 0.574
1.911AlaTyr: 1.911 ± 0.304
0.0AlaXaa: 0.0 ± 0.0
Cys
1.299CysAla: 1.299 ± 0.323
0.153CysCys: 0.153 ± 0.115
0.611CysAsp: 0.611 ± 0.258
0.764CysGlu: 0.764 ± 0.301
0.153CysPhe: 0.153 ± 0.111
1.834CysGly: 1.834 ± 0.442
0.0CysHis: 0.0 ± 0.0
0.306CysIle: 0.306 ± 0.222
0.382CysLys: 0.382 ± 0.203
0.382CysLeu: 0.382 ± 0.157
0.076CysMet: 0.076 ± 0.069
0.229CysAsn: 0.229 ± 0.16
1.452CysPro: 1.452 ± 0.41
0.229CysGln: 0.229 ± 0.13
1.911CysArg: 1.911 ± 0.463
0.535CysSer: 0.535 ± 0.204
0.611CysThr: 0.611 ± 0.19
0.688CysVal: 0.688 ± 0.258
0.382CysTrp: 0.382 ± 0.169
0.229CysTyr: 0.229 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
9.172AspAla: 9.172 ± 0.893
0.611AspCys: 0.611 ± 0.244
4.586AspAsp: 4.586 ± 0.583
3.745AspGlu: 3.745 ± 0.6
1.834AspPhe: 1.834 ± 0.368
6.802AspGly: 6.802 ± 0.641
2.446AspHis: 2.446 ± 0.527
2.751AspIle: 2.751 ± 0.363
2.981AspLys: 2.981 ± 0.587
6.879AspLeu: 6.879 ± 0.669
1.452AspMet: 1.452 ± 0.309
1.529AspAsn: 1.529 ± 0.377
4.662AspPro: 4.662 ± 0.481
2.14AspGln: 2.14 ± 0.387
4.127AspArg: 4.127 ± 0.702
2.599AspSer: 2.599 ± 0.445
3.592AspThr: 3.592 ± 0.722
4.127AspVal: 4.127 ± 0.558
1.299AspTrp: 1.299 ± 0.296
1.452AspTyr: 1.452 ± 0.409
0.0AspXaa: 0.0 ± 0.0
Glu
7.719GluAla: 7.719 ± 0.99
1.681GluCys: 1.681 ± 0.381
2.446GluAsp: 2.446 ± 0.474
2.828GluGlu: 2.828 ± 0.483
1.529GluPhe: 1.529 ± 0.336
3.363GluGly: 3.363 ± 0.59
1.681GluHis: 1.681 ± 0.397
2.904GluIle: 2.904 ± 0.618
2.369GluLys: 2.369 ± 0.404
4.739GluLeu: 4.739 ± 0.712
1.299GluMet: 1.299 ± 0.431
1.681GluAsn: 1.681 ± 0.332
2.14GluPro: 2.14 ± 0.371
2.216GluGln: 2.216 ± 0.378
5.121GluArg: 5.121 ± 0.913
2.981GluSer: 2.981 ± 0.466
3.286GluThr: 3.286 ± 0.508
3.974GluVal: 3.974 ± 0.585
1.299GluTrp: 1.299 ± 0.255
1.07GluTyr: 1.07 ± 0.315
0.0GluXaa: 0.0 ± 0.0
Phe
2.369PheAla: 2.369 ± 0.39
0.153PheCys: 0.153 ± 0.088
2.064PheAsp: 2.064 ± 0.447
0.688PheGlu: 0.688 ± 0.231
0.459PhePhe: 0.459 ± 0.196
2.751PheGly: 2.751 ± 0.446
0.688PheHis: 0.688 ± 0.23
0.688PheIle: 0.688 ± 0.232
0.382PheLys: 0.382 ± 0.149
1.223PheLeu: 1.223 ± 0.338
0.306PheMet: 0.306 ± 0.161
0.611PheAsn: 0.611 ± 0.23
0.688PhePro: 0.688 ± 0.198
1.376PheGln: 1.376 ± 0.265
1.911PheArg: 1.911 ± 0.384
1.834PheSer: 1.834 ± 0.429
1.146PheThr: 1.146 ± 0.281
1.834PheVal: 1.834 ± 0.497
0.306PheTrp: 0.306 ± 0.15
0.688PheTyr: 0.688 ± 0.205
0.0PheXaa: 0.0 ± 0.0
Gly
7.414GlyAla: 7.414 ± 1.012
0.535GlyCys: 0.535 ± 0.208
5.426GlyAsp: 5.426 ± 0.692
5.732GlyGlu: 5.732 ± 0.56
2.369GlyPhe: 2.369 ± 0.375
8.254GlyGly: 8.254 ± 1.125
2.522GlyHis: 2.522 ± 0.438
3.439GlyIle: 3.439 ± 0.399
2.828GlyLys: 2.828 ± 0.564
6.496GlyLeu: 6.496 ± 0.696
2.981GlyMet: 2.981 ± 0.348
2.446GlyAsn: 2.446 ± 0.426
3.669GlyPro: 3.669 ± 0.5
2.675GlyGln: 2.675 ± 0.407
6.726GlyArg: 6.726 ± 0.661
5.503GlySer: 5.503 ± 0.645
6.726GlyThr: 6.726 ± 0.774
6.038GlyVal: 6.038 ± 0.681
1.605GlyTrp: 1.605 ± 0.315
2.522GlyTyr: 2.522 ± 0.528
0.0GlyXaa: 0.0 ± 0.0
His
1.911HisAla: 1.911 ± 0.449
0.611HisCys: 0.611 ± 0.211
2.522HisAsp: 2.522 ± 0.418
1.911HisGlu: 1.911 ± 0.392
0.688HisPhe: 0.688 ± 0.214
2.293HisGly: 2.293 ± 0.458
0.917HisHis: 0.917 ± 0.364
1.376HisIle: 1.376 ± 0.297
0.611HisLys: 0.611 ± 0.21
2.216HisLeu: 2.216 ± 0.373
0.535HisMet: 0.535 ± 0.188
0.306HisAsn: 0.306 ± 0.14
1.758HisPro: 1.758 ± 0.393
0.764HisGln: 0.764 ± 0.237
2.14HisArg: 2.14 ± 0.522
1.07HisSer: 1.07 ± 0.343
2.14HisThr: 2.14 ± 0.449
1.605HisVal: 1.605 ± 0.337
0.688HisTrp: 0.688 ± 0.214
0.764HisTyr: 0.764 ± 0.287
0.0HisXaa: 0.0 ± 0.0
Ile
4.891IleAla: 4.891 ± 0.583
0.382IleCys: 0.382 ± 0.168
4.586IleAsp: 4.586 ± 0.582
3.745IleGlu: 3.745 ± 0.508
0.611IlePhe: 0.611 ± 0.228
4.356IleGly: 4.356 ± 0.641
1.299IleHis: 1.299 ± 0.249
2.064IleIle: 2.064 ± 0.417
1.223IleLys: 1.223 ± 0.478
2.751IleLeu: 2.751 ± 0.382
0.306IleMet: 0.306 ± 0.139
1.758IleAsn: 1.758 ± 0.355
2.293IlePro: 2.293 ± 0.415
1.452IleGln: 1.452 ± 0.316
3.974IleArg: 3.974 ± 0.492
2.293IleSer: 2.293 ± 0.299
3.134IleThr: 3.134 ± 0.616
3.286IleVal: 3.286 ± 0.433
0.764IleTrp: 0.764 ± 0.248
0.917IleTyr: 0.917 ± 0.259
0.0IleXaa: 0.0 ± 0.0
Lys
4.662LysAla: 4.662 ± 0.766
0.306LysCys: 0.306 ± 0.146
1.07LysAsp: 1.07 ± 0.23
0.994LysGlu: 0.994 ± 0.269
0.841LysPhe: 0.841 ± 0.282
2.828LysGly: 2.828 ± 0.475
0.688LysHis: 0.688 ± 0.261
1.07LysIle: 1.07 ± 0.427
2.064LysLys: 2.064 ± 0.402
3.21LysLeu: 3.21 ± 0.546
0.611LysMet: 0.611 ± 0.184
0.764LysAsn: 0.764 ± 0.237
2.369LysPro: 2.369 ± 0.508
0.917LysGln: 0.917 ± 0.244
2.446LysArg: 2.446 ± 0.38
1.681LysSer: 1.681 ± 0.388
2.904LysThr: 2.904 ± 0.481
2.751LysVal: 2.751 ± 0.335
0.688LysTrp: 0.688 ± 0.2
0.841LysTyr: 0.841 ± 0.31
0.0LysXaa: 0.0 ± 0.0
Leu
11.159LeuAla: 11.159 ± 1.053
0.841LeuCys: 0.841 ± 0.231
5.503LeuAsp: 5.503 ± 0.515
3.439LeuGlu: 3.439 ± 0.465
1.529LeuPhe: 1.529 ± 0.276
7.031LeuGly: 7.031 ± 0.864
3.286LeuHis: 3.286 ± 0.521
4.28LeuIle: 4.28 ± 0.617
1.758LeuLys: 1.758 ± 0.355
5.426LeuLeu: 5.426 ± 0.759
1.452LeuMet: 1.452 ± 0.278
1.987LeuAsn: 1.987 ± 0.302
2.904LeuPro: 2.904 ± 0.371
2.14LeuGln: 2.14 ± 0.397
4.662LeuArg: 4.662 ± 0.547
3.974LeuSer: 3.974 ± 0.457
6.114LeuThr: 6.114 ± 0.501
3.974LeuVal: 3.974 ± 0.554
1.452LeuTrp: 1.452 ± 0.398
1.681LeuTyr: 1.681 ± 0.431
0.0LeuXaa: 0.0 ± 0.0
Met
2.904MetAla: 2.904 ± 0.453
0.153MetCys: 0.153 ± 0.112
0.994MetAsp: 0.994 ± 0.266
0.841MetGlu: 0.841 ± 0.295
0.229MetPhe: 0.229 ± 0.134
1.376MetGly: 1.376 ± 0.352
0.459MetHis: 0.459 ± 0.184
1.07MetIle: 1.07 ± 0.224
0.459MetLys: 0.459 ± 0.17
1.758MetLeu: 1.758 ± 0.309
0.076MetMet: 0.076 ± 0.055
0.841MetAsn: 0.841 ± 0.206
1.07MetPro: 1.07 ± 0.287
1.146MetGln: 1.146 ± 0.283
1.605MetArg: 1.605 ± 0.316
1.758MetSer: 1.758 ± 0.341
3.439MetThr: 3.439 ± 0.519
1.146MetVal: 1.146 ± 0.38
0.459MetTrp: 0.459 ± 0.191
0.229MetTyr: 0.229 ± 0.123
0.0MetXaa: 0.0 ± 0.0
Asn
3.669AsnAla: 3.669 ± 0.569
0.153AsnCys: 0.153 ± 0.116
1.146AsnAsp: 1.146 ± 0.261
0.917AsnGlu: 0.917 ± 0.261
0.764AsnPhe: 0.764 ± 0.314
2.904AsnGly: 2.904 ± 0.565
0.764AsnHis: 0.764 ± 0.218
0.306AsnIle: 0.306 ± 0.229
0.764AsnLys: 0.764 ± 0.265
2.522AsnLeu: 2.522 ± 0.435
0.382AsnMet: 0.382 ± 0.176
0.841AsnAsn: 0.841 ± 0.283
2.064AsnPro: 2.064 ± 0.394
1.223AsnGln: 1.223 ± 0.233
1.681AsnArg: 1.681 ± 0.362
1.605AsnSer: 1.605 ± 0.283
1.605AsnThr: 1.605 ± 0.397
1.911AsnVal: 1.911 ± 0.41
0.688AsnTrp: 0.688 ± 0.173
0.382AsnTyr: 0.382 ± 0.151
0.0AsnXaa: 0.0 ± 0.0
Pro
6.114ProAla: 6.114 ± 0.555
0.841ProCys: 0.841 ± 0.251
5.35ProAsp: 5.35 ± 0.604
3.286ProGlu: 3.286 ± 0.588
0.764ProPhe: 0.764 ± 0.231
5.503ProGly: 5.503 ± 0.797
0.917ProHis: 0.917 ± 0.278
2.14ProIle: 2.14 ± 0.349
2.216ProLys: 2.216 ± 0.473
2.369ProLeu: 2.369 ± 0.432
1.529ProMet: 1.529 ± 0.381
1.07ProAsn: 1.07 ± 0.287
2.14ProPro: 2.14 ± 0.548
1.605ProGln: 1.605 ± 0.333
2.904ProArg: 2.904 ± 0.5
2.828ProSer: 2.828 ± 0.534
4.739ProThr: 4.739 ± 0.757
4.739ProVal: 4.739 ± 0.688
0.841ProTrp: 0.841 ± 0.217
0.611ProTyr: 0.611 ± 0.237
0.0ProXaa: 0.0 ± 0.0
Gln
4.204GlnAla: 4.204 ± 0.692
0.306GlnCys: 0.306 ± 0.173
2.064GlnAsp: 2.064 ± 0.369
2.14GlnGlu: 2.14 ± 0.383
0.382GlnPhe: 0.382 ± 0.161
2.064GlnGly: 2.064 ± 0.437
0.764GlnHis: 0.764 ± 0.255
2.216GlnIle: 2.216 ± 0.351
1.299GlnLys: 1.299 ± 0.263
2.904GlnLeu: 2.904 ± 0.473
1.146GlnMet: 1.146 ± 0.307
0.917GlnAsn: 0.917 ± 0.26
1.834GlnPro: 1.834 ± 0.451
1.987GlnGln: 1.987 ± 0.361
2.14GlnArg: 2.14 ± 0.411
1.376GlnSer: 1.376 ± 0.379
2.446GlnThr: 2.446 ± 0.424
1.605GlnVal: 1.605 ± 0.34
0.611GlnTrp: 0.611 ± 0.25
0.764GlnTyr: 0.764 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
8.56ArgAla: 8.56 ± 0.867
1.376ArgCys: 1.376 ± 0.439
4.739ArgAsp: 4.739 ± 0.622
3.592ArgGlu: 3.592 ± 0.728
1.376ArgPhe: 1.376 ± 0.268
4.28ArgGly: 4.28 ± 0.59
2.446ArgHis: 2.446 ± 0.445
3.669ArgIle: 3.669 ± 0.606
3.134ArgLys: 3.134 ± 0.47
5.961ArgLeu: 5.961 ± 0.691
2.293ArgMet: 2.293 ± 0.564
1.529ArgAsn: 1.529 ± 0.264
3.439ArgPro: 3.439 ± 0.492
2.522ArgGln: 2.522 ± 0.574
6.42ArgArg: 6.42 ± 0.916
3.821ArgSer: 3.821 ± 0.508
4.127ArgThr: 4.127 ± 0.56
6.114ArgVal: 6.114 ± 0.67
1.376ArgTrp: 1.376 ± 0.283
1.758ArgTyr: 1.758 ± 0.378
0.0ArgXaa: 0.0 ± 0.0
Ser
6.802SerAla: 6.802 ± 0.782
0.535SerCys: 0.535 ± 0.231
4.356SerAsp: 4.356 ± 0.603
3.134SerGlu: 3.134 ± 0.545
1.07SerPhe: 1.07 ± 0.339
6.802SerGly: 6.802 ± 0.955
0.611SerHis: 0.611 ± 0.216
2.522SerIle: 2.522 ± 0.343
1.758SerLys: 1.758 ± 0.413
4.051SerLeu: 4.051 ± 0.613
1.681SerMet: 1.681 ± 0.327
1.681SerAsn: 1.681 ± 0.339
2.599SerPro: 2.599 ± 0.429
1.758SerGln: 1.758 ± 0.256
3.134SerArg: 3.134 ± 0.601
3.286SerSer: 3.286 ± 0.602
4.509SerThr: 4.509 ± 0.682
3.745SerVal: 3.745 ± 0.438
1.299SerTrp: 1.299 ± 0.304
0.994SerTyr: 0.994 ± 0.288
0.0SerXaa: 0.0 ± 0.0
Thr
9.019ThrAla: 9.019 ± 0.814
0.764ThrCys: 0.764 ± 0.274
4.891ThrAsp: 4.891 ± 0.646
2.981ThrGlu: 2.981 ± 0.492
1.758ThrPhe: 1.758 ± 0.427
6.191ThrGly: 6.191 ± 0.859
1.911ThrHis: 1.911 ± 0.507
3.974ThrIle: 3.974 ± 0.517
1.834ThrLys: 1.834 ± 0.41
4.586ThrLeu: 4.586 ± 0.546
0.994ThrMet: 0.994 ± 0.472
2.828ThrAsn: 2.828 ± 0.458
5.732ThrPro: 5.732 ± 0.687
1.911ThrGln: 1.911 ± 0.35
3.592ThrArg: 3.592 ± 0.516
4.28ThrSer: 4.28 ± 0.655
6.42ThrThr: 6.42 ± 0.918
5.961ThrVal: 5.961 ± 0.706
1.758ThrTrp: 1.758 ± 0.366
1.376ThrTyr: 1.376 ± 0.322
0.0ThrXaa: 0.0 ± 0.0
Val
8.407ValAla: 8.407 ± 0.812
0.917ValCys: 0.917 ± 0.287
5.426ValAsp: 5.426 ± 0.604
4.815ValGlu: 4.815 ± 0.584
2.064ValPhe: 2.064 ± 0.333
5.579ValGly: 5.579 ± 0.749
1.758ValHis: 1.758 ± 0.421
4.051ValIle: 4.051 ± 0.574
1.911ValLys: 1.911 ± 0.435
4.509ValLeu: 4.509 ± 0.625
1.07ValMet: 1.07 ± 0.249
1.529ValAsn: 1.529 ± 0.408
3.974ValPro: 3.974 ± 0.675
2.14ValGln: 2.14 ± 0.362
4.815ValArg: 4.815 ± 0.598
4.051ValSer: 4.051 ± 0.566
5.579ValThr: 5.579 ± 0.741
5.044ValVal: 5.044 ± 0.668
1.681ValTrp: 1.681 ± 0.345
1.452ValTyr: 1.452 ± 0.398
0.0ValXaa: 0.0 ± 0.0
Trp
2.981TrpAla: 2.981 ± 0.514
0.535TrpCys: 0.535 ± 0.196
0.994TrpAsp: 0.994 ± 0.244
1.529TrpGlu: 1.529 ± 0.355
0.229TrpPhe: 0.229 ± 0.12
0.917TrpGly: 0.917 ± 0.265
0.611TrpHis: 0.611 ± 0.236
0.917TrpIle: 0.917 ± 0.216
0.459TrpLys: 0.459 ± 0.222
1.605TrpLeu: 1.605 ± 0.354
0.535TrpMet: 0.535 ± 0.186
0.229TrpAsn: 0.229 ± 0.113
1.376TrpPro: 1.376 ± 0.278
0.688TrpGln: 0.688 ± 0.269
2.14TrpArg: 2.14 ± 0.463
1.681TrpSer: 1.681 ± 0.326
1.299TrpThr: 1.299 ± 0.374
1.681TrpVal: 1.681 ± 0.453
0.459TrpTrp: 0.459 ± 0.175
0.153TrpTyr: 0.153 ± 0.112
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.981TyrAla: 2.981 ± 0.508
0.229TyrCys: 0.229 ± 0.18
1.223TyrAsp: 1.223 ± 0.25
1.146TyrGlu: 1.146 ± 0.267
0.764TyrPhe: 0.764 ± 0.249
1.911TyrGly: 1.911 ± 0.426
0.459TyrHis: 0.459 ± 0.239
0.841TyrIle: 0.841 ± 0.238
0.994TyrLys: 0.994 ± 0.261
0.994TyrLeu: 0.994 ± 0.306
0.076TyrMet: 0.076 ± 0.081
0.535TyrAsn: 0.535 ± 0.189
0.764TyrPro: 0.764 ± 0.316
0.841TyrGln: 0.841 ± 0.278
1.376TyrArg: 1.376 ± 0.335
1.146TyrSer: 1.146 ± 0.24
1.376TyrThr: 1.376 ± 0.312
1.452TyrVal: 1.452 ± 0.263
0.688TyrTrp: 0.688 ± 0.232
0.459TyrTyr: 0.459 ± 0.21
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (13085 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski