Amino acid dipepetide frequency for Moraxella phage Mcat19

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.867AlaAla: 4.867 ± 1.356
1.025AlaCys: 1.025 ± 0.429
4.995AlaAsp: 4.995 ± 0.564
6.148AlaGlu: 6.148 ± 1.009
2.818AlaPhe: 2.818 ± 0.615
5.635AlaGly: 5.635 ± 0.86
1.793AlaHis: 1.793 ± 0.435
4.611AlaIle: 4.611 ± 0.489
7.556AlaLys: 7.556 ± 1.129
10.246AlaLeu: 10.246 ± 1.603
1.921AlaMet: 1.921 ± 0.669
3.586AlaAsn: 3.586 ± 0.666
1.537AlaPro: 1.537 ± 0.423
4.739AlaGln: 4.739 ± 0.771
4.355AlaArg: 4.355 ± 0.539
6.276AlaSer: 6.276 ± 0.896
6.019AlaThr: 6.019 ± 1.362
5.123AlaVal: 5.123 ± 0.771
0.897AlaTrp: 0.897 ± 0.441
2.69AlaTyr: 2.69 ± 0.57
0.0AlaXaa: 0.0 ± 0.0
Cys
0.512CysAla: 0.512 ± 0.287
0.256CysCys: 0.256 ± 0.167
0.897CysAsp: 0.897 ± 0.392
0.256CysGlu: 0.256 ± 0.217
0.384CysPhe: 0.384 ± 0.199
0.512CysGly: 0.512 ± 0.417
0.384CysHis: 0.384 ± 0.229
0.512CysIle: 0.512 ± 0.321
0.897CysLys: 0.897 ± 0.485
0.512CysLeu: 0.512 ± 0.26
0.256CysMet: 0.256 ± 0.22
0.768CysAsn: 0.768 ± 0.326
0.256CysPro: 0.256 ± 0.184
0.64CysGln: 0.64 ± 0.308
1.025CysArg: 1.025 ± 0.328
1.153CysSer: 1.153 ± 0.414
0.64CysThr: 0.64 ± 0.378
0.64CysVal: 0.64 ± 0.277
0.0CysTrp: 0.0 ± 0.0
0.384CysTyr: 0.384 ± 0.284
0.0CysXaa: 0.0 ± 0.0
Asp
5.123AspAla: 5.123 ± 0.789
0.768AspCys: 0.768 ± 0.376
4.995AspAsp: 4.995 ± 1.182
4.867AspGlu: 4.867 ± 0.743
2.69AspPhe: 2.69 ± 0.597
6.66AspGly: 6.66 ± 1.112
0.768AspHis: 0.768 ± 0.375
3.97AspIle: 3.97 ± 0.725
5.123AspLys: 5.123 ± 0.588
4.483AspLeu: 4.483 ± 0.704
1.409AspMet: 1.409 ± 0.377
4.483AspAsn: 4.483 ± 0.955
1.665AspPro: 1.665 ± 0.484
1.153AspGln: 1.153 ± 0.499
2.177AspArg: 2.177 ± 0.525
2.69AspSer: 2.69 ± 0.57
2.433AspThr: 2.433 ± 0.53
3.586AspVal: 3.586 ± 0.605
1.281AspTrp: 1.281 ± 0.502
2.305AspTyr: 2.305 ± 0.672
0.0AspXaa: 0.0 ± 0.0
Glu
4.355GluAla: 4.355 ± 0.727
0.64GluCys: 0.64 ± 0.296
3.458GluAsp: 3.458 ± 0.74
2.177GluGlu: 2.177 ± 0.8
3.586GluPhe: 3.586 ± 0.575
2.177GluGly: 2.177 ± 0.527
1.665GluHis: 1.665 ± 0.322
3.97GluIle: 3.97 ± 0.8
4.867GluLys: 4.867 ± 0.852
7.3GluLeu: 7.3 ± 1.045
1.793GluMet: 1.793 ± 0.545
3.714GluAsn: 3.714 ± 0.612
2.433GluPro: 2.433 ± 0.814
3.202GluGln: 3.202 ± 0.57
4.483GluArg: 4.483 ± 0.615
1.921GluSer: 1.921 ± 0.489
2.305GluThr: 2.305 ± 0.455
3.33GluVal: 3.33 ± 0.693
0.768GluTrp: 0.768 ± 0.292
3.202GluTyr: 3.202 ± 0.599
0.0GluXaa: 0.0 ± 0.0
Phe
3.074PheAla: 3.074 ± 0.956
0.512PheCys: 0.512 ± 0.261
3.458PheAsp: 3.458 ± 0.488
3.202PheGlu: 3.202 ± 0.781
0.768PhePhe: 0.768 ± 0.282
3.842PheGly: 3.842 ± 0.699
1.025PheHis: 1.025 ± 0.387
2.561PheIle: 2.561 ± 0.548
1.665PheLys: 1.665 ± 0.411
2.433PheLeu: 2.433 ± 0.586
0.897PheMet: 0.897 ± 0.461
1.537PheAsn: 1.537 ± 0.453
0.768PhePro: 0.768 ± 0.303
0.512PheGln: 0.512 ± 0.324
1.025PheArg: 1.025 ± 0.377
1.793PheSer: 1.793 ± 0.444
2.946PheThr: 2.946 ± 0.488
2.561PheVal: 2.561 ± 0.528
0.512PheTrp: 0.512 ± 0.278
1.025PheTyr: 1.025 ± 0.435
0.0PheXaa: 0.0 ± 0.0
Gly
4.483GlyAla: 4.483 ± 0.967
0.64GlyCys: 0.64 ± 0.306
3.97GlyAsp: 3.97 ± 0.711
4.867GlyGlu: 4.867 ± 0.737
3.842GlyPhe: 3.842 ± 0.609
6.916GlyGly: 6.916 ± 1.449
0.897GlyHis: 0.897 ± 0.351
3.714GlyIle: 3.714 ± 0.671
6.916GlyLys: 6.916 ± 0.957
7.044GlyLeu: 7.044 ± 1.013
2.049GlyMet: 2.049 ± 0.596
3.074GlyAsn: 3.074 ± 0.613
0.0GlyPro: 0.0 ± 0.0
2.433GlyGln: 2.433 ± 0.405
3.586GlyArg: 3.586 ± 0.566
3.586GlySer: 3.586 ± 0.409
2.946GlyThr: 2.946 ± 0.89
4.611GlyVal: 4.611 ± 0.835
1.153GlyTrp: 1.153 ± 0.399
1.921GlyTyr: 1.921 ± 0.76
0.0GlyXaa: 0.0 ± 0.0
His
1.793HisAla: 1.793 ± 0.628
0.384HisCys: 0.384 ± 0.212
1.665HisAsp: 1.665 ± 0.418
1.025HisGlu: 1.025 ± 0.371
0.768HisPhe: 0.768 ± 0.278
2.305HisGly: 2.305 ± 0.67
0.897HisHis: 0.897 ± 0.447
1.025HisIle: 1.025 ± 0.383
1.281HisLys: 1.281 ± 0.415
2.049HisLeu: 2.049 ± 0.47
0.128HisMet: 0.128 ± 0.133
0.768HisAsn: 0.768 ± 0.311
1.281HisPro: 1.281 ± 0.477
0.64HisGln: 0.64 ± 0.231
0.768HisArg: 0.768 ± 0.446
0.512HisSer: 0.512 ± 0.243
1.665HisThr: 1.665 ± 0.385
0.512HisVal: 0.512 ± 0.233
0.256HisTrp: 0.256 ± 0.197
1.153HisTyr: 1.153 ± 0.334
0.0HisXaa: 0.0 ± 0.0
Ile
5.763IleAla: 5.763 ± 0.814
0.897IleCys: 0.897 ± 0.385
5.123IleAsp: 5.123 ± 0.588
3.458IleGlu: 3.458 ± 0.644
0.897IlePhe: 0.897 ± 0.388
4.739IleGly: 4.739 ± 0.67
1.025IleHis: 1.025 ± 0.403
5.251IleIle: 5.251 ± 0.71
4.611IleLys: 4.611 ± 0.844
4.355IleLeu: 4.355 ± 0.616
1.921IleMet: 1.921 ± 0.49
3.074IleAsn: 3.074 ± 0.439
1.409IlePro: 1.409 ± 0.498
2.561IleGln: 2.561 ± 0.564
1.409IleArg: 1.409 ± 0.366
3.97IleSer: 3.97 ± 0.884
5.379IleThr: 5.379 ± 0.92
3.202IleVal: 3.202 ± 0.629
0.256IleTrp: 0.256 ± 0.156
1.793IleTyr: 1.793 ± 0.426
0.0IleXaa: 0.0 ± 0.0
Lys
9.734LysAla: 9.734 ± 1.598
0.64LysCys: 0.64 ± 0.272
3.458LysAsp: 3.458 ± 0.591
2.818LysGlu: 2.818 ± 0.598
2.818LysPhe: 2.818 ± 0.706
3.714LysGly: 3.714 ± 0.63
1.153LysHis: 1.153 ± 0.406
4.483LysIle: 4.483 ± 0.855
5.379LysLys: 5.379 ± 0.999
7.428LysLeu: 7.428 ± 0.848
2.433LysMet: 2.433 ± 0.694
3.202LysAsn: 3.202 ± 0.628
2.433LysPro: 2.433 ± 0.476
3.074LysGln: 3.074 ± 0.821
4.226LysArg: 4.226 ± 0.891
6.404LysSer: 6.404 ± 1.087
4.739LysThr: 4.739 ± 0.986
3.842LysVal: 3.842 ± 0.834
0.768LysTrp: 0.768 ± 0.303
2.049LysTyr: 2.049 ± 0.517
0.0LysXaa: 0.0 ± 0.0
Leu
8.581LeuAla: 8.581 ± 1.134
0.768LeuCys: 0.768 ± 0.481
7.428LeuAsp: 7.428 ± 0.673
5.507LeuGlu: 5.507 ± 0.97
2.69LeuPhe: 2.69 ± 0.482
4.867LeuGly: 4.867 ± 0.768
1.793LeuHis: 1.793 ± 0.473
5.379LeuIle: 5.379 ± 0.795
8.837LeuLys: 8.837 ± 1.076
6.148LeuLeu: 6.148 ± 1.048
1.921LeuMet: 1.921 ± 0.519
4.739LeuAsn: 4.739 ± 0.931
3.33LeuPro: 3.33 ± 0.732
3.586LeuGln: 3.586 ± 0.728
4.611LeuArg: 4.611 ± 0.609
6.276LeuSer: 6.276 ± 0.698
4.995LeuThr: 4.995 ± 1.068
4.739LeuVal: 4.739 ± 0.676
1.153LeuTrp: 1.153 ± 0.342
3.33LeuTyr: 3.33 ± 0.523
0.0LeuXaa: 0.0 ± 0.0
Met
2.049MetAla: 2.049 ± 0.565
0.128MetCys: 0.128 ± 0.153
1.409MetAsp: 1.409 ± 0.514
1.025MetGlu: 1.025 ± 0.418
1.281MetPhe: 1.281 ± 0.4
1.921MetGly: 1.921 ± 0.475
0.512MetHis: 0.512 ± 0.245
1.537MetIle: 1.537 ± 0.397
1.793MetLys: 1.793 ± 0.533
2.433MetLeu: 2.433 ± 0.521
0.768MetMet: 0.768 ± 0.384
1.025MetAsn: 1.025 ± 0.503
1.025MetPro: 1.025 ± 0.278
1.537MetGln: 1.537 ± 0.41
0.768MetArg: 0.768 ± 0.411
2.946MetSer: 2.946 ± 0.447
1.281MetThr: 1.281 ± 0.388
2.561MetVal: 2.561 ± 0.779
0.384MetTrp: 0.384 ± 0.195
0.64MetTyr: 0.64 ± 0.277
0.0MetXaa: 0.0 ± 0.0
Asn
3.842AsnAla: 3.842 ± 0.785
0.256AsnCys: 0.256 ± 0.212
1.921AsnAsp: 1.921 ± 0.435
3.586AsnGlu: 3.586 ± 0.606
1.793AsnPhe: 1.793 ± 0.388
3.202AsnGly: 3.202 ± 0.485
1.025AsnHis: 1.025 ± 0.362
2.818AsnIle: 2.818 ± 0.553
2.69AsnLys: 2.69 ± 0.647
3.97AsnLeu: 3.97 ± 1.075
1.793AsnMet: 1.793 ± 0.574
3.074AsnAsn: 3.074 ± 0.579
3.074AsnPro: 3.074 ± 0.745
1.665AsnGln: 1.665 ± 0.596
2.433AsnArg: 2.433 ± 0.598
3.458AsnSer: 3.458 ± 0.765
2.433AsnThr: 2.433 ± 0.636
2.561AsnVal: 2.561 ± 0.643
0.384AsnTrp: 0.384 ± 0.189
2.561AsnTyr: 2.561 ± 0.487
0.0AsnXaa: 0.0 ± 0.0
Pro
1.665ProAla: 1.665 ± 0.585
0.768ProCys: 0.768 ± 0.341
1.409ProAsp: 1.409 ± 0.423
1.793ProGlu: 1.793 ± 0.393
1.409ProPhe: 1.409 ± 0.515
0.768ProGly: 0.768 ± 0.268
0.64ProHis: 0.64 ± 0.305
2.305ProIle: 2.305 ± 0.533
2.946ProLys: 2.946 ± 0.674
1.921ProLeu: 1.921 ± 0.518
1.025ProMet: 1.025 ± 0.328
2.69ProAsn: 2.69 ± 0.824
1.409ProPro: 1.409 ± 0.448
1.281ProGln: 1.281 ± 0.447
1.153ProArg: 1.153 ± 0.425
2.946ProSer: 2.946 ± 0.698
1.409ProThr: 1.409 ± 0.441
2.049ProVal: 2.049 ± 0.487
0.384ProTrp: 0.384 ± 0.235
0.64ProTyr: 0.64 ± 0.315
0.0ProXaa: 0.0 ± 0.0
Gln
5.123GlnAla: 5.123 ± 0.781
0.256GlnCys: 0.256 ± 0.189
2.049GlnAsp: 2.049 ± 0.559
2.561GlnGlu: 2.561 ± 0.517
0.897GlnPhe: 0.897 ± 0.302
2.305GlnGly: 2.305 ± 0.696
0.64GlnHis: 0.64 ± 0.363
3.202GlnIle: 3.202 ± 0.798
3.714GlnLys: 3.714 ± 0.698
4.226GlnLeu: 4.226 ± 0.762
1.537GlnMet: 1.537 ± 0.464
1.537GlnAsn: 1.537 ± 0.423
1.281GlnPro: 1.281 ± 0.35
2.561GlnGln: 2.561 ± 0.788
2.049GlnArg: 2.049 ± 0.535
2.433GlnSer: 2.433 ± 0.635
2.946GlnThr: 2.946 ± 0.649
2.561GlnVal: 2.561 ± 0.65
0.512GlnTrp: 0.512 ± 0.31
1.665GlnTyr: 1.665 ± 0.479
0.0GlnXaa: 0.0 ± 0.0
Arg
4.867ArgAla: 4.867 ± 0.744
0.384ArgCys: 0.384 ± 0.279
2.946ArgAsp: 2.946 ± 0.606
3.202ArgGlu: 3.202 ± 0.506
2.177ArgPhe: 2.177 ± 0.505
3.33ArgGly: 3.33 ± 0.631
1.665ArgHis: 1.665 ± 0.556
2.561ArgIle: 2.561 ± 0.639
1.921ArgLys: 1.921 ± 0.399
5.763ArgLeu: 5.763 ± 1.048
0.768ArgMet: 0.768 ± 0.284
1.793ArgAsn: 1.793 ± 0.498
1.281ArgPro: 1.281 ± 0.333
1.793ArgGln: 1.793 ± 0.523
2.561ArgArg: 2.561 ± 0.522
2.433ArgSer: 2.433 ± 0.478
2.561ArgThr: 2.561 ± 0.472
2.946ArgVal: 2.946 ± 0.466
0.897ArgTrp: 0.897 ± 0.348
3.458ArgTyr: 3.458 ± 0.634
0.0ArgXaa: 0.0 ± 0.0
Ser
5.251SerAla: 5.251 ± 0.924
0.512SerCys: 0.512 ± 0.255
2.69SerAsp: 2.69 ± 0.482
4.995SerGlu: 4.995 ± 0.697
1.921SerPhe: 1.921 ± 0.502
2.818SerGly: 2.818 ± 0.576
2.177SerHis: 2.177 ± 0.35
4.226SerIle: 4.226 ± 0.704
3.714SerLys: 3.714 ± 0.723
5.891SerLeu: 5.891 ± 0.771
1.281SerMet: 1.281 ± 0.372
3.97SerAsn: 3.97 ± 0.924
1.409SerPro: 1.409 ± 0.544
3.714SerGln: 3.714 ± 0.815
3.842SerArg: 3.842 ± 0.67
2.305SerSer: 2.305 ± 0.592
2.433SerThr: 2.433 ± 0.584
5.507SerVal: 5.507 ± 0.808
0.384SerTrp: 0.384 ± 0.188
1.537SerTyr: 1.537 ± 0.33
0.0SerXaa: 0.0 ± 0.0
Thr
6.916ThrAla: 6.916 ± 1.266
0.128ThrCys: 0.128 ± 0.144
3.714ThrAsp: 3.714 ± 0.687
3.97ThrGlu: 3.97 ± 0.771
1.409ThrPhe: 1.409 ± 0.326
4.098ThrGly: 4.098 ± 0.778
1.025ThrHis: 1.025 ± 0.371
2.69ThrIle: 2.69 ± 0.57
3.97ThrLys: 3.97 ± 0.872
4.226ThrLeu: 4.226 ± 0.629
1.921ThrMet: 1.921 ± 0.428
2.049ThrAsn: 2.049 ± 0.431
2.177ThrPro: 2.177 ± 0.58
2.818ThrGln: 2.818 ± 0.512
1.537ThrArg: 1.537 ± 0.459
2.305ThrSer: 2.305 ± 0.599
3.842ThrThr: 3.842 ± 0.942
4.611ThrVal: 4.611 ± 0.719
0.897ThrTrp: 0.897 ± 0.303
1.537ThrTyr: 1.537 ± 0.403
0.0ThrXaa: 0.0 ± 0.0
Val
5.123ValAla: 5.123 ± 1.047
1.153ValCys: 1.153 ± 0.482
3.458ValAsp: 3.458 ± 0.859
3.074ValGlu: 3.074 ± 0.501
2.305ValPhe: 2.305 ± 0.558
4.483ValGly: 4.483 ± 0.655
0.897ValHis: 0.897 ± 0.33
3.97ValIle: 3.97 ± 0.718
4.226ValLys: 4.226 ± 0.957
6.019ValLeu: 6.019 ± 0.826
1.921ValMet: 1.921 ± 0.434
2.305ValAsn: 2.305 ± 0.548
1.921ValPro: 1.921 ± 0.417
2.946ValGln: 2.946 ± 0.633
4.098ValArg: 4.098 ± 0.823
3.97ValSer: 3.97 ± 0.711
2.818ValThr: 2.818 ± 0.474
3.458ValVal: 3.458 ± 0.784
1.025ValTrp: 1.025 ± 0.426
1.921ValTyr: 1.921 ± 0.585
0.0ValXaa: 0.0 ± 0.0
Trp
1.281TrpAla: 1.281 ± 0.448
0.256TrpCys: 0.256 ± 0.213
1.153TrpAsp: 1.153 ± 0.435
0.768TrpGlu: 0.768 ± 0.26
0.256TrpPhe: 0.256 ± 0.216
0.64TrpGly: 0.64 ± 0.242
0.128TrpHis: 0.128 ± 0.095
0.384TrpIle: 0.384 ± 0.2
0.512TrpLys: 0.512 ± 0.332
1.025TrpLeu: 1.025 ± 0.437
0.128TrpMet: 0.128 ± 0.141
0.256TrpAsn: 0.256 ± 0.19
0.128TrpPro: 0.128 ± 0.144
1.665TrpGln: 1.665 ± 0.565
0.512TrpArg: 0.512 ± 0.293
0.768TrpSer: 0.768 ± 0.335
0.512TrpThr: 0.512 ± 0.269
1.025TrpVal: 1.025 ± 0.345
0.256TrpTrp: 0.256 ± 0.172
0.897TrpTyr: 0.897 ± 0.379
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.561TyrAla: 2.561 ± 0.792
0.512TyrCys: 0.512 ± 0.285
2.433TyrAsp: 2.433 ± 0.43
2.049TyrGlu: 2.049 ± 0.583
1.409TyrPhe: 1.409 ± 0.386
3.714TyrGly: 3.714 ± 0.661
0.768TyrHis: 0.768 ± 0.461
1.921TyrIle: 1.921 ± 0.516
2.177TyrLys: 2.177 ± 0.584
3.202TyrLeu: 3.202 ± 0.812
1.025TyrMet: 1.025 ± 0.404
0.897TyrAsn: 0.897 ± 0.275
1.793TyrPro: 1.793 ± 0.51
1.537TyrGln: 1.537 ± 0.462
2.69TyrArg: 2.69 ± 0.592
2.433TyrSer: 2.433 ± 0.532
1.537TyrThr: 1.537 ± 0.426
1.665TyrVal: 1.665 ± 0.453
0.384TyrTrp: 0.384 ± 0.213
1.793TyrTyr: 1.793 ± 0.573
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 41 proteins (7809 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski