Amino acid dipeptide frequency for Streptococcus phage CHPC931

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
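The counting described above can be sketched in Python. This is only an illustration, not the database's own code: the function names, the one-letter input alphabet, the folding of every non-standard residue into "X" (Xaa), and the normalization by the total number of overlapping pairs are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; anything else is folded into "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"
ALPHABET = STANDARD + "X"

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: frequencies are normalized by the total number of
    overlapping residue pairs; the database may normalize differently.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Each sequence is scanned on its own, so no pair spans two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

With this sketch, a value such as 6.32‰ would be classified as "over" and 0.366‰ as "under", which corresponds to the red and blue marking described above if the page uses the 0.25% threshold directly.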

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
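As a small worked example of the unit conversion, the following Python sketch turns a per mille value into a plain fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical and chosen only for this illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation.

    With 400 dipeptides the uniform expectation is 2.5 per mille (0.25 %).
    """
    return value_permille / (1000.0 / n_dipeptides)


# AlaAla is listed at 6.32 per mille in the table.
ala_ala_fraction = permille_to_fraction(6.32)
ala_ala_fold = fold_vs_uniform(6.32)
```

Here `ala_ala_fraction` is approximately 0.00632 and `ala_ala_fold` is approximately 2.5, meaning AlaAla occurs about two and a half times more often than a uniform distribution would predict.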

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.32 ± 2.578
AlaCys: 0.366 ± 0.185
AlaAsp: 4.855 ± 0.881
AlaGlu: 4.855 ± 0.697
AlaPhe: 3.298 ± 1.212
AlaGly: 5.313 ± 1.396
AlaHis: 1.008 ± 0.301
AlaIle: 5.771 ± 1.522
AlaLys: 4.305 ± 0.643
AlaLeu: 6.687 ± 1.185
AlaMet: 2.107 ± 0.894
AlaAsn: 4.03 ± 0.67
AlaPro: 2.29 ± 0.515
AlaGln: 3.389 ± 0.911
AlaArg: 3.481 ± 0.659
AlaSer: 6.32 ± 1.391
AlaThr: 3.847 ± 0.911
AlaVal: 5.038 ± 1.409
AlaTrp: 0.733 ± 0.236
AlaTyr: 2.015 ± 0.477
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.366 ± 0.222
CysCys: 0.183 ± 0.133
CysAsp: 0.55 ± 0.282
CysGlu: 0.733 ± 0.341
CysPhe: 0.092 ± 0.102
CysGly: 0.458 ± 0.226
CysHis: 0.275 ± 0.169
CysIle: 0.275 ± 0.149
CysLys: 0.458 ± 0.219
CysLeu: 0.366 ± 0.164
CysMet: 0.092 ± 0.091
CysAsn: 0.366 ± 0.172
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 0.366 ± 0.222
CysSer: 1.008 ± 0.337
CysThr: 0.0 ± 0.0
CysVal: 0.275 ± 0.171
CysTrp: 0.092 ± 0.092
CysTyr: 0.275 ± 0.135
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.664 ± 0.494
AspCys: 0.458 ± 0.25
AspAsp: 5.221 ± 0.81
AspGlu: 4.03 ± 0.879
AspPhe: 3.572 ± 0.608
AspGly: 4.672 ± 0.744
AspHis: 0.733 ± 0.304
AspIle: 3.664 ± 0.621
AspLys: 4.488 ± 0.936
AspLeu: 5.221 ± 0.726
AspMet: 1.557 ± 0.388
AspAsn: 4.58 ± 0.762
AspPro: 0.916 ± 0.367
AspGln: 1.099 ± 0.279
AspArg: 2.29 ± 0.425
AspSer: 3.939 ± 0.612
AspThr: 3.664 ± 0.768
AspVal: 3.023 ± 0.506
AspTrp: 1.008 ± 0.409
AspTyr: 3.114 ± 0.709
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.496 ± 0.695
GluCys: 0.183 ± 0.131
GluAsp: 2.473 ± 0.554
GluGlu: 4.946 ± 0.899
GluPhe: 2.656 ± 0.5
GluGly: 3.298 ± 0.524
GluHis: 1.557 ± 0.474
GluIle: 5.221 ± 0.772
GluLys: 5.954 ± 1.042
GluLeu: 6.504 ± 1.209
GluMet: 2.748 ± 0.634
GluAsn: 4.397 ± 0.704
GluPro: 1.466 ± 0.415
GluGln: 2.931 ± 0.656
GluArg: 3.939 ± 0.802
GluSer: 2.656 ± 0.69
GluThr: 3.298 ± 0.692
GluVal: 6.595 ± 1.106
GluTrp: 0.824 ± 0.335
GluTyr: 3.023 ± 0.703
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.29 ± 0.417
PheCys: 0.275 ± 0.175
PheAsp: 3.114 ± 0.609
PheGlu: 4.122 ± 0.662
PhePhe: 1.191 ± 0.407
PheGly: 4.03 ± 0.768
PheHis: 0.275 ± 0.129
PheIle: 3.206 ± 0.539
PheLys: 5.221 ± 0.713
PheLeu: 2.107 ± 0.546
PheMet: 0.55 ± 0.231
PheAsn: 3.114 ± 0.472
PhePro: 0.55 ± 0.297
PheGln: 1.191 ± 0.312
PheArg: 1.099 ± 0.252
PheSer: 3.664 ± 0.722
PheThr: 2.565 ± 0.565
PheVal: 2.107 ± 0.442
PheTrp: 0.824 ± 0.259
PheTyr: 1.282 ± 0.362
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.946 ± 1.169
GlyCys: 0.366 ± 0.175
GlyAsp: 3.114 ± 0.41
GlyGlu: 2.931 ± 0.504
GlyPhe: 3.206 ± 0.535
GlyGly: 3.206 ± 0.582
GlyHis: 0.55 ± 0.27
GlyIle: 5.862 ± 1.797
GlyLys: 5.404 ± 0.863
GlyLeu: 6.229 ± 0.989
GlyMet: 1.374 ± 0.786
GlyAsn: 3.847 ± 0.68
GlyPro: 1.282 ± 0.428
GlyGln: 3.206 ± 0.496
GlyArg: 2.931 ± 0.547
GlySer: 4.03 ± 0.752
GlyThr: 4.58 ± 0.817
GlyVal: 5.13 ± 0.836
GlyTrp: 0.641 ± 0.313
GlyTyr: 3.206 ± 0.596
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.008 ± 0.295
HisCys: 0.092 ± 0.1
HisAsp: 0.916 ± 0.259
HisGlu: 0.641 ± 0.29
HisPhe: 0.733 ± 0.27
HisGly: 1.008 ± 0.303
HisHis: 0.366 ± 0.148
HisIle: 1.008 ± 0.273
HisLys: 1.282 ± 0.318
HisLeu: 0.824 ± 0.284
HisMet: 0.366 ± 0.178
HisAsn: 0.824 ± 0.299
HisPro: 0.458 ± 0.201
HisGln: 0.55 ± 0.258
HisArg: 0.824 ± 0.293
HisSer: 0.916 ± 0.321
HisThr: 1.008 ± 0.319
HisVal: 1.099 ± 0.386
HisTrp: 0.183 ± 0.141
HisTyr: 0.366 ± 0.17
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.763 ± 1.022
IleCys: 0.366 ± 0.172
IleAsp: 5.313 ± 0.734
IleGlu: 4.03 ± 0.681
IlePhe: 1.649 ± 0.331
IleGly: 5.954 ± 1.194
IleHis: 1.099 ± 0.318
IleIle: 3.572 ± 0.831
IleLys: 5.038 ± 0.541
IleLeu: 3.481 ± 0.522
IleMet: 2.29 ± 0.352
IleAsn: 4.214 ± 0.671
IlePro: 2.29 ± 0.621
IleGln: 2.84 ± 0.466
IleArg: 2.656 ± 0.62
IleSer: 5.954 ± 1.554
IleThr: 4.305 ± 0.63
IleVal: 4.03 ± 0.736
IleTrp: 0.733 ± 0.258
IleTyr: 2.382 ± 0.654
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.328 ± 0.998
LysCys: 0.458 ± 0.245
LysAsp: 5.038 ± 0.88
LysGlu: 7.603 ± 1.346
LysPhe: 2.107 ± 0.376
LysGly: 5.13 ± 0.512
LysHis: 1.74 ± 0.458
LysIle: 5.588 ± 0.593
LysLys: 6.32 ± 1.334
LysLeu: 6.87 ± 0.938
LysMet: 1.924 ± 0.488
LysAsn: 3.206 ± 0.607
LysPro: 3.023 ± 0.654
LysGln: 2.565 ± 0.474
LysArg: 4.488 ± 0.81
LysSer: 4.488 ± 0.501
LysThr: 5.496 ± 0.822
LysVal: 3.572 ± 0.579
LysTrp: 0.824 ± 0.196
LysTyr: 4.58 ± 1.022
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.137 ± 1.176
LeuCys: 0.183 ± 0.124
LeuAsp: 4.122 ± 0.738
LeuGlu: 6.32 ± 1.076
LeuPhe: 3.114 ± 0.407
LeuGly: 6.046 ± 0.917
LeuHis: 0.55 ± 0.227
LeuIle: 3.756 ± 0.523
LeuLys: 5.954 ± 0.89
LeuLeu: 4.946 ± 0.835
LeuMet: 1.832 ± 0.389
LeuAsn: 5.221 ± 0.716
LeuPro: 1.924 ± 0.503
LeuGln: 2.382 ± 0.516
LeuArg: 3.389 ± 0.753
LeuSer: 5.954 ± 0.63
LeuThr: 6.32 ± 0.885
LeuVal: 4.672 ± 0.599
LeuTrp: 0.55 ± 0.209
LeuTyr: 3.298 ± 0.627
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.023 ± 1.099
MetCys: 0.092 ± 0.093
MetAsp: 1.191 ± 0.36
MetGlu: 0.916 ± 0.289
MetPhe: 1.191 ± 0.292
MetGly: 1.099 ± 0.382
MetHis: 0.183 ± 0.119
MetIle: 0.916 ± 0.357
MetLys: 2.107 ± 0.45
MetLeu: 1.557 ± 0.346
MetMet: 1.099 ± 0.518
MetAsn: 1.008 ± 0.325
MetPro: 0.366 ± 0.19
MetGln: 1.466 ± 0.427
MetArg: 1.008 ± 0.283
MetSer: 2.473 ± 0.482
MetThr: 1.649 ± 0.33
MetVal: 1.74 ± 0.472
MetTrp: 0.092 ± 0.096
MetTyr: 0.733 ± 0.316
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.389 ± 0.456
AsnCys: 0.366 ± 0.151
AsnAsp: 3.389 ± 0.491
AsnGlu: 4.397 ± 0.97
AsnPhe: 2.748 ± 0.461
AsnGly: 5.771 ± 0.91
AsnHis: 1.191 ± 0.435
AsnIle: 2.565 ± 0.517
AsnLys: 5.038 ± 0.773
AsnLeu: 4.397 ± 0.748
AsnMet: 0.916 ± 0.298
AsnAsn: 3.389 ± 0.859
AsnPro: 2.473 ± 0.511
AsnGln: 2.29 ± 0.541
AsnArg: 2.656 ± 0.56
AsnSer: 3.756 ± 0.589
AsnThr: 3.206 ± 0.763
AsnVal: 3.481 ± 0.603
AsnTrp: 1.099 ± 0.369
AsnTyr: 2.015 ± 0.43
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.191 ± 0.277
ProCys: 0.183 ± 0.171
ProAsp: 2.015 ± 0.488
ProGlu: 1.649 ± 0.479
ProPhe: 1.282 ± 0.349
ProGly: 0.824 ± 0.337
ProHis: 0.275 ± 0.134
ProIle: 1.74 ± 0.352
ProLys: 2.931 ± 0.487
ProLeu: 1.74 ± 0.373
ProMet: 0.092 ± 0.091
ProAsn: 1.832 ± 0.424
ProPro: 0.824 ± 0.256
ProGln: 1.74 ± 0.277
ProArg: 1.191 ± 0.375
ProSer: 1.924 ± 0.435
ProThr: 1.374 ± 0.45
ProVal: 1.466 ± 0.353
ProTrp: 0.458 ± 0.235
ProTyr: 1.282 ± 0.387
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.305 ± 0.86
GlnCys: 0.275 ± 0.146
GlnAsp: 2.107 ± 0.399
GlnGlu: 3.206 ± 0.836
GlnPhe: 2.656 ± 0.584
GlnGly: 2.107 ± 0.617
GlnHis: 0.55 ± 0.232
GlnIle: 1.924 ± 0.607
GlnLys: 2.84 ± 0.632
GlnLeu: 4.397 ± 0.561
GlnMet: 1.099 ± 0.289
GlnAsn: 1.74 ± 0.368
GlnPro: 0.824 ± 0.26
GlnGln: 1.832 ± 0.692
GlnArg: 1.649 ± 0.469
GlnSer: 2.107 ± 0.607
GlnThr: 2.656 ± 0.401
GlnVal: 2.198 ± 0.319
GlnTrp: 0.458 ± 0.216
GlnTyr: 1.008 ± 0.371
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.756 ± 0.493
ArgCys: 0.641 ± 0.245
ArgAsp: 1.924 ± 0.382
ArgGlu: 3.572 ± 0.748
ArgPhe: 1.466 ± 0.357
ArgGly: 2.198 ± 0.438
ArgHis: 0.275 ± 0.143
ArgIle: 3.481 ± 0.825
ArgLys: 3.847 ± 0.74
ArgLeu: 3.664 ± 0.677
ArgMet: 1.282 ± 0.425
ArgAsn: 1.924 ± 0.467
ArgPro: 0.916 ± 0.349
ArgGln: 2.198 ± 0.517
ArgArg: 1.557 ± 0.513
ArgSer: 2.565 ± 0.443
ArgThr: 2.29 ± 0.525
ArgVal: 2.656 ± 0.566
ArgTrp: 0.55 ± 0.231
ArgTyr: 2.29 ± 0.47
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.32 ± 2.878
SerCys: 0.366 ± 0.179
SerAsp: 4.488 ± 0.726
SerGlu: 3.572 ± 0.697
SerPhe: 2.748 ± 0.401
SerGly: 3.939 ± 0.536
SerHis: 1.099 ± 0.366
SerIle: 6.137 ± 0.699
SerLys: 6.32 ± 0.821
SerLeu: 4.305 ± 0.891
SerMet: 1.191 ± 0.284
SerAsn: 4.03 ± 0.622
SerPro: 1.466 ± 0.407
SerGln: 3.389 ± 1.039
SerArg: 2.198 ± 0.419
SerSer: 3.572 ± 1.068
SerThr: 4.672 ± 0.841
SerVal: 5.038 ± 0.838
SerTrp: 0.458 ± 0.188
SerTyr: 2.473 ± 0.511
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.58 ± 1.434
ThrCys: 0.183 ± 0.129
ThrAsp: 3.023 ± 0.59
ThrGlu: 3.572 ± 0.766
ThrPhe: 3.572 ± 0.541
ThrGly: 4.305 ± 0.672
ThrHis: 1.282 ± 0.393
ThrIle: 4.672 ± 0.873
ThrLys: 6.32 ± 0.808
ThrLeu: 5.313 ± 0.595
ThrMet: 1.099 ± 0.821
ThrAsn: 3.389 ± 0.543
ThrPro: 1.649 ± 0.413
ThrGln: 2.748 ± 0.468
ThrArg: 2.473 ± 0.463
ThrSer: 3.298 ± 0.811
ThrThr: 3.114 ± 0.57
ThrVal: 4.855 ± 0.624
ThrTrp: 0.275 ± 0.184
ThrTyr: 2.473 ± 0.753
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.847 ± 1.194
ValCys: 0.458 ± 0.194
ValAsp: 4.58 ± 0.739
ValGlu: 5.679 ± 0.778
ValPhe: 3.298 ± 0.601
ValGly: 3.114 ± 0.727
ValHis: 0.641 ± 0.236
ValIle: 4.122 ± 0.731
ValLys: 5.13 ± 0.558
ValLeu: 4.214 ± 0.628
ValMet: 0.824 ± 0.304
ValAsn: 4.488 ± 0.835
ValPro: 1.832 ± 0.428
ValGln: 2.473 ± 0.556
ValArg: 2.198 ± 0.329
ValSer: 5.496 ± 0.867
ValThr: 5.13 ± 0.658
ValVal: 4.946 ± 0.716
ValTrp: 1.008 ± 0.274
ValTyr: 1.74 ± 0.531
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.458 ± 0.165
TrpCys: 0.092 ± 0.093
TrpAsp: 0.641 ± 0.246
TrpGlu: 1.008 ± 0.338
TrpPhe: 0.275 ± 0.166
TrpGly: 0.916 ± 0.29
TrpHis: 0.0 ± 0.0
TrpIle: 0.641 ± 0.306
TrpLys: 0.641 ± 0.212
TrpLeu: 1.008 ± 0.266
TrpMet: 0.183 ± 0.144
TrpAsn: 0.641 ± 0.234
TrpPro: 0.092 ± 0.084
TrpGln: 0.183 ± 0.143
TrpArg: 0.641 ± 0.266
TrpSer: 1.099 ± 0.54
TrpThr: 1.099 ± 0.318
TrpVal: 1.008 ± 0.255
TrpTrp: 0.366 ± 0.198
TrpTyr: 0.458 ± 0.24
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.84 ± 0.577
TyrCys: 0.641 ± 0.216
TyrAsp: 2.84 ± 0.732
TyrGlu: 2.382 ± 0.554
TyrPhe: 1.924 ± 0.485
TyrGly: 2.656 ± 0.535
TyrHis: 0.824 ± 0.376
TyrIle: 2.84 ± 0.59
TyrLys: 2.748 ± 0.502
TyrLeu: 2.84 ± 0.593
TyrMet: 1.191 ± 0.395
TyrAsn: 2.29 ± 0.558
TyrPro: 1.374 ± 0.385
TyrGln: 1.466 ± 0.343
TyrArg: 2.107 ± 0.537
TyrSer: 2.565 ± 0.579
TyrThr: 1.924 ± 0.583
TyrVal: 2.29 ± 0.546
TyrTrp: 0.275 ± 0.168
TyrTyr: 1.282 ± 0.513
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (10918 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
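A protein-level bootstrap of this kind can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the function names are hypothetical, sequences are assumed to be one-letter strings, and the reported error is assumed here to be the standard deviation of the resampled estimates.

```python
import random
import statistics


def pair_frequency_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the spread of a dipeptide frequency by resampling proteins.

    Whole proteins are drawn with replacement (same number as in the
    original set), so the resampling unit is the protein, not the residue.
    Assumption: the error is the sample standard deviation of the estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency_permille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in a single protein produces a larger error than one spread evenly across the proteome.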

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski