Amino acid dipepetide frequency for Streptococcus phage CHPC1036

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.572AlaAla: 3.572 ± 1.026
0.183AlaCys: 0.183 ± 0.127
4.03AlaAsp: 4.03 ± 0.723
5.037AlaGlu: 5.037 ± 0.659
2.656AlaPhe: 2.656 ± 0.699
4.671AlaGly: 4.671 ± 0.805
0.824AlaHis: 0.824 ± 0.325
4.579AlaIle: 4.579 ± 0.846
5.953AlaLys: 5.953 ± 0.951
6.869AlaLeu: 6.869 ± 0.807
1.649AlaMet: 1.649 ± 0.38
4.396AlaAsn: 4.396 ± 0.887
1.557AlaPro: 1.557 ± 0.374
2.748AlaGln: 2.748 ± 0.756
2.198AlaArg: 2.198 ± 0.479
3.847AlaSer: 3.847 ± 0.503
3.663AlaThr: 3.663 ± 0.724
4.121AlaVal: 4.121 ± 0.681
1.007AlaTrp: 1.007 ± 0.277
2.29AlaTyr: 2.29 ± 0.425
0.0AlaXaa: 0.0 ± 0.0
Cys
0.092CysAla: 0.092 ± 0.089
0.0CysCys: 0.0 ± 0.0
0.733CysAsp: 0.733 ± 0.287
0.183CysGlu: 0.183 ± 0.117
0.275CysPhe: 0.275 ± 0.211
0.092CysGly: 0.092 ± 0.093
0.092CysHis: 0.092 ± 0.081
0.092CysIle: 0.092 ± 0.109
0.458CysLys: 0.458 ± 0.21
0.55CysLeu: 0.55 ± 0.26
0.0CysMet: 0.0 ± 0.0
0.183CysAsn: 0.183 ± 0.131
0.092CysPro: 0.092 ± 0.111
0.092CysGln: 0.092 ± 0.115
0.275CysArg: 0.275 ± 0.253
0.366CysSer: 0.366 ± 0.195
0.366CysThr: 0.366 ± 0.189
0.183CysVal: 0.183 ± 0.115
0.183CysTrp: 0.183 ± 0.138
0.092CysTyr: 0.092 ± 0.089
0.0CysXaa: 0.0 ± 0.0
Asp
3.48AspAla: 3.48 ± 0.597
0.641AspCys: 0.641 ± 0.252
4.396AspAsp: 4.396 ± 0.929
4.488AspGlu: 4.488 ± 0.765
4.213AspPhe: 4.213 ± 0.488
7.601AspGly: 7.601 ± 2.099
1.007AspHis: 1.007 ± 0.357
3.847AspIle: 3.847 ± 0.527
5.129AspLys: 5.129 ± 0.685
4.946AspLeu: 4.946 ± 0.954
1.923AspMet: 1.923 ± 0.39
3.572AspAsn: 3.572 ± 0.638
2.015AspPro: 2.015 ± 0.491
1.465AspGln: 1.465 ± 0.32
3.205AspArg: 3.205 ± 0.905
3.847AspSer: 3.847 ± 0.775
3.755AspThr: 3.755 ± 0.718
3.205AspVal: 3.205 ± 0.694
0.916AspTrp: 0.916 ± 0.331
2.656AspTyr: 2.656 ± 0.431
0.0AspXaa: 0.0 ± 0.0
Glu
4.121GluAla: 4.121 ± 0.76
0.183GluCys: 0.183 ± 0.124
3.297GluAsp: 3.297 ± 0.523
5.861GluGlu: 5.861 ± 1.295
2.29GluPhe: 2.29 ± 0.455
2.748GluGly: 2.748 ± 0.46
1.649GluHis: 1.649 ± 0.484
6.411GluIle: 6.411 ± 0.833
4.671GluLys: 4.671 ± 1.301
6.96GluLeu: 6.96 ± 0.996
2.748GluMet: 2.748 ± 0.621
4.03GluAsn: 4.03 ± 0.644
1.832GluPro: 1.832 ± 0.514
4.03GluGln: 4.03 ± 0.593
3.297GluArg: 3.297 ± 0.669
3.205GluSer: 3.205 ± 0.697
4.488GluThr: 4.488 ± 0.594
3.938GluVal: 3.938 ± 0.782
0.916GluTrp: 0.916 ± 0.303
3.938GluTyr: 3.938 ± 0.707
0.0GluXaa: 0.0 ± 0.0
Phe
3.297PheAla: 3.297 ± 0.58
0.092PheCys: 0.092 ± 0.083
3.48PheAsp: 3.48 ± 0.625
2.29PheGlu: 2.29 ± 0.526
2.106PhePhe: 2.106 ± 0.335
2.564PheGly: 2.564 ± 0.535
0.458PheHis: 0.458 ± 0.176
2.106PheIle: 2.106 ± 0.504
4.671PheLys: 4.671 ± 0.715
3.022PheLeu: 3.022 ± 0.448
0.641PheMet: 0.641 ± 0.246
3.297PheAsn: 3.297 ± 0.627
0.916PhePro: 0.916 ± 0.33
0.824PheGln: 0.824 ± 0.263
1.557PheArg: 1.557 ± 0.385
3.297PheSer: 3.297 ± 0.692
2.015PheThr: 2.015 ± 0.485
2.656PheVal: 2.656 ± 0.471
0.641PheTrp: 0.641 ± 0.228
2.106PheTyr: 2.106 ± 0.488
0.0PheXaa: 0.0 ± 0.0
Gly
2.656GlyAla: 2.656 ± 0.566
0.458GlyCys: 0.458 ± 0.281
3.847GlyAsp: 3.847 ± 0.693
4.396GlyGlu: 4.396 ± 0.587
3.114GlyPhe: 3.114 ± 0.52
3.755GlyGly: 3.755 ± 0.895
0.916GlyHis: 0.916 ± 0.451
4.579GlyIle: 4.579 ± 0.654
6.594GlyLys: 6.594 ± 1.271
6.411GlyLeu: 6.411 ± 1.049
1.832GlyMet: 1.832 ± 0.352
3.663GlyAsn: 3.663 ± 0.685
1.557GlyPro: 1.557 ± 0.652
3.114GlyGln: 3.114 ± 0.658
2.473GlyArg: 2.473 ± 0.544
4.304GlySer: 4.304 ± 0.71
4.579GlyThr: 4.579 ± 0.938
3.572GlyVal: 3.572 ± 0.744
0.916GlyTrp: 0.916 ± 0.29
2.748GlyTyr: 2.748 ± 0.458
0.0GlyXaa: 0.0 ± 0.0
His
0.458HisAla: 0.458 ± 0.155
0.0HisCys: 0.0 ± 0.0
0.824HisAsp: 0.824 ± 0.348
0.366HisGlu: 0.366 ± 0.212
0.458HisPhe: 0.458 ± 0.168
0.641HisGly: 0.641 ± 0.241
0.55HisHis: 0.55 ± 0.222
0.733HisIle: 0.733 ± 0.25
1.282HisLys: 1.282 ± 0.471
1.649HisLeu: 1.649 ± 0.379
0.733HisMet: 0.733 ± 0.232
0.916HisAsn: 0.916 ± 0.267
0.55HisPro: 0.55 ± 0.231
0.824HisGln: 0.824 ± 0.283
0.916HisArg: 0.916 ± 0.34
0.824HisSer: 0.824 ± 0.31
0.641HisThr: 0.641 ± 0.289
1.374HisVal: 1.374 ± 0.326
0.092HisTrp: 0.092 ± 0.105
0.916HisTyr: 0.916 ± 0.311
0.0HisXaa: 0.0 ± 0.0
Ile
5.587IleAla: 5.587 ± 0.723
0.275IleCys: 0.275 ± 0.149
5.312IleAsp: 5.312 ± 0.717
5.587IleGlu: 5.587 ± 0.946
1.832IlePhe: 1.832 ± 0.32
4.121IleGly: 4.121 ± 0.618
0.824IleHis: 0.824 ± 0.264
3.297IleIle: 3.297 ± 0.667
6.502IleLys: 6.502 ± 0.692
4.213IleLeu: 4.213 ± 0.749
1.282IleMet: 1.282 ± 0.4
4.671IleAsn: 4.671 ± 0.633
2.748IlePro: 2.748 ± 0.477
3.114IleGln: 3.114 ± 0.477
2.015IleArg: 2.015 ± 0.418
4.03IleSer: 4.03 ± 0.551
3.755IleThr: 3.755 ± 0.469
2.564IleVal: 2.564 ± 0.471
0.916IleTrp: 0.916 ± 0.22
1.74IleTyr: 1.74 ± 0.377
0.0IleXaa: 0.0 ± 0.0
Lys
5.77LysAla: 5.77 ± 0.581
0.275LysCys: 0.275 ± 0.164
5.77LysAsp: 5.77 ± 0.778
7.693LysGlu: 7.693 ± 1.407
4.121LysPhe: 4.121 ± 0.961
5.495LysGly: 5.495 ± 0.897
1.557LysHis: 1.557 ± 0.47
5.403LysIle: 5.403 ± 0.611
7.601LysLys: 7.601 ± 1.694
7.418LysLeu: 7.418 ± 1.292
2.839LysMet: 2.839 ± 0.55
4.762LysAsn: 4.762 ± 0.589
3.389LysPro: 3.389 ± 0.49
4.762LysGln: 4.762 ± 0.906
3.572LysArg: 3.572 ± 0.677
3.938LysSer: 3.938 ± 0.642
4.946LysThr: 4.946 ± 0.642
3.663LysVal: 3.663 ± 0.583
1.282LysTrp: 1.282 ± 0.306
3.938LysTyr: 3.938 ± 0.759
0.0LysXaa: 0.0 ± 0.0
Leu
6.228LeuAla: 6.228 ± 1.087
0.458LeuCys: 0.458 ± 0.228
6.594LeuAsp: 6.594 ± 0.842
7.418LeuGlu: 7.418 ± 1.014
2.839LeuPhe: 2.839 ± 0.439
5.312LeuGly: 5.312 ± 0.884
0.824LeuHis: 0.824 ± 0.308
4.762LeuIle: 4.762 ± 0.562
8.151LeuLys: 8.151 ± 0.811
4.946LeuLeu: 4.946 ± 0.836
2.564LeuMet: 2.564 ± 0.41
5.403LeuAsn: 5.403 ± 0.586
2.748LeuPro: 2.748 ± 0.432
2.656LeuGln: 2.656 ± 0.48
3.022LeuArg: 3.022 ± 0.731
5.495LeuSer: 5.495 ± 0.594
5.495LeuThr: 5.495 ± 0.789
4.304LeuVal: 4.304 ± 0.583
0.733LeuTrp: 0.733 ± 0.353
2.381LeuTyr: 2.381 ± 0.548
0.0LeuXaa: 0.0 ± 0.0
Met
1.557MetAla: 1.557 ± 0.409
0.0MetCys: 0.0 ± 0.0
1.099MetAsp: 1.099 ± 0.362
1.374MetGlu: 1.374 ± 0.434
1.099MetPhe: 1.099 ± 0.3
1.191MetGly: 1.191 ± 0.373
0.092MetHis: 0.092 ± 0.107
2.198MetIle: 2.198 ± 0.436
3.114MetLys: 3.114 ± 0.774
2.015MetLeu: 2.015 ± 0.369
0.641MetMet: 0.641 ± 0.255
1.465MetAsn: 1.465 ± 0.331
0.733MetPro: 0.733 ± 0.181
1.099MetGln: 1.099 ± 0.301
1.007MetArg: 1.007 ± 0.261
1.923MetSer: 1.923 ± 0.429
1.465MetThr: 1.465 ± 0.339
1.557MetVal: 1.557 ± 0.39
0.092MetTrp: 0.092 ± 0.073
1.282MetTyr: 1.282 ± 0.387
0.0MetXaa: 0.0 ± 0.0
Asn
4.488AsnAla: 4.488 ± 0.981
0.183AsnCys: 0.183 ± 0.119
3.755AsnAsp: 3.755 ± 0.877
3.48AsnGlu: 3.48 ± 0.603
2.931AsnPhe: 2.931 ± 0.57
6.136AsnGly: 6.136 ± 0.975
1.099AsnHis: 1.099 ± 0.272
3.938AsnIle: 3.938 ± 0.639
3.938AsnLys: 3.938 ± 0.521
4.946AsnLeu: 4.946 ± 0.484
0.916AsnMet: 0.916 ± 0.28
3.663AsnAsn: 3.663 ± 0.627
3.389AsnPro: 3.389 ± 0.79
2.564AsnGln: 2.564 ± 0.488
2.198AsnArg: 2.198 ± 0.563
4.121AsnSer: 4.121 ± 0.548
3.114AsnThr: 3.114 ± 0.706
3.663AsnVal: 3.663 ± 0.569
1.374AsnTrp: 1.374 ± 0.365
2.473AsnTyr: 2.473 ± 0.562
0.0AsnXaa: 0.0 ± 0.0
Pro
2.198ProAla: 2.198 ± 0.406
0.0ProCys: 0.0 ± 0.0
1.649ProAsp: 1.649 ± 0.499
1.74ProGlu: 1.74 ± 0.576
1.099ProPhe: 1.099 ± 0.323
1.374ProGly: 1.374 ± 0.687
0.275ProHis: 0.275 ± 0.164
1.649ProIle: 1.649 ± 0.293
4.03ProLys: 4.03 ± 0.688
2.106ProLeu: 2.106 ± 0.392
0.366ProMet: 0.366 ± 0.184
3.022ProAsn: 3.022 ± 0.512
0.458ProPro: 0.458 ± 0.241
1.649ProGln: 1.649 ± 0.283
0.733ProArg: 0.733 ± 0.324
2.473ProSer: 2.473 ± 0.519
2.29ProThr: 2.29 ± 0.544
1.649ProVal: 1.649 ± 0.405
0.366ProTrp: 0.366 ± 0.172
1.099ProTyr: 1.099 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
3.938GlnAla: 3.938 ± 0.752
0.092GlnCys: 0.092 ± 0.086
2.748GlnAsp: 2.748 ± 0.537
3.297GlnGlu: 3.297 ± 0.727
1.191GlnPhe: 1.191 ± 0.296
3.297GlnGly: 3.297 ± 0.951
0.55GlnHis: 0.55 ± 0.241
2.198GlnIle: 2.198 ± 0.444
3.48GlnLys: 3.48 ± 0.57
4.671GlnLeu: 4.671 ± 0.7
1.282GlnMet: 1.282 ± 0.3
2.29GlnAsn: 2.29 ± 0.448
0.733GlnPro: 0.733 ± 0.262
3.663GlnGln: 3.663 ± 0.765
1.649GlnArg: 1.649 ± 0.416
2.198GlnSer: 2.198 ± 0.42
2.748GlnThr: 2.748 ± 0.393
1.923GlnVal: 1.923 ± 0.375
0.641GlnTrp: 0.641 ± 0.199
2.015GlnTyr: 2.015 ± 0.421
0.0GlnXaa: 0.0 ± 0.0
Arg
1.923ArgAla: 1.923 ± 0.395
0.092ArgCys: 0.092 ± 0.115
2.564ArgAsp: 2.564 ± 0.808
2.748ArgGlu: 2.748 ± 0.544
1.832ArgPhe: 1.832 ± 0.475
1.649ArgGly: 1.649 ± 0.36
0.641ArgHis: 0.641 ± 0.205
2.656ArgIle: 2.656 ± 0.433
3.297ArgLys: 3.297 ± 0.615
3.48ArgLeu: 3.48 ± 0.667
1.007ArgMet: 1.007 ± 0.381
2.29ArgAsn: 2.29 ± 0.45
1.007ArgPro: 1.007 ± 0.274
1.832ArgGln: 1.832 ± 0.442
1.374ArgArg: 1.374 ± 0.273
2.106ArgSer: 2.106 ± 0.507
2.748ArgThr: 2.748 ± 0.845
2.473ArgVal: 2.473 ± 0.411
0.733ArgTrp: 0.733 ± 0.263
1.923ArgTyr: 1.923 ± 0.436
0.0ArgXaa: 0.0 ± 0.0
Ser
3.48SerAla: 3.48 ± 0.565
0.366SerCys: 0.366 ± 0.3
4.671SerAsp: 4.671 ± 1.018
3.297SerGlu: 3.297 ± 0.578
2.931SerPhe: 2.931 ± 0.597
3.938SerGly: 3.938 ± 0.679
0.824SerHis: 0.824 ± 0.303
4.213SerIle: 4.213 ± 0.629
5.312SerLys: 5.312 ± 0.871
4.854SerLeu: 4.854 ± 0.585
1.649SerMet: 1.649 ± 0.318
4.304SerAsn: 4.304 ± 0.642
1.832SerPro: 1.832 ± 0.374
3.114SerGln: 3.114 ± 0.622
2.473SerArg: 2.473 ± 0.66
4.488SerSer: 4.488 ± 1.159
4.213SerThr: 4.213 ± 0.715
4.488SerVal: 4.488 ± 0.538
0.641SerTrp: 0.641 ± 0.208
1.923SerTyr: 1.923 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
4.213ThrAla: 4.213 ± 0.675
0.183ThrCys: 0.183 ± 0.131
4.579ThrAsp: 4.579 ± 0.89
4.03ThrGlu: 4.03 ± 0.693
2.381ThrPhe: 2.381 ± 0.433
3.663ThrGly: 3.663 ± 0.605
0.733ThrHis: 0.733 ± 0.21
4.854ThrIle: 4.854 ± 0.814
4.854ThrLys: 4.854 ± 0.737
5.037ThrLeu: 5.037 ± 0.821
0.641ThrMet: 0.641 ± 0.238
3.938ThrAsn: 3.938 ± 0.631
1.74ThrPro: 1.74 ± 0.499
2.656ThrGln: 2.656 ± 0.521
2.381ThrArg: 2.381 ± 0.462
3.663ThrSer: 3.663 ± 0.554
2.931ThrThr: 2.931 ± 0.793
3.938ThrVal: 3.938 ± 0.532
1.374ThrTrp: 1.374 ± 0.483
3.205ThrTyr: 3.205 ± 0.851
0.0ThrXaa: 0.0 ± 0.0
Val
5.129ValAla: 5.129 ± 0.974
0.366ValCys: 0.366 ± 0.151
3.847ValAsp: 3.847 ± 0.569
3.389ValGlu: 3.389 ± 0.834
2.473ValPhe: 2.473 ± 0.4
4.03ValGly: 4.03 ± 0.617
0.733ValHis: 0.733 ± 0.228
3.205ValIle: 3.205 ± 0.45
5.312ValLys: 5.312 ± 0.664
3.205ValLeu: 3.205 ± 0.719
0.824ValMet: 0.824 ± 0.28
3.938ValAsn: 3.938 ± 0.864
1.74ValPro: 1.74 ± 0.396
1.74ValGln: 1.74 ± 0.408
1.282ValArg: 1.282 ± 0.444
4.396ValSer: 4.396 ± 0.787
4.121ValThr: 4.121 ± 0.692
3.48ValVal: 3.48 ± 0.657
1.007ValTrp: 1.007 ± 0.286
1.74ValTyr: 1.74 ± 0.417
0.0ValXaa: 0.0 ± 0.0
Trp
0.824TrpAla: 0.824 ± 0.247
0.0TrpCys: 0.0 ± 0.0
0.824TrpAsp: 0.824 ± 0.436
0.916TrpGlu: 0.916 ± 0.26
0.824TrpPhe: 0.824 ± 0.289
0.733TrpGly: 0.733 ± 0.244
0.366TrpHis: 0.366 ± 0.191
0.916TrpIle: 0.916 ± 0.32
0.916TrpLys: 0.916 ± 0.277
1.374TrpLeu: 1.374 ± 0.346
0.092TrpMet: 0.092 ± 0.105
0.733TrpAsn: 0.733 ± 0.231
0.092TrpPro: 0.092 ± 0.103
0.733TrpGln: 0.733 ± 0.297
0.824TrpArg: 0.824 ± 0.257
1.282TrpSer: 1.282 ± 0.291
1.282TrpThr: 1.282 ± 0.761
0.916TrpVal: 0.916 ± 0.22
0.275TrpTrp: 0.275 ± 0.19
0.183TrpTyr: 0.183 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.839TyrAla: 2.839 ± 0.511
0.55TyrCys: 0.55 ± 0.275
2.381TyrAsp: 2.381 ± 0.547
3.022TyrGlu: 3.022 ± 0.681
1.374TyrPhe: 1.374 ± 0.341
2.473TyrGly: 2.473 ± 0.57
0.733TyrHis: 0.733 ± 0.267
2.748TyrIle: 2.748 ± 0.684
2.931TyrLys: 2.931 ± 0.541
3.663TyrLeu: 3.663 ± 0.563
1.282TyrMet: 1.282 ± 0.391
1.832TyrAsn: 1.832 ± 0.309
1.099TyrPro: 1.099 ± 0.358
1.923TyrGln: 1.923 ± 0.479
1.923TyrArg: 1.923 ± 0.28
3.205TyrSer: 3.205 ± 0.793
2.381TyrThr: 2.381 ± 0.584
2.29TyrVal: 2.29 ± 0.508
0.0TyrTrp: 0.0 ± 0.0
2.381TyrTyr: 2.381 ± 0.751
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (10920 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski