Amino acid dipeptide frequency for Streptococcus phage Javan311

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
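As an illustration, the calculation can be sketched in Python. This is a minimal sketch, not the database's own code: the function name `dipeptide_per_mille` is hypothetical, sequences use one-letter codes (the table uses three-letter codes), pairs are counted with overlap and never across protein boundaries, and frequencies are normalized by the total number of dipeptides. The normalization actually used by the database is not stated on this page and may differ.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for adjacent residue pairs.

    Assumption: overlapping pairs are counted within each protein only,
    and the result is normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made-up sequences, not the Javan311 proteome).
freqs = dipeptide_per_mille(["MKKLA", "AKK"])
print(freqs["KK"])  # KK occurs in 2 of the 6 pairs
```

Because every pair is counted, the frequencies over all observed dipeptides sum to 1000 per mille.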

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
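The uniform expectation and the over/under labelling can be sketched as follows. This is only an illustration under the assumption that the colouring compares each value with the uniform expectation; the helper names `expected_uniform_per_mille` and `representation` are hypothetical, and the three observed values are copied from the table.

```python
def expected_uniform_per_mille(n_residue_types=20):
    """Frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def representation(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expected frequency."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = expected_uniform_per_mille(20)  # 1000 / 400 = 2.5 per mille (0.25%)
observed = {"GluLeu": 9.143, "AlaAla": 3.212, "CysCys": 0.0}
labels = {pair: representation(v, expected) for pair, v in observed.items()}
print(labels)
```

With Xaa included (`n_residue_types=21`), the expectation drops to 1000/441, roughly 2.27 per mille.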

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
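A small sketch of the unit conversion, using hypothetical helper names; the AlaAla value is taken from the table:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


ala_ala = 3.212  # AlaAla frequency in per mille, from the table
print(per_mille_to_fraction(ala_ala))  # about 0.0032
print(per_mille_to_percent(ala_ala))   # about 0.32 (%)
```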

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error). Rows give the first amino acid of the dipeptide, columns the second.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.212 ± 1.251 | 0.165 ± 0.129 | 4.695 ± 0.553 | 6.26 ± 1.021 | 2.471 ± 0.747 | 5.518 ± 0.864 | 0.741 ± 0.39 | 6.425 ± 0.797 | 5.189 ± 0.529 | 7.495 ± 1.215 | 2.059 ± 0.537 | 4.777 ± 0.743 | 3.13 ± 0.763 | 4.695 ± 0.986 | 3.706 ± 0.527 | 6.013 ± 1.511 | 4.86 ± 0.614 | 5.271 ± 0.784 | 0.906 ± 0.277 | 2.471 ± 0.526 | 0.0 ± 0.0 |
| Cys | 0.247 ± 0.155 | 0.0 ± 0.0 | 0.247 ± 0.164 | 0.247 ± 0.122 | 0.412 ± 0.185 | 0.247 ± 0.154 | 0.082 ± 0.069 | 0.082 ± 0.089 | 0.329 ± 0.18 | 0.165 ± 0.102 | 0.165 ± 0.11 | 0.329 ± 0.153 | 0.247 ± 0.159 | 0.247 ± 0.138 | 0.329 ± 0.215 | 0.329 ± 0.16 | 0.165 ± 0.104 | 0.247 ± 0.132 | 0.165 ± 0.109 | 0.082 ± 0.086 | 0.0 ± 0.0 |
| Asp | 4.448 ± 0.807 | 0.577 ± 0.243 | 2.883 ± 0.666 | 5.601 ± 0.74 | 3.212 ± 0.709 | 4.283 ± 0.823 | 1.071 ± 0.378 | 2.636 ± 0.379 | 4.365 ± 0.72 | 5.436 ± 0.842 | 1.235 ± 0.287 | 2.389 ± 0.598 | 1.977 ± 0.475 | 2.224 ± 0.395 | 2.8 ± 0.54 | 3.048 ± 0.575 | 3.624 ± 0.701 | 3.048 ± 0.526 | 0.988 ± 0.338 | 2.142 ± 0.381 | 0.0 ± 0.0 |
| Glu | 6.836 ± 0.66 | 0.329 ± 0.198 | 4.118 ± 0.789 | 5.93 ± 1.154 | 2.306 ± 0.351 | 4.942 ± 0.645 | 1.153 ± 0.439 | 5.107 ± 0.71 | 6.425 ± 0.893 | 9.143 ± 1.082 | 1.894 ± 0.501 | 4.283 ± 0.909 | 1.977 ± 0.415 | 3.459 ± 0.724 | 4.365 ± 0.785 | 4.201 ± 0.706 | 4.448 ± 0.656 | 4.777 ± 0.767 | 1.235 ± 0.289 | 2.718 ± 0.6 | 0.0 ± 0.0 |
| Phe | 2.636 ± 0.582 | 0.165 ± 0.113 | 3.13 ± 0.497 | 4.612 ± 0.71 | 1.565 ± 0.322 | 2.8 ± 0.466 | 0.329 ± 0.146 | 2.142 ± 0.476 | 3.048 ± 0.529 | 2.471 ± 0.49 | 0.824 ± 0.271 | 2.306 ± 0.459 | 1.4 ± 0.449 | 1.4 ± 0.322 | 1.318 ± 0.334 | 2.471 ± 0.341 | 3.212 ± 0.456 | 1.565 ± 0.432 | 0.494 ± 0.261 | 1.235 ± 0.62 | 0.0 ± 0.0 |
| Gly | 4.695 ± 0.982 | 0.082 ± 0.069 | 3.048 ± 0.632 | 4.036 ± 0.531 | 3.459 ± 0.56 | 3.871 ± 0.686 | 1.153 ± 0.319 | 4.86 ± 0.736 | 6.425 ± 0.722 | 6.095 ± 0.87 | 1.977 ± 0.47 | 2.965 ± 0.51 | 1.647 ± 0.421 | 3.13 ± 0.483 | 3.624 ± 0.666 | 4.612 ± 0.841 | 4.612 ± 0.94 | 4.53 ± 0.428 | 0.577 ± 0.241 | 2.553 ± 0.465 | 0.0 ± 0.0 |
| His | 1.153 ± 0.231 | 0.0 ± 0.0 | 0.494 ± 0.19 | 0.906 ± 0.273 | 1.318 ± 0.356 | 1.071 ± 0.288 | 0.329 ± 0.166 | 0.741 ± 0.236 | 1.071 ± 0.344 | 0.988 ± 0.285 | 0.247 ± 0.148 | 0.906 ± 0.271 | 0.165 ± 0.113 | 0.577 ± 0.242 | 0.494 ± 0.194 | 0.577 ± 0.174 | 0.988 ± 0.234 | 0.494 ± 0.229 | 0.165 ± 0.18 | 0.165 ± 0.114 | 0.0 ± 0.0 |
| Ile | 5.354 ± 0.623 | 0.412 ± 0.156 | 5.107 ± 0.576 | 6.507 ± 0.992 | 1.977 ± 0.474 | 5.024 ± 0.749 | 1.153 ± 0.337 | 2.636 ± 0.495 | 4.777 ± 0.484 | 4.118 ± 0.519 | 0.988 ± 0.299 | 3.212 ± 0.472 | 2.059 ± 0.53 | 1.812 ± 0.459 | 2.059 ± 0.436 | 4.201 ± 0.627 | 3.542 ± 0.689 | 4.365 ± 0.698 | 0.412 ± 0.159 | 2.553 ± 0.618 | 0.0 ± 0.0 |
| Lys | 7.248 ± 0.967 | 0.329 ± 0.183 | 4.118 ± 0.539 | 6.754 ± 1.042 | 2.389 ± 0.432 | 5.107 ± 0.561 | 0.741 ± 0.253 | 5.518 ± 0.858 | 6.013 ± 1.033 | 6.177 ± 0.754 | 1.894 ± 0.429 | 4.448 ± 0.689 | 2.471 ± 0.715 | 3.542 ± 0.609 | 3.706 ± 0.673 | 3.706 ± 0.575 | 4.86 ± 0.736 | 5.271 ± 0.732 | 1.4 ± 0.347 | 2.636 ± 0.51 | 0.0 ± 0.0 |
| Leu | 6.836 ± 0.78 | 0.247 ± 0.194 | 5.518 ± 0.677 | 7.248 ± 1.115 | 2.718 ± 0.458 | 6.425 ± 0.954 | 0.824 ± 0.237 | 4.118 ± 0.566 | 7.907 ± 0.932 | 6.342 ± 0.803 | 1.977 ± 0.397 | 4.942 ± 0.708 | 4.448 ± 0.618 | 3.871 ± 0.671 | 3.459 ± 0.762 | 5.354 ± 0.827 | 4.942 ± 0.664 | 4.612 ± 0.623 | 0.577 ± 0.243 | 2.636 ± 0.51 | 0.0 ± 0.0 |
| Met | 2.636 ± 0.679 | 0.0 ± 0.0 | 1.483 ± 0.373 | 1.4 ± 0.408 | 0.741 ± 0.22 | 1.071 ± 0.299 | 0.082 ± 0.065 | 1.894 ± 0.333 | 1.73 ± 0.393 | 1.647 ± 0.304 | 0.577 ± 0.191 | 1.483 ± 0.599 | 0.412 ± 0.213 | 1.153 ± 0.545 | 1.235 ± 0.299 | 2.306 ± 0.533 | 1.73 ± 0.323 | 1.153 ± 0.288 | 0.247 ± 0.123 | 0.659 ± 0.246 | 0.0 ± 0.0 |
| Asn | 4.86 ± 1.134 | 0.329 ± 0.164 | 2.8 ± 0.462 | 3.954 ± 0.669 | 2.142 ± 0.335 | 5.518 ± 0.636 | 0.906 ± 0.278 | 3.624 ± 0.396 | 3.377 ± 0.692 | 4.283 ± 0.54 | 1.812 ± 0.435 | 3.624 ± 0.612 | 2.224 ± 0.545 | 2.718 ± 0.563 | 2.142 ± 0.549 | 3.13 ± 0.747 | 2.553 ± 0.519 | 3.13 ± 0.533 | 1.153 ± 0.396 | 2.306 ± 0.489 | 0.0 ± 0.0 |
| Pro | 3.13 ± 0.652 | 0.247 ± 0.187 | 2.471 ± 0.49 | 2.883 ± 1.009 | 1.483 ± 0.384 | 1.483 ± 0.342 | 0.247 ± 0.158 | 1.483 ± 0.363 | 2.8 ± 0.569 | 2.883 ± 0.647 | 0.577 ± 0.217 | 1.235 ± 0.27 | 0.741 ± 0.285 | 2.142 ± 0.664 | 0.906 ± 0.297 | 1.73 ± 0.349 | 1.812 ± 0.568 | 2.142 ± 0.372 | 0.412 ± 0.217 | 0.741 ± 0.267 | 0.0 ± 0.0 |
| Gln | 4.695 ± 0.869 | 0.082 ± 0.08 | 1.318 ± 0.36 | 3.212 ± 0.808 | 1.977 ± 0.351 | 2.8 ± 0.539 | 0.247 ± 0.154 | 2.883 ± 0.48 | 3.954 ± 0.587 | 4.53 ± 0.978 | 1.565 ± 0.523 | 2.883 ± 0.59 | 2.059 ± 0.609 | 2.718 ± 0.816 | 1.483 ± 0.355 | 2.142 ± 0.525 | 3.295 ± 0.456 | 2.718 ± 0.489 | 0.0 ± 0.0 | 1.235 ± 0.347 | 0.0 ± 0.0 |
| Arg | 3.377 ± 0.664 | 0.082 ± 0.09 | 2.636 ± 0.391 | 3.542 ± 0.695 | 1.235 ± 0.349 | 2.636 ± 0.524 | 0.988 ± 0.347 | 2.553 ± 0.564 | 4.036 ± 0.713 | 4.777 ± 0.967 | 1.153 ± 0.28 | 2.471 ± 0.542 | 1.235 ± 0.39 | 0.988 ± 0.28 | 2.224 ± 0.567 | 1.894 ± 0.371 | 2.142 ± 0.461 | 3.459 ± 0.544 | 0.412 ± 0.194 | 1.977 ± 0.479 | 0.0 ± 0.0 |
| Ser | 5.766 ± 1.328 | 0.494 ± 0.205 | 2.965 ± 0.36 | 4.448 ± 0.741 | 2.965 ± 0.62 | 4.118 ± 0.804 | 0.659 ± 0.233 | 4.118 ± 0.627 | 4.201 ± 0.58 | 4.201 ± 0.714 | 1.4 ± 0.329 | 3.13 ± 0.66 | 1.812 ± 0.398 | 2.718 ± 0.485 | 2.224 ± 0.466 | 4.283 ± 1.257 | 4.448 ± 1.003 | 4.365 ± 0.534 | 0.988 ± 0.211 | 2.059 ± 0.345 | 0.0 ± 0.0 |
| Thr | 6.177 ± 0.744 | 0.165 ± 0.096 | 3.542 ± 0.47 | 4.201 ± 0.62 | 2.553 ± 0.413 | 5.024 ± 0.704 | 0.577 ± 0.226 | 4.53 ± 0.728 | 3.789 ± 0.594 | 4.365 ± 0.457 | 1.73 ± 0.562 | 4.365 ± 0.989 | 1.73 ± 0.377 | 3.048 ± 0.529 | 2.224 ± 0.472 | 4.036 ± 0.96 | 4.365 ± 0.926 | 4.612 ± 0.794 | 0.247 ± 0.132 | 1.812 ± 0.466 | 0.0 ± 0.0 |
| Val | 4.777 ± 0.662 | 0.329 ± 0.174 | 4.283 ± 0.512 | 4.86 ± 1.016 | 1.647 ± 0.318 | 3.789 ± 0.512 | 0.906 ± 0.248 | 4.201 ± 0.645 | 4.942 ± 0.66 | 5.518 ± 0.74 | 0.824 ± 0.349 | 4.118 ± 0.523 | 1.235 ± 0.344 | 3.295 ± 0.739 | 2.718 ± 0.527 | 4.53 ± 0.812 | 4.448 ± 0.653 | 3.789 ± 0.974 | 0.412 ± 0.167 | 1.565 ± 0.502 | 0.0 ± 0.0 |
| Trp | 0.577 ± 0.21 | 0.082 ± 0.074 | 0.988 ± 0.344 | 0.494 ± 0.21 | 0.741 ± 0.287 | 0.741 ± 0.229 | 0.165 ± 0.126 | 0.824 ± 0.311 | 1.235 ± 0.341 | 0.988 ± 0.271 | 0.082 ± 0.09 | 0.824 ± 0.344 | 0.0 ± 0.0 | 0.329 ± 0.16 | 0.659 ± 0.241 | 0.741 ± 0.33 | 0.906 ± 0.251 | 0.741 ± 0.255 | 0.0 ± 0.0 | 0.247 ± 0.152 | 0.0 ± 0.0 |
| Tyr | 1.4 ± 0.335 | 0.247 ± 0.131 | 2.142 ± 0.452 | 2.553 ± 0.498 | 1.812 ± 0.422 | 1.565 ± 0.469 | 0.494 ± 0.243 | 1.894 ± 0.462 | 2.718 ± 0.496 | 3.377 ± 0.564 | 0.577 ± 0.204 | 1.894 ± 0.491 | 0.577 ± 0.226 | 1.73 ± 0.358 | 2.142 ± 0.642 | 2.059 ± 0.389 | 2.142 ± 0.36 | 1.894 ± 0.487 | 0.577 ± 0.245 | 1.235 ± 0.386 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 47 proteins (12142 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
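A protein-level bootstrap of this kind might look like the sketch below. It is an assumption-laden illustration rather than the database's procedure: whole proteins are resampled with replacement, the frequency of one dipeptide is recomputed for each replicate, and the error is taken as the sample standard deviation across replicates (the page does not say which spread statistic is reported). The function names and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    total = 0
    hits = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


proteins = ["MKKLA", "AKKGE", "MAAKL"]  # made-up sequences
error = bootstrap_error(proteins, "KK")
print(pair_per_mille(proteins, "KK"), "+/-", error)
```

Resampling whole proteins, rather than individual residues, keeps each sequence intact, so the estimate reflects variation between proteins.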

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski