Amino acid dipepetide frequency for Streptococcus phage Javan202

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.653AlaAla: 4.653 ± 1.105
0.499AlaCys: 0.499 ± 0.308
5.484AlaAsp: 5.484 ± 0.7
6.149AlaGlu: 6.149 ± 0.659
2.742AlaPhe: 2.742 ± 0.554
4.986AlaGly: 4.986 ± 0.944
0.665AlaHis: 0.665 ± 0.23
5.651AlaIle: 5.651 ± 0.849
6.731AlaLys: 6.731 ± 0.883
5.401AlaLeu: 5.401 ± 0.755
1.246AlaMet: 1.246 ± 0.382
4.072AlaAsn: 4.072 ± 0.523
1.828AlaPro: 1.828 ± 0.365
1.994AlaGln: 1.994 ± 0.408
1.994AlaArg: 1.994 ± 0.403
3.407AlaSer: 3.407 ± 0.773
4.653AlaThr: 4.653 ± 0.97
4.82AlaVal: 4.82 ± 0.625
1.163AlaTrp: 1.163 ± 0.448
3.324AlaTyr: 3.324 ± 0.456
0.0AlaXaa: 0.0 ± 0.0
Cys
0.332CysAla: 0.332 ± 0.197
0.083CysCys: 0.083 ± 0.079
0.415CysAsp: 0.415 ± 0.215
0.831CysGlu: 0.831 ± 0.303
0.083CysPhe: 0.083 ± 0.082
0.831CysGly: 0.831 ± 0.304
0.166CysHis: 0.166 ± 0.114
0.166CysIle: 0.166 ± 0.117
0.665CysLys: 0.665 ± 0.269
0.332CysLeu: 0.332 ± 0.152
0.166CysMet: 0.166 ± 0.114
0.582CysAsn: 0.582 ± 0.255
0.332CysPro: 0.332 ± 0.253
0.332CysGln: 0.332 ± 0.182
0.415CysArg: 0.415 ± 0.236
0.831CysSer: 0.831 ± 0.265
0.499CysThr: 0.499 ± 0.237
0.249CysVal: 0.249 ± 0.132
0.083CysTrp: 0.083 ± 0.087
0.914CysTyr: 0.914 ± 0.307
0.0CysXaa: 0.0 ± 0.0
Asp
2.742AspAla: 2.742 ± 0.678
0.748AspCys: 0.748 ± 0.211
4.238AspAsp: 4.238 ± 0.884
6.565AspGlu: 6.565 ± 0.949
3.407AspPhe: 3.407 ± 0.42
5.401AspGly: 5.401 ± 0.715
0.665AspHis: 0.665 ± 0.212
3.906AspIle: 3.906 ± 0.663
5.568AspLys: 5.568 ± 0.729
4.82AspLeu: 4.82 ± 0.71
1.33AspMet: 1.33 ± 0.37
3.906AspAsn: 3.906 ± 0.573
1.246AspPro: 1.246 ± 0.322
1.662AspGln: 1.662 ± 0.536
1.994AspArg: 1.994 ± 0.384
3.906AspSer: 3.906 ± 0.724
3.324AspThr: 3.324 ± 0.526
3.823AspVal: 3.823 ± 0.51
0.748AspTrp: 0.748 ± 0.279
2.825AspTyr: 2.825 ± 0.534
0.0AspXaa: 0.0 ± 0.0
Glu
5.318GluAla: 5.318 ± 0.627
0.582GluCys: 0.582 ± 0.304
4.57GluAsp: 4.57 ± 0.715
5.152GluGlu: 5.152 ± 0.738
3.573GluPhe: 3.573 ± 0.538
4.321GluGly: 4.321 ± 0.693
0.914GluHis: 0.914 ± 0.244
5.318GluIle: 5.318 ± 0.702
5.318GluLys: 5.318 ± 0.867
7.396GluLeu: 7.396 ± 1.08
2.659GluMet: 2.659 ± 0.581
4.487GluAsn: 4.487 ± 0.654
2.576GluPro: 2.576 ± 0.675
2.825GluGln: 2.825 ± 0.522
2.659GluArg: 2.659 ± 0.587
3.241GluSer: 3.241 ± 0.47
5.651GluThr: 5.651 ± 0.507
6.814GluVal: 6.814 ± 0.71
1.163GluTrp: 1.163 ± 0.355
2.576GluTyr: 2.576 ± 0.525
0.0GluXaa: 0.0 ± 0.0
Phe
2.41PheAla: 2.41 ± 0.598
0.499PheCys: 0.499 ± 0.254
2.825PheAsp: 2.825 ± 0.432
4.072PheGlu: 4.072 ± 0.693
2.41PhePhe: 2.41 ± 0.703
2.41PheGly: 2.41 ± 0.454
0.665PheHis: 0.665 ± 0.187
1.579PheIle: 1.579 ± 0.402
3.906PheLys: 3.906 ± 0.613
2.576PheLeu: 2.576 ± 0.434
0.249PheMet: 0.249 ± 0.133
2.908PheAsn: 2.908 ± 0.593
0.914PhePro: 0.914 ± 0.279
0.997PheGln: 0.997 ± 0.256
1.33PheArg: 1.33 ± 0.347
3.158PheSer: 3.158 ± 0.893
2.576PheThr: 2.576 ± 0.591
1.745PheVal: 1.745 ± 0.396
1.08PheTrp: 1.08 ± 0.281
1.911PheTyr: 1.911 ± 0.293
0.0PheXaa: 0.0 ± 0.0
Gly
4.321GlyAla: 4.321 ± 0.838
0.332GlyCys: 0.332 ± 0.138
2.908GlyAsp: 2.908 ± 0.439
4.238GlyGlu: 4.238 ± 0.6
3.407GlyPhe: 3.407 ± 0.569
5.651GlyGly: 5.651 ± 0.857
1.496GlyHis: 1.496 ± 0.319
4.404GlyIle: 4.404 ± 1.587
6.565GlyLys: 6.565 ± 0.681
5.401GlyLeu: 5.401 ± 0.623
1.33GlyMet: 1.33 ± 0.342
3.158GlyAsn: 3.158 ± 0.653
0.582GlyPro: 0.582 ± 0.227
3.075GlyGln: 3.075 ± 0.55
2.161GlyArg: 2.161 ± 0.498
4.986GlySer: 4.986 ± 1.053
5.152GlyThr: 5.152 ± 0.777
5.318GlyVal: 5.318 ± 1.042
0.997GlyTrp: 0.997 ± 0.302
3.989GlyTyr: 3.989 ± 0.663
0.0GlyXaa: 0.0 ± 0.0
His
1.08HisAla: 1.08 ± 0.393
0.249HisCys: 0.249 ± 0.144
1.08HisAsp: 1.08 ± 0.314
0.831HisGlu: 0.831 ± 0.263
0.332HisPhe: 0.332 ± 0.189
0.914HisGly: 0.914 ± 0.218
0.0HisHis: 0.0 ± 0.0
0.914HisIle: 0.914 ± 0.288
0.582HisLys: 0.582 ± 0.231
1.163HisLeu: 1.163 ± 0.339
0.332HisMet: 0.332 ± 0.144
0.914HisAsn: 0.914 ± 0.272
0.582HisPro: 0.582 ± 0.176
0.415HisGln: 0.415 ± 0.15
0.249HisArg: 0.249 ± 0.143
1.08HisSer: 1.08 ± 0.219
0.499HisThr: 0.499 ± 0.248
0.831HisVal: 0.831 ± 0.288
0.332HisTrp: 0.332 ± 0.153
0.332HisTyr: 0.332 ± 0.182
0.0HisXaa: 0.0 ± 0.0
Ile
4.238IleAla: 4.238 ± 0.614
0.083IleCys: 0.083 ± 0.081
4.487IleAsp: 4.487 ± 0.797
4.903IleGlu: 4.903 ± 0.631
2.41IlePhe: 2.41 ± 0.438
3.075IleGly: 3.075 ± 0.534
0.665IleHis: 0.665 ± 0.232
3.241IleIle: 3.241 ± 0.469
5.734IleLys: 5.734 ± 0.859
4.238IleLeu: 4.238 ± 0.483
1.246IleMet: 1.246 ± 0.265
4.072IleAsn: 4.072 ± 0.618
1.745IlePro: 1.745 ± 0.331
1.994IleGln: 1.994 ± 0.369
2.077IleArg: 2.077 ± 0.377
4.82IleSer: 4.82 ± 0.614
5.401IleThr: 5.401 ± 0.842
2.992IleVal: 2.992 ± 0.856
0.748IleTrp: 0.748 ± 0.296
2.41IleTyr: 2.41 ± 0.474
0.0IleXaa: 0.0 ± 0.0
Lys
5.734LysAla: 5.734 ± 0.803
0.914LysCys: 0.914 ± 0.262
5.484LysAsp: 5.484 ± 0.802
6.897LysGlu: 6.897 ± 1.073
2.077LysPhe: 2.077 ± 0.377
5.235LysGly: 5.235 ± 0.551
1.246LysHis: 1.246 ± 0.327
5.651LysIle: 5.651 ± 0.822
5.568LysLys: 5.568 ± 1.111
7.645LysLeu: 7.645 ± 1.095
2.161LysMet: 2.161 ± 0.436
5.152LysAsn: 5.152 ± 0.554
2.161LysPro: 2.161 ± 0.508
3.906LysGln: 3.906 ± 0.613
3.324LysArg: 3.324 ± 0.534
5.734LysSer: 5.734 ± 0.75
6.897LysThr: 6.897 ± 0.805
4.986LysVal: 4.986 ± 0.53
1.08LysTrp: 1.08 ± 0.27
2.908LysTyr: 2.908 ± 0.707
0.0LysXaa: 0.0 ± 0.0
Leu
5.235LeuAla: 5.235 ± 0.613
0.415LeuCys: 0.415 ± 0.242
4.903LeuAsp: 4.903 ± 0.756
6.232LeuGlu: 6.232 ± 0.902
2.576LeuPhe: 2.576 ± 0.492
5.817LeuGly: 5.817 ± 0.703
1.163LeuHis: 1.163 ± 0.294
4.155LeuIle: 4.155 ± 0.656
8.642LeuLys: 8.642 ± 1.168
4.903LeuLeu: 4.903 ± 0.674
1.662LeuMet: 1.662 ± 0.344
3.656LeuAsn: 3.656 ± 0.627
2.742LeuPro: 2.742 ± 0.458
2.41LeuGln: 2.41 ± 0.463
3.075LeuArg: 3.075 ± 0.441
5.568LeuSer: 5.568 ± 0.651
6.149LeuThr: 6.149 ± 0.743
4.903LeuVal: 4.903 ± 0.578
0.748LeuTrp: 0.748 ± 0.461
1.828LeuTyr: 1.828 ± 0.387
0.0LeuXaa: 0.0 ± 0.0
Met
1.994MetAla: 1.994 ± 0.291
0.083MetCys: 0.083 ± 0.077
0.582MetAsp: 0.582 ± 0.221
1.496MetGlu: 1.496 ± 0.347
0.748MetPhe: 0.748 ± 0.261
0.997MetGly: 0.997 ± 0.262
0.083MetHis: 0.083 ± 0.077
1.496MetIle: 1.496 ± 0.388
2.327MetLys: 2.327 ± 0.496
2.327MetLeu: 2.327 ± 0.504
0.415MetMet: 0.415 ± 0.171
1.496MetAsn: 1.496 ± 0.316
0.665MetPro: 0.665 ± 0.233
1.33MetGln: 1.33 ± 0.366
0.831MetArg: 0.831 ± 0.26
1.33MetSer: 1.33 ± 0.339
1.828MetThr: 1.828 ± 0.346
1.163MetVal: 1.163 ± 0.274
0.415MetTrp: 0.415 ± 0.153
0.997MetTyr: 0.997 ± 0.264
0.0MetXaa: 0.0 ± 0.0
Asn
5.152AsnAla: 5.152 ± 0.765
0.582AsnCys: 0.582 ± 0.203
3.075AsnAsp: 3.075 ± 0.42
4.238AsnGlu: 4.238 ± 0.677
1.911AsnPhe: 1.911 ± 0.389
5.401AsnGly: 5.401 ± 0.917
0.582AsnHis: 0.582 ± 0.244
3.407AsnIle: 3.407 ± 0.513
4.487AsnLys: 4.487 ± 0.579
4.238AsnLeu: 4.238 ± 0.498
1.33AsnMet: 1.33 ± 0.368
3.573AsnAsn: 3.573 ± 0.641
2.161AsnPro: 2.161 ± 0.362
2.493AsnGln: 2.493 ± 0.438
2.41AsnArg: 2.41 ± 0.456
2.742AsnSer: 2.742 ± 0.504
2.992AsnThr: 2.992 ± 0.544
3.656AsnVal: 3.656 ± 0.446
0.914AsnTrp: 0.914 ± 0.311
2.825AsnTyr: 2.825 ± 0.383
0.0AsnXaa: 0.0 ± 0.0
Pro
2.576ProAla: 2.576 ± 0.47
0.083ProCys: 0.083 ± 0.079
1.911ProAsp: 1.911 ± 0.379
2.992ProGlu: 2.992 ± 0.702
1.08ProPhe: 1.08 ± 0.213
1.246ProGly: 1.246 ± 0.283
0.332ProHis: 0.332 ± 0.188
1.33ProIle: 1.33 ± 0.381
2.41ProLys: 2.41 ± 0.417
1.33ProLeu: 1.33 ± 0.432
0.415ProMet: 0.415 ± 0.159
0.997ProAsn: 0.997 ± 0.246
0.499ProPro: 0.499 ± 0.239
0.499ProGln: 0.499 ± 0.177
1.246ProArg: 1.246 ± 0.312
1.496ProSer: 1.496 ± 0.369
2.161ProThr: 2.161 ± 0.438
1.662ProVal: 1.662 ± 0.378
0.249ProTrp: 0.249 ± 0.185
1.08ProTyr: 1.08 ± 0.3
0.0ProXaa: 0.0 ± 0.0
Gln
2.992GlnAla: 2.992 ± 0.43
0.582GlnCys: 0.582 ± 0.206
2.161GlnAsp: 2.161 ± 0.352
3.656GlnGlu: 3.656 ± 0.585
1.246GlnPhe: 1.246 ± 0.27
1.911GlnGly: 1.911 ± 0.396
0.332GlnHis: 0.332 ± 0.185
2.327GlnIle: 2.327 ± 0.532
2.742GlnLys: 2.742 ± 0.454
2.992GlnLeu: 2.992 ± 0.349
0.831GlnMet: 0.831 ± 0.282
2.244GlnAsn: 2.244 ± 0.409
0.914GlnPro: 0.914 ± 0.289
2.077GlnGln: 2.077 ± 0.558
1.662GlnArg: 1.662 ± 0.33
1.579GlnSer: 1.579 ± 0.434
2.493GlnThr: 2.493 ± 0.501
2.327GlnVal: 2.327 ± 0.399
0.415GlnTrp: 0.415 ± 0.225
1.662GlnTyr: 1.662 ± 0.383
0.0GlnXaa: 0.0 ± 0.0
Arg
2.825ArgAla: 2.825 ± 0.523
0.582ArgCys: 0.582 ± 0.227
1.662ArgAsp: 1.662 ± 0.356
1.911ArgGlu: 1.911 ± 0.384
1.33ArgPhe: 1.33 ± 0.384
2.077ArgGly: 2.077 ± 0.489
0.415ArgHis: 0.415 ± 0.14
2.244ArgIle: 2.244 ± 0.439
3.49ArgLys: 3.49 ± 0.56
3.075ArgLeu: 3.075 ± 0.474
1.33ArgMet: 1.33 ± 0.381
2.327ArgAsn: 2.327 ± 0.345
0.665ArgPro: 0.665 ± 0.243
1.662ArgGln: 1.662 ± 0.346
1.994ArgArg: 1.994 ± 0.456
1.246ArgSer: 1.246 ± 0.278
2.244ArgThr: 2.244 ± 0.395
3.075ArgVal: 3.075 ± 0.497
0.997ArgTrp: 0.997 ± 0.281
2.244ArgTyr: 2.244 ± 0.494
0.0ArgXaa: 0.0 ± 0.0
Ser
4.487SerAla: 4.487 ± 0.897
0.332SerCys: 0.332 ± 0.16
4.487SerAsp: 4.487 ± 0.684
4.072SerGlu: 4.072 ± 0.613
2.659SerPhe: 2.659 ± 0.509
6.149SerGly: 6.149 ± 1.024
0.665SerHis: 0.665 ± 0.204
3.739SerIle: 3.739 ± 0.454
5.318SerLys: 5.318 ± 0.885
4.155SerLeu: 4.155 ± 0.53
1.579SerMet: 1.579 ± 0.471
4.487SerAsn: 4.487 ± 0.636
0.499SerPro: 0.499 ± 0.126
2.742SerGln: 2.742 ± 0.414
1.662SerArg: 1.662 ± 0.398
3.989SerSer: 3.989 ± 0.622
4.155SerThr: 4.155 ± 0.585
4.155SerVal: 4.155 ± 0.771
0.831SerTrp: 0.831 ± 0.272
2.077SerTyr: 2.077 ± 0.377
0.0SerXaa: 0.0 ± 0.0
Thr
6.066ThrAla: 6.066 ± 0.684
0.332ThrCys: 0.332 ± 0.145
4.072ThrAsp: 4.072 ± 0.64
4.238ThrGlu: 4.238 ± 0.645
2.161ThrPhe: 2.161 ± 0.416
5.069ThrGly: 5.069 ± 0.927
0.748ThrHis: 0.748 ± 0.238
5.235ThrIle: 5.235 ± 0.704
5.734ThrLys: 5.734 ± 0.732
5.568ThrLeu: 5.568 ± 0.807
1.08ThrMet: 1.08 ± 0.281
3.075ThrAsn: 3.075 ± 0.58
2.161ThrPro: 2.161 ± 0.392
2.244ThrGln: 2.244 ± 0.466
2.41ThrArg: 2.41 ± 0.468
4.404ThrSer: 4.404 ± 0.804
3.989ThrThr: 3.989 ± 0.906
6.731ThrVal: 6.731 ± 1.027
0.831ThrTrp: 0.831 ± 0.28
3.49ThrTyr: 3.49 ± 0.511
0.0ThrXaa: 0.0 ± 0.0
Val
5.734ValAla: 5.734 ± 0.939
0.499ValCys: 0.499 ± 0.183
5.235ValAsp: 5.235 ± 0.703
4.903ValGlu: 4.903 ± 0.639
3.49ValPhe: 3.49 ± 0.659
4.404ValGly: 4.404 ± 0.71
0.665ValHis: 0.665 ± 0.259
2.908ValIle: 2.908 ± 0.449
4.653ValLys: 4.653 ± 0.605
4.487ValLeu: 4.487 ± 0.62
1.745ValMet: 1.745 ± 0.385
2.825ValAsn: 2.825 ± 0.512
1.745ValPro: 1.745 ± 0.351
2.244ValGln: 2.244 ± 0.405
2.576ValArg: 2.576 ± 0.479
5.235ValSer: 5.235 ± 0.822
5.152ValThr: 5.152 ± 0.661
3.656ValVal: 3.656 ± 0.512
1.163ValTrp: 1.163 ± 0.553
2.41ValTyr: 2.41 ± 0.516
0.0ValXaa: 0.0 ± 0.0
Trp
1.163TrpAla: 1.163 ± 0.322
0.083TrpCys: 0.083 ± 0.082
0.499TrpAsp: 0.499 ± 0.236
1.08TrpGlu: 1.08 ± 0.296
0.415TrpPhe: 0.415 ± 0.238
0.997TrpGly: 0.997 ± 0.248
0.415TrpHis: 0.415 ± 0.167
0.997TrpIle: 0.997 ± 0.218
0.997TrpLys: 0.997 ± 0.31
1.246TrpLeu: 1.246 ± 0.439
0.166TrpMet: 0.166 ± 0.097
1.745TrpAsn: 1.745 ± 0.704
0.083TrpPro: 0.083 ± 0.08
0.665TrpGln: 0.665 ± 0.286
0.831TrpArg: 0.831 ± 0.233
0.831TrpSer: 0.831 ± 0.345
1.163TrpThr: 1.163 ± 0.444
0.914TrpVal: 0.914 ± 0.285
0.166TrpTrp: 0.166 ± 0.163
0.499TrpTyr: 0.499 ± 0.236
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.41TyrAla: 2.41 ± 0.413
0.748TyrCys: 0.748 ± 0.262
2.742TyrAsp: 2.742 ± 0.509
2.659TyrGlu: 2.659 ± 0.622
2.161TyrPhe: 2.161 ± 0.501
2.576TyrGly: 2.576 ± 0.606
0.831TyrHis: 0.831 ± 0.301
1.911TyrIle: 1.911 ± 0.341
3.324TyrLys: 3.324 ± 0.538
3.49TyrLeu: 3.49 ± 0.598
1.246TyrMet: 1.246 ± 0.281
2.493TyrAsn: 2.493 ± 0.472
1.496TyrPro: 1.496 ± 0.394
1.579TyrGln: 1.579 ± 0.472
2.576TyrArg: 2.576 ± 0.409
2.659TyrSer: 2.659 ± 0.436
2.659TyrThr: 2.659 ± 0.515
1.911TyrVal: 1.911 ± 0.421
0.831TyrTrp: 0.831 ± 0.22
1.994TyrTyr: 1.994 ± 0.512
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12035 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski