Amino acid dipepetide frequency for Streptococcus phage Javan470

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.893AlaAla: 3.893 ± 1.375
0.497AlaCys: 0.497 ± 0.184
4.638AlaAsp: 4.638 ± 0.543
6.212AlaGlu: 6.212 ± 0.788
1.491AlaPhe: 1.491 ± 0.34
3.313AlaGly: 3.313 ± 0.536
1.408AlaHis: 1.408 ± 0.41
6.129AlaIle: 6.129 ± 0.796
6.461AlaLys: 6.461 ± 0.682
7.041AlaLeu: 7.041 ± 0.936
1.988AlaMet: 1.988 ± 0.419
4.804AlaAsn: 4.804 ± 0.556
2.568AlaPro: 2.568 ± 0.401
3.727AlaGln: 3.727 ± 0.781
3.23AlaArg: 3.23 ± 0.547
5.053AlaSer: 5.053 ± 1.123
4.887AlaThr: 4.887 ± 0.658
3.562AlaVal: 3.562 ± 0.588
0.994AlaTrp: 0.994 ± 0.31
2.071AlaTyr: 2.071 ± 0.352
0.0AlaXaa: 0.0 ± 0.0
Cys
0.497CysAla: 0.497 ± 0.186
0.166CysCys: 0.166 ± 0.11
0.331CysAsp: 0.331 ± 0.147
0.58CysGlu: 0.58 ± 0.205
0.414CysPhe: 0.414 ± 0.166
0.248CysGly: 0.248 ± 0.181
0.166CysHis: 0.166 ± 0.115
0.248CysIle: 0.248 ± 0.162
0.497CysLys: 0.497 ± 0.185
0.745CysLeu: 0.745 ± 0.232
0.083CysMet: 0.083 ± 0.091
0.248CysAsn: 0.248 ± 0.14
0.248CysPro: 0.248 ± 0.136
0.0CysGln: 0.0 ± 0.0
0.414CysArg: 0.414 ± 0.174
0.248CysSer: 0.248 ± 0.151
0.0CysThr: 0.0 ± 0.0
0.331CysVal: 0.331 ± 0.158
0.083CysTrp: 0.083 ± 0.077
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.141AspAla: 4.141 ± 0.745
0.248AspCys: 0.248 ± 0.188
4.804AspAsp: 4.804 ± 0.766
4.97AspGlu: 4.97 ± 0.495
2.816AspPhe: 2.816 ± 0.379
6.129AspGly: 6.129 ± 0.73
0.911AspHis: 0.911 ± 0.255
4.141AspIle: 4.141 ± 0.711
6.544AspLys: 6.544 ± 0.695
6.295AspLeu: 6.295 ± 0.862
1.905AspMet: 1.905 ± 0.348
4.141AspAsn: 4.141 ± 0.541
1.657AspPro: 1.657 ± 0.366
2.154AspGln: 2.154 ± 0.509
2.816AspArg: 2.816 ± 0.536
2.733AspSer: 2.733 ± 0.379
3.893AspThr: 3.893 ± 0.667
4.141AspVal: 4.141 ± 0.595
0.994AspTrp: 0.994 ± 0.276
3.065AspTyr: 3.065 ± 0.633
0.0AspXaa: 0.0 ± 0.0
Glu
5.218GluAla: 5.218 ± 0.743
0.58GluCys: 0.58 ± 0.207
3.479GluAsp: 3.479 ± 0.597
6.129GluGlu: 6.129 ± 0.802
2.485GluPhe: 2.485 ± 0.454
3.976GluGly: 3.976 ± 0.533
1.491GluHis: 1.491 ± 0.479
6.378GluIle: 6.378 ± 0.804
5.632GluLys: 5.632 ± 0.729
9.111GluLeu: 9.111 ± 1.148
1.739GluMet: 1.739 ± 0.41
4.059GluAsn: 4.059 ± 0.479
1.657GluPro: 1.657 ± 0.448
2.899GluGln: 2.899 ± 0.497
3.479GluArg: 3.479 ± 0.629
3.81GluSer: 3.81 ± 0.558
3.893GluThr: 3.893 ± 0.561
6.047GluVal: 6.047 ± 0.841
1.077GluTrp: 1.077 ± 0.277
2.733GluTyr: 2.733 ± 0.448
0.0GluXaa: 0.0 ± 0.0
Phe
2.651PheAla: 2.651 ± 0.454
0.248PheCys: 0.248 ± 0.182
4.141PheAsp: 4.141 ± 0.44
3.065PheGlu: 3.065 ± 0.697
0.745PhePhe: 0.745 ± 0.283
1.988PheGly: 1.988 ± 0.371
0.166PheHis: 0.166 ± 0.122
2.982PheIle: 2.982 ± 0.463
2.402PheLys: 2.402 ± 0.426
2.071PheLeu: 2.071 ± 0.345
0.745PheMet: 0.745 ± 0.236
2.402PheAsn: 2.402 ± 0.372
0.828PhePro: 0.828 ± 0.282
0.497PheGln: 0.497 ± 0.242
1.574PheArg: 1.574 ± 0.421
1.574PheSer: 1.574 ± 0.451
2.154PheThr: 2.154 ± 0.425
1.988PheVal: 1.988 ± 0.304
0.166PheTrp: 0.166 ± 0.182
1.325PheTyr: 1.325 ± 0.296
0.0PheXaa: 0.0 ± 0.0
Gly
4.39GlyAla: 4.39 ± 0.764
0.497GlyCys: 0.497 ± 0.212
3.396GlyAsp: 3.396 ± 0.523
3.23GlyGlu: 3.23 ± 0.529
2.816GlyPhe: 2.816 ± 0.391
3.644GlyGly: 3.644 ± 0.556
1.408GlyHis: 1.408 ± 0.301
5.053GlyIle: 5.053 ± 0.771
6.626GlyLys: 6.626 ± 0.561
4.224GlyLeu: 4.224 ± 0.841
1.491GlyMet: 1.491 ± 0.314
3.479GlyAsn: 3.479 ± 0.556
0.994GlyPro: 0.994 ± 0.249
2.899GlyGln: 2.899 ± 0.509
1.822GlyArg: 1.822 ± 0.272
3.313GlySer: 3.313 ± 0.411
3.313GlyThr: 3.313 ± 0.439
5.715GlyVal: 5.715 ± 0.945
1.574GlyTrp: 1.574 ± 0.412
2.402GlyTyr: 2.402 ± 0.571
0.0GlyXaa: 0.0 ± 0.0
His
1.491HisAla: 1.491 ± 0.37
0.0HisCys: 0.0 ± 0.0
0.745HisAsp: 0.745 ± 0.25
1.574HisGlu: 1.574 ± 0.402
1.242HisPhe: 1.242 ± 0.33
1.16HisGly: 1.16 ± 0.306
0.414HisHis: 0.414 ± 0.194
0.745HisIle: 0.745 ± 0.262
0.911HisLys: 0.911 ± 0.271
1.657HisLeu: 1.657 ± 0.455
0.166HisMet: 0.166 ± 0.112
0.414HisAsn: 0.414 ± 0.172
0.497HisPro: 0.497 ± 0.196
0.497HisGln: 0.497 ± 0.236
0.497HisArg: 0.497 ± 0.152
1.325HisSer: 1.325 ± 0.43
1.16HisThr: 1.16 ± 0.309
0.663HisVal: 0.663 ± 0.243
0.166HisTrp: 0.166 ± 0.13
0.58HisTyr: 0.58 ± 0.241
0.0HisXaa: 0.0 ± 0.0
Ile
5.135IleAla: 5.135 ± 0.697
0.248IleCys: 0.248 ± 0.139
6.212IleAsp: 6.212 ± 0.777
5.55IleGlu: 5.55 ± 0.713
2.071IlePhe: 2.071 ± 0.507
3.562IleGly: 3.562 ± 0.454
0.497IleHis: 0.497 ± 0.212
3.727IleIle: 3.727 ± 0.682
7.952IleLys: 7.952 ± 0.836
4.141IleLeu: 4.141 ± 0.65
1.16IleMet: 1.16 ± 0.339
5.881IleAsn: 5.881 ± 0.707
1.408IlePro: 1.408 ± 0.368
1.822IleGln: 1.822 ± 0.292
2.154IleArg: 2.154 ± 0.354
4.473IleSer: 4.473 ± 0.647
4.556IleThr: 4.556 ± 0.434
4.059IleVal: 4.059 ± 0.851
0.497IleTrp: 0.497 ± 0.254
2.899IleTyr: 2.899 ± 0.487
0.0IleXaa: 0.0 ± 0.0
Lys
7.703LysAla: 7.703 ± 0.697
0.497LysCys: 0.497 ± 0.216
5.053LysAsp: 5.053 ± 0.746
6.295LysGlu: 6.295 ± 0.737
2.651LysPhe: 2.651 ± 0.526
4.556LysGly: 4.556 ± 0.66
1.077LysHis: 1.077 ± 0.249
5.798LysIle: 5.798 ± 0.627
6.378LysLys: 6.378 ± 0.857
6.875LysLeu: 6.875 ± 0.873
2.485LysMet: 2.485 ± 0.355
4.887LysAsn: 4.887 ± 0.568
2.899LysPro: 2.899 ± 0.448
4.141LysGln: 4.141 ± 0.516
3.727LysArg: 3.727 ± 0.632
4.307LysSer: 4.307 ± 0.589
5.384LysThr: 5.384 ± 0.571
6.875LysVal: 6.875 ± 0.895
1.077LysTrp: 1.077 ± 0.297
3.644LysTyr: 3.644 ± 0.406
0.0LysXaa: 0.0 ± 0.0
Leu
6.709LeuAla: 6.709 ± 1.007
0.414LeuCys: 0.414 ± 0.214
6.461LeuAsp: 6.461 ± 0.775
7.62LeuGlu: 7.62 ± 0.843
3.23LeuPhe: 3.23 ± 0.406
5.798LeuGly: 5.798 ± 0.61
1.408LeuHis: 1.408 ± 0.329
5.301LeuIle: 5.301 ± 0.893
7.537LeuLys: 7.537 ± 0.819
5.715LeuLeu: 5.715 ± 0.728
1.16LeuMet: 1.16 ± 0.257
5.053LeuAsn: 5.053 ± 0.832
3.23LeuPro: 3.23 ± 0.509
2.982LeuGln: 2.982 ± 0.391
4.059LeuArg: 4.059 ± 0.739
4.887LeuSer: 4.887 ± 0.636
6.129LeuThr: 6.129 ± 0.74
4.307LeuVal: 4.307 ± 0.499
0.994LeuTrp: 0.994 ± 0.33
2.319LeuTyr: 2.319 ± 0.424
0.0LeuXaa: 0.0 ± 0.0
Met
1.574MetAla: 1.574 ± 0.335
0.083MetCys: 0.083 ± 0.087
1.657MetAsp: 1.657 ± 0.476
1.905MetGlu: 1.905 ± 0.382
0.414MetPhe: 0.414 ± 0.177
1.16MetGly: 1.16 ± 0.35
0.248MetHis: 0.248 ± 0.134
1.242MetIle: 1.242 ± 0.289
1.657MetLys: 1.657 ± 0.448
1.491MetLeu: 1.491 ± 0.336
0.331MetMet: 0.331 ± 0.175
0.911MetAsn: 0.911 ± 0.288
1.077MetPro: 1.077 ± 0.296
1.408MetGln: 1.408 ± 0.403
1.822MetArg: 1.822 ± 0.268
1.822MetSer: 1.822 ± 0.438
2.154MetThr: 2.154 ± 0.464
1.574MetVal: 1.574 ± 0.446
0.497MetTrp: 0.497 ± 0.181
0.331MetTyr: 0.331 ± 0.149
0.0MetXaa: 0.0 ± 0.0
Asn
5.053AsnAla: 5.053 ± 0.786
0.331AsnCys: 0.331 ± 0.167
2.982AsnAsp: 2.982 ± 0.473
3.81AsnGlu: 3.81 ± 0.633
1.905AsnPhe: 1.905 ± 0.479
4.39AsnGly: 4.39 ± 0.668
0.911AsnHis: 0.911 ± 0.323
3.81AsnIle: 3.81 ± 0.575
4.97AsnLys: 4.97 ± 0.635
4.97AsnLeu: 4.97 ± 0.57
1.739AsnMet: 1.739 ± 0.306
3.23AsnAsn: 3.23 ± 0.517
1.988AsnPro: 1.988 ± 0.461
3.148AsnGln: 3.148 ± 0.396
1.988AsnArg: 1.988 ± 0.337
4.141AsnSer: 4.141 ± 0.852
2.982AsnThr: 2.982 ± 0.567
2.651AsnVal: 2.651 ± 0.405
0.994AsnTrp: 0.994 ± 0.362
1.657AsnTyr: 1.657 ± 0.341
0.0AsnXaa: 0.0 ± 0.0
Pro
1.325ProAla: 1.325 ± 0.337
0.083ProCys: 0.083 ± 0.085
2.485ProAsp: 2.485 ± 0.417
2.651ProGlu: 2.651 ± 0.398
0.994ProPhe: 0.994 ± 0.307
1.325ProGly: 1.325 ± 0.405
0.497ProHis: 0.497 ± 0.21
1.657ProIle: 1.657 ± 0.464
3.148ProLys: 3.148 ± 0.502
3.065ProLeu: 3.065 ± 0.545
0.58ProMet: 0.58 ± 0.201
1.574ProAsn: 1.574 ± 0.353
0.58ProPro: 0.58 ± 0.194
1.739ProGln: 1.739 ± 0.612
0.828ProArg: 0.828 ± 0.246
1.822ProSer: 1.822 ± 0.38
2.319ProThr: 2.319 ± 0.482
1.739ProVal: 1.739 ± 0.398
0.0ProTrp: 0.0 ± 0.0
1.077ProTyr: 1.077 ± 0.298
0.0ProXaa: 0.0 ± 0.0
Gln
3.313GlnAla: 3.313 ± 0.546
0.248GlnCys: 0.248 ± 0.123
1.574GlnAsp: 1.574 ± 0.402
3.479GlnGlu: 3.479 ± 0.587
1.739GlnPhe: 1.739 ± 0.361
2.651GlnGly: 2.651 ± 0.581
0.745GlnHis: 0.745 ± 0.242
2.982GlnIle: 2.982 ± 0.428
4.059GlnLys: 4.059 ± 0.611
3.893GlnLeu: 3.893 ± 0.661
1.408GlnMet: 1.408 ± 0.262
2.485GlnAsn: 2.485 ± 0.497
1.905GlnPro: 1.905 ± 0.641
2.899GlnGln: 2.899 ± 0.846
2.154GlnArg: 2.154 ± 0.367
2.154GlnSer: 2.154 ± 0.351
3.23GlnThr: 3.23 ± 0.531
1.491GlnVal: 1.491 ± 0.371
0.58GlnTrp: 0.58 ± 0.195
0.745GlnTyr: 0.745 ± 0.226
0.0GlnXaa: 0.0 ± 0.0
Arg
2.733ArgAla: 2.733 ± 0.467
0.166ArgCys: 0.166 ± 0.155
2.236ArgAsp: 2.236 ± 0.415
2.651ArgGlu: 2.651 ± 0.527
1.325ArgPhe: 1.325 ± 0.273
2.485ArgGly: 2.485 ± 0.535
0.911ArgHis: 0.911 ± 0.268
2.236ArgIle: 2.236 ± 0.288
4.224ArgLys: 4.224 ± 0.653
3.976ArgLeu: 3.976 ± 0.533
0.745ArgMet: 0.745 ± 0.183
2.154ArgAsn: 2.154 ± 0.383
0.911ArgPro: 0.911 ± 0.323
1.739ArgGln: 1.739 ± 0.377
1.408ArgArg: 1.408 ± 0.306
1.739ArgSer: 1.739 ± 0.378
2.154ArgThr: 2.154 ± 0.353
3.148ArgVal: 3.148 ± 0.412
0.414ArgTrp: 0.414 ± 0.177
2.154ArgTyr: 2.154 ± 0.532
0.0ArgXaa: 0.0 ± 0.0
Ser
3.893SerAla: 3.893 ± 0.778
0.248SerCys: 0.248 ± 0.136
4.556SerAsp: 4.556 ± 0.524
4.39SerGlu: 4.39 ± 0.519
2.402SerPhe: 2.402 ± 0.457
3.313SerGly: 3.313 ± 0.623
0.828SerHis: 0.828 ± 0.27
3.479SerIle: 3.479 ± 0.626
4.638SerLys: 4.638 ± 0.605
5.218SerLeu: 5.218 ± 0.711
1.242SerMet: 1.242 ± 0.22
3.23SerAsn: 3.23 ± 0.722
1.325SerPro: 1.325 ± 0.271
2.982SerGln: 2.982 ± 0.502
2.071SerArg: 2.071 ± 0.414
2.651SerSer: 2.651 ± 0.565
3.23SerThr: 3.23 ± 0.813
3.23SerVal: 3.23 ± 0.488
0.745SerTrp: 0.745 ± 0.277
1.739SerTyr: 1.739 ± 0.375
0.0SerXaa: 0.0 ± 0.0
Thr
6.461ThrAla: 6.461 ± 0.893
0.166ThrCys: 0.166 ± 0.11
4.141ThrAsp: 4.141 ± 0.524
3.81ThrGlu: 3.81 ± 0.595
1.408ThrPhe: 1.408 ± 0.305
4.804ThrGly: 4.804 ± 0.54
1.16ThrHis: 1.16 ± 0.331
4.307ThrIle: 4.307 ± 0.493
3.81ThrLys: 3.81 ± 0.606
5.715ThrLeu: 5.715 ± 0.59
1.325ThrMet: 1.325 ± 0.381
3.23ThrAsn: 3.23 ± 0.634
2.485ThrPro: 2.485 ± 0.417
2.816ThrGln: 2.816 ± 0.576
1.739ThrArg: 1.739 ± 0.353
3.479ThrSer: 3.479 ± 0.481
5.467ThrThr: 5.467 ± 0.777
4.804ThrVal: 4.804 ± 0.579
0.828ThrTrp: 0.828 ± 0.246
2.651ThrTyr: 2.651 ± 0.509
0.0ThrXaa: 0.0 ± 0.0
Val
4.39ValAla: 4.39 ± 0.631
0.331ValCys: 0.331 ± 0.152
4.97ValAsp: 4.97 ± 0.516
4.887ValGlu: 4.887 ± 0.661
2.154ValPhe: 2.154 ± 0.545
4.721ValGly: 4.721 ± 0.673
0.663ValHis: 0.663 ± 0.273
4.307ValIle: 4.307 ± 0.529
5.135ValLys: 5.135 ± 0.591
4.721ValLeu: 4.721 ± 0.681
1.988ValMet: 1.988 ± 0.443
2.982ValAsn: 2.982 ± 0.432
1.739ValPro: 1.739 ± 0.461
1.988ValGln: 1.988 ± 0.449
2.071ValArg: 2.071 ± 0.361
3.81ValSer: 3.81 ± 0.61
4.556ValThr: 4.556 ± 0.534
4.39ValVal: 4.39 ± 0.54
0.745ValTrp: 0.745 ± 0.309
2.236ValTyr: 2.236 ± 0.619
0.0ValXaa: 0.0 ± 0.0
Trp
0.828TrpAla: 0.828 ± 0.249
0.166TrpCys: 0.166 ± 0.115
1.739TrpAsp: 1.739 ± 0.442
0.911TrpGlu: 0.911 ± 0.274
0.497TrpPhe: 0.497 ± 0.162
0.911TrpGly: 0.911 ± 0.342
0.166TrpHis: 0.166 ± 0.114
1.491TrpIle: 1.491 ± 0.42
0.745TrpLys: 0.745 ± 0.241
0.911TrpLeu: 0.911 ± 0.248
0.414TrpMet: 0.414 ± 0.198
0.663TrpAsn: 0.663 ± 0.225
0.083TrpPro: 0.083 ± 0.085
0.58TrpGln: 0.58 ± 0.218
0.58TrpArg: 0.58 ± 0.235
0.828TrpSer: 0.828 ± 0.242
0.58TrpThr: 0.58 ± 0.268
0.497TrpVal: 0.497 ± 0.251
0.0TrpTrp: 0.0 ± 0.0
0.331TrpTyr: 0.331 ± 0.162
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.651TyrAla: 2.651 ± 0.406
0.331TyrCys: 0.331 ± 0.154
2.899TyrAsp: 2.899 ± 0.448
2.154TyrGlu: 2.154 ± 0.34
0.911TyrPhe: 0.911 ± 0.246
2.154TyrGly: 2.154 ± 0.426
0.828TyrHis: 0.828 ± 0.319
1.988TyrIle: 1.988 ± 0.401
2.982TyrLys: 2.982 ± 0.617
3.479TyrLeu: 3.479 ± 0.629
0.58TyrMet: 0.58 ± 0.231
1.822TyrAsn: 1.822 ± 0.377
1.408TyrPro: 1.408 ± 0.309
2.816TyrGln: 2.816 ± 0.752
1.077TyrArg: 1.077 ± 0.295
1.242TyrSer: 1.242 ± 0.322
2.568TyrThr: 2.568 ± 0.587
1.574TyrVal: 1.574 ± 0.396
0.497TyrTrp: 0.497 ± 0.195
1.16TyrTyr: 1.16 ± 0.311
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (12074 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski