Amino acid dipepetide frequency for Streptococcus phage Javan494

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.132AlaAla: 5.132 ± 0.936
0.248AlaCys: 0.248 ± 0.113
5.381AlaAsp: 5.381 ± 0.691
6.705AlaGlu: 6.705 ± 0.84
3.228AlaPhe: 3.228 ± 0.73
4.222AlaGly: 4.222 ± 0.83
0.414AlaHis: 0.414 ± 0.201
6.457AlaIle: 6.457 ± 0.77
6.871AlaLys: 6.871 ± 0.975
7.368AlaLeu: 7.368 ± 0.962
2.235AlaMet: 2.235 ± 0.493
5.795AlaAsn: 5.795 ± 0.924
1.242AlaPro: 1.242 ± 0.294
4.222AlaGln: 4.222 ± 0.791
2.483AlaArg: 2.483 ± 0.541
4.719AlaSer: 4.719 ± 1.136
5.546AlaThr: 5.546 ± 0.935
4.47AlaVal: 4.47 ± 0.814
0.414AlaTrp: 0.414 ± 0.175
3.56AlaTyr: 3.56 ± 0.725
0.0AlaXaa: 0.0 ± 0.0
Cys
0.083CysAla: 0.083 ± 0.073
0.083CysCys: 0.083 ± 0.073
0.083CysAsp: 0.083 ± 0.073
0.662CysGlu: 0.662 ± 0.258
0.083CysPhe: 0.083 ± 0.071
0.248CysGly: 0.248 ± 0.152
0.083CysHis: 0.083 ± 0.08
0.248CysIle: 0.248 ± 0.128
0.579CysLys: 0.579 ± 0.214
0.579CysLeu: 0.579 ± 0.225
0.083CysMet: 0.083 ± 0.083
0.083CysAsn: 0.083 ± 0.072
0.166CysPro: 0.166 ± 0.123
0.414CysGln: 0.414 ± 0.161
0.414CysArg: 0.414 ± 0.218
0.166CysSer: 0.166 ± 0.116
0.248CysThr: 0.248 ± 0.155
0.166CysVal: 0.166 ± 0.115
0.166CysTrp: 0.166 ± 0.108
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.56AspAla: 3.56 ± 0.48
0.248AspCys: 0.248 ± 0.141
3.891AspAsp: 3.891 ± 0.639
5.05AspGlu: 5.05 ± 0.95
3.394AspPhe: 3.394 ± 0.611
3.642AspGly: 3.642 ± 0.694
0.579AspHis: 0.579 ± 0.238
5.132AspIle: 5.132 ± 0.645
6.54AspLys: 6.54 ± 0.915
5.298AspLeu: 5.298 ± 0.841
1.738AspMet: 1.738 ± 0.366
3.146AspAsn: 3.146 ± 0.54
1.49AspPro: 1.49 ± 0.385
1.325AspGln: 1.325 ± 0.278
2.649AspArg: 2.649 ± 0.482
3.56AspSer: 3.56 ± 0.6
3.146AspThr: 3.146 ± 0.627
3.311AspVal: 3.311 ± 0.531
0.828AspTrp: 0.828 ± 0.241
3.642AspTyr: 3.642 ± 0.671
0.0AspXaa: 0.0 ± 0.0
Glu
5.877GluAla: 5.877 ± 0.716
0.248GluCys: 0.248 ± 0.124
3.063GluAsp: 3.063 ± 0.606
5.795GluGlu: 5.795 ± 0.901
2.07GluPhe: 2.07 ± 0.403
3.725GluGly: 3.725 ± 0.609
1.325GluHis: 1.325 ± 0.453
4.967GluIle: 4.967 ± 0.637
6.209GluLys: 6.209 ± 0.824
8.195GluLeu: 8.195 ± 1.125
1.987GluMet: 1.987 ± 0.454
3.725GluAsn: 3.725 ± 0.753
1.821GluPro: 1.821 ± 0.374
2.649GluGln: 2.649 ± 0.517
4.222GluArg: 4.222 ± 0.93
3.477GluSer: 3.477 ± 0.541
3.56GluThr: 3.56 ± 0.601
4.056GluVal: 4.056 ± 0.649
0.993GluTrp: 0.993 ± 0.226
3.56GluTyr: 3.56 ± 0.643
0.0GluXaa: 0.0 ± 0.0
Phe
3.146PheAla: 3.146 ± 0.717
0.248PheCys: 0.248 ± 0.157
2.401PheAsp: 2.401 ± 0.416
2.401PheGlu: 2.401 ± 0.407
1.076PhePhe: 1.076 ± 0.353
2.732PheGly: 2.732 ± 0.522
0.331PheHis: 0.331 ± 0.157
2.649PheIle: 2.649 ± 0.528
3.725PheLys: 3.725 ± 0.672
2.649PheLeu: 2.649 ± 0.541
0.745PheMet: 0.745 ± 0.251
2.732PheAsn: 2.732 ± 0.341
0.497PhePro: 0.497 ± 0.162
0.662PheGln: 0.662 ± 0.196
1.325PheArg: 1.325 ± 0.312
2.483PheSer: 2.483 ± 0.456
2.732PheThr: 2.732 ± 0.476
2.318PheVal: 2.318 ± 0.393
0.745PheTrp: 0.745 ± 0.325
1.242PheTyr: 1.242 ± 0.352
0.0PheXaa: 0.0 ± 0.0
Gly
5.132GlyAla: 5.132 ± 0.761
0.083GlyCys: 0.083 ± 0.081
4.222GlyAsp: 4.222 ± 0.622
3.477GlyGlu: 3.477 ± 0.588
2.98GlyPhe: 2.98 ± 0.589
4.553GlyGly: 4.553 ± 0.66
0.828GlyHis: 0.828 ± 0.288
4.222GlyIle: 4.222 ± 1.095
6.457GlyLys: 6.457 ± 0.823
6.623GlyLeu: 6.623 ± 0.808
1.821GlyMet: 1.821 ± 0.451
2.152GlyAsn: 2.152 ± 0.442
1.076GlyPro: 1.076 ± 0.466
1.821GlyGln: 1.821 ± 0.285
2.897GlyArg: 2.897 ± 0.515
3.311GlySer: 3.311 ± 0.461
4.47GlyThr: 4.47 ± 0.666
4.47GlyVal: 4.47 ± 0.602
0.414GlyTrp: 0.414 ± 0.197
2.732GlyTyr: 2.732 ± 0.513
0.0GlyXaa: 0.0 ± 0.0
His
0.993HisAla: 0.993 ± 0.275
0.083HisCys: 0.083 ± 0.082
0.579HisAsp: 0.579 ± 0.245
0.993HisGlu: 0.993 ± 0.344
0.828HisPhe: 0.828 ± 0.278
0.911HisGly: 0.911 ± 0.294
0.414HisHis: 0.414 ± 0.165
1.159HisIle: 1.159 ± 0.361
0.828HisLys: 0.828 ± 0.256
0.911HisLeu: 0.911 ± 0.255
0.083HisMet: 0.083 ± 0.085
0.331HisAsn: 0.331 ± 0.145
0.745HisPro: 0.745 ± 0.272
0.331HisGln: 0.331 ± 0.142
0.579HisArg: 0.579 ± 0.201
0.745HisSer: 0.745 ± 0.277
0.911HisThr: 0.911 ± 0.3
0.579HisVal: 0.579 ± 0.206
0.083HisTrp: 0.083 ± 0.073
0.414HisTyr: 0.414 ± 0.166
0.0HisXaa: 0.0 ± 0.0
Ile
5.381IleAla: 5.381 ± 0.842
0.248IleCys: 0.248 ± 0.138
7.036IleAsp: 7.036 ± 0.606
4.801IleGlu: 4.801 ± 0.765
1.987IlePhe: 1.987 ± 0.346
4.056IleGly: 4.056 ± 0.581
0.993IleHis: 0.993 ± 0.34
5.215IleIle: 5.215 ± 0.74
7.616IleLys: 7.616 ± 0.983
4.636IleLeu: 4.636 ± 0.667
1.242IleMet: 1.242 ± 0.231
4.387IleAsn: 4.387 ± 0.755
1.904IlePro: 1.904 ± 0.363
2.566IleGln: 2.566 ± 0.689
1.987IleArg: 1.987 ± 0.45
4.305IleSer: 4.305 ± 0.628
4.387IleThr: 4.387 ± 0.526
4.222IleVal: 4.222 ± 0.582
0.579IleTrp: 0.579 ± 0.229
3.808IleTyr: 3.808 ± 0.673
0.0IleXaa: 0.0 ± 0.0
Lys
6.954LysAla: 6.954 ± 0.735
0.248LysCys: 0.248 ± 0.126
5.132LysAsp: 5.132 ± 0.746
6.209LysGlu: 6.209 ± 0.907
1.904LysPhe: 1.904 ± 0.354
5.298LysGly: 5.298 ± 0.673
1.076LysHis: 1.076 ± 0.404
7.368LysIle: 7.368 ± 0.856
7.616LysLys: 7.616 ± 1.14
6.54LysLeu: 6.54 ± 0.808
2.815LysMet: 2.815 ± 0.395
4.967LysAsn: 4.967 ± 0.73
3.394LysPro: 3.394 ± 0.649
4.139LysGln: 4.139 ± 0.848
4.139LysArg: 4.139 ± 0.735
4.139LysSer: 4.139 ± 0.529
5.05LysThr: 5.05 ± 0.735
5.464LysVal: 5.464 ± 0.733
1.407LysTrp: 1.407 ± 0.401
3.642LysTyr: 3.642 ± 0.707
0.0LysXaa: 0.0 ± 0.0
Leu
6.705LeuAla: 6.705 ± 0.889
0.414LeuCys: 0.414 ± 0.199
6.043LeuAsp: 6.043 ± 0.61
6.871LeuGlu: 6.871 ± 1.164
2.649LeuPhe: 2.649 ± 0.427
5.629LeuGly: 5.629 ± 0.836
0.662LeuHis: 0.662 ± 0.245
4.884LeuIle: 4.884 ± 0.601
8.361LeuLys: 8.361 ± 1.03
6.209LeuLeu: 6.209 ± 0.788
1.821LeuMet: 1.821 ± 0.408
5.464LeuAsn: 5.464 ± 0.523
3.725LeuPro: 3.725 ± 0.849
3.642LeuGln: 3.642 ± 0.434
3.725LeuArg: 3.725 ± 0.62
7.368LeuSer: 7.368 ± 0.852
5.298LeuThr: 5.298 ± 0.669
5.381LeuVal: 5.381 ± 0.514
0.662LeuTrp: 0.662 ± 0.213
2.235LeuTyr: 2.235 ± 0.51
0.0LeuXaa: 0.0 ± 0.0
Met
2.98MetAla: 2.98 ± 0.698
0.083MetCys: 0.083 ± 0.073
1.407MetAsp: 1.407 ± 0.343
1.573MetGlu: 1.573 ± 0.321
0.828MetPhe: 0.828 ± 0.244
1.325MetGly: 1.325 ± 0.309
0.166MetHis: 0.166 ± 0.103
1.325MetIle: 1.325 ± 0.253
1.159MetLys: 1.159 ± 0.279
2.07MetLeu: 2.07 ± 0.425
0.414MetMet: 0.414 ± 0.187
1.573MetAsn: 1.573 ± 0.396
0.828MetPro: 0.828 ± 0.234
0.745MetGln: 0.745 ± 0.222
1.573MetArg: 1.573 ± 0.354
2.649MetSer: 2.649 ± 0.638
1.076MetThr: 1.076 ± 0.277
0.911MetVal: 0.911 ± 0.31
0.083MetTrp: 0.083 ± 0.065
0.497MetTyr: 0.497 ± 0.177
0.0MetXaa: 0.0 ± 0.0
Asn
5.546AsnAla: 5.546 ± 0.729
0.248AsnCys: 0.248 ± 0.142
2.566AsnAsp: 2.566 ± 0.502
3.063AsnGlu: 3.063 ± 0.717
1.738AsnPhe: 1.738 ± 0.478
3.974AsnGly: 3.974 ± 0.545
0.828AsnHis: 0.828 ± 0.252
4.056AsnIle: 4.056 ± 0.582
3.725AsnLys: 3.725 ± 0.539
5.298AsnLeu: 5.298 ± 0.669
0.828AsnMet: 0.828 ± 0.219
2.566AsnAsn: 2.566 ± 0.562
2.152AsnPro: 2.152 ± 0.447
1.656AsnGln: 1.656 ± 0.367
2.483AsnArg: 2.483 ± 0.398
4.884AsnSer: 4.884 ± 0.895
2.98AsnThr: 2.98 ± 0.463
2.815AsnVal: 2.815 ± 0.481
0.745AsnTrp: 0.745 ± 0.233
1.987AsnTyr: 1.987 ± 0.431
0.0AsnXaa: 0.0 ± 0.0
Pro
1.573ProAla: 1.573 ± 0.381
0.248ProCys: 0.248 ± 0.163
1.49ProAsp: 1.49 ± 0.391
2.401ProGlu: 2.401 ± 0.597
0.911ProPhe: 0.911 ± 0.371
1.821ProGly: 1.821 ± 0.361
0.331ProHis: 0.331 ± 0.183
1.656ProIle: 1.656 ± 0.364
2.732ProLys: 2.732 ± 0.517
1.904ProLeu: 1.904 ± 0.419
0.414ProMet: 0.414 ± 0.207
1.738ProAsn: 1.738 ± 0.401
0.331ProPro: 0.331 ± 0.123
1.573ProGln: 1.573 ± 0.322
1.242ProArg: 1.242 ± 0.359
1.821ProSer: 1.821 ± 0.371
1.904ProThr: 1.904 ± 0.397
2.401ProVal: 2.401 ± 0.471
0.166ProTrp: 0.166 ± 0.102
0.828ProTyr: 0.828 ± 0.383
0.0ProXaa: 0.0 ± 0.0
Gln
3.974GlnAla: 3.974 ± 0.761
0.166GlnCys: 0.166 ± 0.099
1.821GlnAsp: 1.821 ± 0.306
3.228GlnGlu: 3.228 ± 0.583
1.656GlnPhe: 1.656 ± 0.383
2.566GlnGly: 2.566 ± 0.459
0.166GlnHis: 0.166 ± 0.113
3.477GlnIle: 3.477 ± 0.684
3.228GlnLys: 3.228 ± 0.453
3.725GlnLeu: 3.725 ± 0.611
1.159GlnMet: 1.159 ± 0.369
2.152GlnAsn: 2.152 ± 0.419
0.414GlnPro: 0.414 ± 0.185
1.325GlnGln: 1.325 ± 0.423
1.407GlnArg: 1.407 ± 0.273
2.98GlnSer: 2.98 ± 0.628
2.235GlnThr: 2.235 ± 0.388
0.993GlnVal: 0.993 ± 0.265
0.414GlnTrp: 0.414 ± 0.205
0.993GlnTyr: 0.993 ± 0.284
0.0GlnXaa: 0.0 ± 0.0
Arg
2.98ArgAla: 2.98 ± 0.587
0.331ArgCys: 0.331 ± 0.176
2.318ArgAsp: 2.318 ± 0.507
3.311ArgGlu: 3.311 ± 0.586
2.401ArgPhe: 2.401 ± 0.482
1.904ArgGly: 1.904 ± 0.379
0.993ArgHis: 0.993 ± 0.239
3.56ArgIle: 3.56 ± 0.521
3.228ArgLys: 3.228 ± 0.562
5.629ArgLeu: 5.629 ± 0.982
0.993ArgMet: 0.993 ± 0.29
2.235ArgAsn: 2.235 ± 0.5
0.911ArgPro: 0.911 ± 0.205
1.49ArgGln: 1.49 ± 0.342
1.325ArgArg: 1.325 ± 0.389
1.573ArgSer: 1.573 ± 0.375
2.566ArgThr: 2.566 ± 0.541
2.649ArgVal: 2.649 ± 0.478
0.497ArgTrp: 0.497 ± 0.191
1.821ArgTyr: 1.821 ± 0.43
0.0ArgXaa: 0.0 ± 0.0
Ser
6.623SerAla: 6.623 ± 1.622
0.414SerCys: 0.414 ± 0.198
3.56SerAsp: 3.56 ± 0.797
3.146SerGlu: 3.146 ± 0.424
1.821SerPhe: 1.821 ± 0.541
5.877SerGly: 5.877 ± 0.861
0.993SerHis: 0.993 ± 0.377
4.719SerIle: 4.719 ± 0.71
5.05SerLys: 5.05 ± 0.557
5.381SerLeu: 5.381 ± 1.546
1.738SerMet: 1.738 ± 0.464
3.394SerAsn: 3.394 ± 0.652
1.573SerPro: 1.573 ± 0.333
2.318SerGln: 2.318 ± 0.452
3.228SerArg: 3.228 ± 0.543
5.381SerSer: 5.381 ± 1.569
3.311SerThr: 3.311 ± 0.716
4.719SerVal: 4.719 ± 0.711
0.497SerTrp: 0.497 ± 0.16
2.483SerTyr: 2.483 ± 0.469
0.0SerXaa: 0.0 ± 0.0
Thr
4.801ThrAla: 4.801 ± 0.662
0.331ThrCys: 0.331 ± 0.176
3.394ThrAsp: 3.394 ± 0.603
2.732ThrGlu: 2.732 ± 0.506
2.98ThrPhe: 2.98 ± 0.493
4.884ThrGly: 4.884 ± 0.839
0.745ThrHis: 0.745 ± 0.254
3.808ThrIle: 3.808 ± 0.513
4.801ThrLys: 4.801 ± 0.61
5.629ThrLeu: 5.629 ± 0.735
1.159ThrMet: 1.159 ± 0.313
2.649ThrAsn: 2.649 ± 0.509
1.987ThrPro: 1.987 ± 0.563
2.483ThrGln: 2.483 ± 0.48
1.904ThrArg: 1.904 ± 0.507
4.636ThrSer: 4.636 ± 1.408
4.553ThrThr: 4.553 ± 0.756
3.808ThrVal: 3.808 ± 0.469
0.662ThrTrp: 0.662 ± 0.322
1.904ThrTyr: 1.904 ± 0.492
0.0ThrXaa: 0.0 ± 0.0
Val
5.05ValAla: 5.05 ± 0.673
0.166ValCys: 0.166 ± 0.113
3.891ValAsp: 3.891 ± 0.555
4.387ValGlu: 4.387 ± 0.821
2.401ValPhe: 2.401 ± 0.445
3.891ValGly: 3.891 ± 0.64
0.993ValHis: 0.993 ± 0.295
4.222ValIle: 4.222 ± 0.567
4.967ValLys: 4.967 ± 0.481
4.387ValLeu: 4.387 ± 0.562
1.656ValMet: 1.656 ± 0.394
2.815ValAsn: 2.815 ± 0.433
1.656ValPro: 1.656 ± 0.407
2.152ValGln: 2.152 ± 0.67
2.401ValArg: 2.401 ± 0.462
4.139ValSer: 4.139 ± 0.742
3.725ValThr: 3.725 ± 0.516
4.056ValVal: 4.056 ± 0.412
0.497ValTrp: 0.497 ± 0.181
2.318ValTyr: 2.318 ± 0.551
0.0ValXaa: 0.0 ± 0.0
Trp
0.745TrpAla: 0.745 ± 0.234
0.0TrpCys: 0.0 ± 0.0
0.828TrpAsp: 0.828 ± 0.232
1.242TrpGlu: 1.242 ± 0.278
0.497TrpPhe: 0.497 ± 0.163
0.497TrpGly: 0.497 ± 0.186
0.083TrpHis: 0.083 ± 0.08
0.745TrpIle: 0.745 ± 0.189
0.662TrpLys: 0.662 ± 0.171
0.745TrpLeu: 0.745 ± 0.238
0.0TrpMet: 0.0 ± 0.0
0.579TrpAsn: 0.579 ± 0.156
0.662TrpPro: 0.662 ± 0.303
0.497TrpGln: 0.497 ± 0.201
0.662TrpArg: 0.662 ± 0.237
0.497TrpSer: 0.497 ± 0.201
0.083TrpThr: 0.083 ± 0.072
0.828TrpVal: 0.828 ± 0.344
0.166TrpTrp: 0.166 ± 0.114
0.166TrpTyr: 0.166 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.477TyrAla: 3.477 ± 0.429
0.497TyrCys: 0.497 ± 0.175
3.063TyrAsp: 3.063 ± 0.654
3.56TyrGlu: 3.56 ± 0.589
1.407TyrPhe: 1.407 ± 0.278
1.987TyrGly: 1.987 ± 0.436
0.497TyrHis: 0.497 ± 0.195
1.49TyrIle: 1.49 ± 0.414
3.146TyrLys: 3.146 ± 0.599
3.974TyrLeu: 3.974 ± 0.638
0.248TyrMet: 0.248 ± 0.128
1.656TyrAsn: 1.656 ± 0.438
0.911TyrPro: 0.911 ± 0.377
2.07TyrGln: 2.07 ± 0.438
2.07TyrArg: 2.07 ± 0.583
3.394TyrSer: 3.394 ± 0.685
2.07TyrThr: 2.07 ± 0.488
2.152TyrVal: 2.152 ± 0.485
0.166TyrTrp: 0.166 ± 0.101
1.738TyrTyr: 1.738 ± 0.417
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (12081 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski