Amino acid dipepetide frequency for Hafnia phage vB_HpaA_yong1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.204AlaAla: 10.204 ± 0.915
0.996AlaCys: 0.996 ± 0.279
5.392AlaAsp: 5.392 ± 0.745
5.807AlaGlu: 5.807 ± 0.988
3.401AlaPhe: 3.401 ± 0.421
7.052AlaGly: 7.052 ± 1.087
0.664AlaHis: 0.664 ± 0.206
5.392AlaIle: 5.392 ± 0.761
6.056AlaLys: 6.056 ± 0.711
6.886AlaLeu: 6.886 ± 0.862
2.738AlaMet: 2.738 ± 0.543
3.318AlaAsn: 3.318 ± 0.547
2.489AlaPro: 2.489 ± 0.619
3.152AlaGln: 3.152 ± 0.508
3.899AlaArg: 3.899 ± 0.473
4.812AlaSer: 4.812 ± 0.513
4.314AlaThr: 4.314 ± 0.677
5.973AlaVal: 5.973 ± 0.92
1.41AlaTrp: 1.41 ± 0.433
2.738AlaTyr: 2.738 ± 0.418
0.0AlaXaa: 0.0 ± 0.0
Cys
0.664CysAla: 0.664 ± 0.192
0.083CysCys: 0.083 ± 0.077
0.581CysAsp: 0.581 ± 0.35
0.581CysGlu: 0.581 ± 0.177
0.664CysPhe: 0.664 ± 0.254
0.996CysGly: 0.996 ± 0.338
0.498CysHis: 0.498 ± 0.227
0.166CysIle: 0.166 ± 0.117
0.581CysLys: 0.581 ± 0.283
1.078CysLeu: 1.078 ± 0.382
0.415CysMet: 0.415 ± 0.271
0.415CysAsn: 0.415 ± 0.192
0.498CysPro: 0.498 ± 0.227
0.166CysGln: 0.166 ± 0.125
0.996CysArg: 0.996 ± 0.363
0.664CysSer: 0.664 ± 0.267
0.249CysThr: 0.249 ± 0.146
0.664CysVal: 0.664 ± 0.213
0.249CysTrp: 0.249 ± 0.133
0.332CysTyr: 0.332 ± 0.194
0.0CysXaa: 0.0 ± 0.0
Asp
6.139AspAla: 6.139 ± 0.725
0.498AspCys: 0.498 ± 0.291
4.646AspAsp: 4.646 ± 0.718
3.982AspGlu: 3.982 ± 0.444
2.074AspPhe: 2.074 ± 0.434
6.139AspGly: 6.139 ± 0.73
1.41AspHis: 1.41 ± 0.358
2.904AspIle: 2.904 ± 0.461
3.567AspLys: 3.567 ± 0.543
5.558AspLeu: 5.558 ± 0.706
2.157AspMet: 2.157 ± 0.388
2.738AspAsn: 2.738 ± 0.466
2.904AspPro: 2.904 ± 0.676
1.742AspGln: 1.742 ± 0.36
2.323AspArg: 2.323 ± 0.468
3.733AspSer: 3.733 ± 0.411
4.231AspThr: 4.231 ± 0.538
4.563AspVal: 4.563 ± 0.621
0.996AspTrp: 0.996 ± 0.328
1.991AspTyr: 1.991 ± 0.329
0.0AspXaa: 0.0 ± 0.0
Glu
7.135GluAla: 7.135 ± 1.017
0.498GluCys: 0.498 ± 0.208
4.978GluAsp: 4.978 ± 0.783
5.144GluGlu: 5.144 ± 0.83
2.904GluPhe: 2.904 ± 0.525
5.061GluGly: 5.061 ± 0.693
0.913GluHis: 0.913 ± 0.284
2.738GluIle: 2.738 ± 0.411
2.821GluLys: 2.821 ± 0.484
6.305GluLeu: 6.305 ± 0.752
2.323GluMet: 2.323 ± 0.493
2.323GluAsn: 2.323 ± 0.422
2.157GluPro: 2.157 ± 0.362
2.821GluGln: 2.821 ± 0.52
4.314GluArg: 4.314 ± 0.695
4.065GluSer: 4.065 ± 0.584
3.65GluThr: 3.65 ± 0.455
3.816GluVal: 3.816 ± 0.623
1.327GluTrp: 1.327 ± 0.235
3.318GluTyr: 3.318 ± 0.527
0.0GluXaa: 0.0 ± 0.0
Phe
2.904PheAla: 2.904 ± 0.541
0.415PheCys: 0.415 ± 0.194
2.987PheAsp: 2.987 ± 0.418
1.991PheGlu: 1.991 ± 0.363
1.161PhePhe: 1.161 ± 0.366
2.406PheGly: 2.406 ± 0.449
0.83PheHis: 0.83 ± 0.206
1.825PheIle: 1.825 ± 0.438
3.318PheLys: 3.318 ± 0.55
2.821PheLeu: 2.821 ± 0.383
1.244PheMet: 1.244 ± 0.433
2.738PheAsn: 2.738 ± 0.45
1.244PhePro: 1.244 ± 0.306
0.996PheGln: 0.996 ± 0.273
1.659PheArg: 1.659 ± 0.338
2.406PheSer: 2.406 ± 0.362
2.074PheThr: 2.074 ± 0.326
2.489PheVal: 2.489 ± 0.485
0.498PheTrp: 0.498 ± 0.182
1.161PheTyr: 1.161 ± 0.276
0.0PheXaa: 0.0 ± 0.0
Gly
6.72GlyAla: 6.72 ± 1.053
0.415GlyCys: 0.415 ± 0.183
5.144GlyAsp: 5.144 ± 0.807
5.226GlyGlu: 5.226 ± 0.582
2.323GlyPhe: 2.323 ± 0.413
5.724GlyGly: 5.724 ± 0.681
1.078GlyHis: 1.078 ± 0.331
4.148GlyIle: 4.148 ± 0.473
5.89GlyLys: 5.89 ± 0.835
6.056GlyLeu: 6.056 ± 0.695
2.074GlyMet: 2.074 ± 0.439
3.401GlyAsn: 3.401 ± 0.723
1.41GlyPro: 1.41 ± 0.342
2.821GlyGln: 2.821 ± 0.411
5.392GlyArg: 5.392 ± 0.661
5.89GlySer: 5.89 ± 0.815
4.895GlyThr: 4.895 ± 0.624
5.724GlyVal: 5.724 ± 0.769
1.078GlyTrp: 1.078 ± 0.277
3.65GlyTyr: 3.65 ± 0.465
0.0GlyXaa: 0.0 ± 0.0
His
0.747HisAla: 0.747 ± 0.347
0.415HisCys: 0.415 ± 0.219
1.41HisAsp: 1.41 ± 0.392
1.493HisGlu: 1.493 ± 0.475
0.664HisPhe: 0.664 ± 0.29
1.244HisGly: 1.244 ± 0.37
0.498HisHis: 0.498 ± 0.21
1.161HisIle: 1.161 ± 0.281
1.327HisLys: 1.327 ± 0.302
1.742HisLeu: 1.742 ± 0.436
0.581HisMet: 0.581 ± 0.227
0.664HisAsn: 0.664 ± 0.216
0.415HisPro: 0.415 ± 0.173
0.581HisGln: 0.581 ± 0.191
0.913HisArg: 0.913 ± 0.222
0.83HisSer: 0.83 ± 0.214
1.078HisThr: 1.078 ± 0.301
0.913HisVal: 0.913 ± 0.262
0.332HisTrp: 0.332 ± 0.18
0.581HisTyr: 0.581 ± 0.228
0.0HisXaa: 0.0 ± 0.0
Ile
3.899IleAla: 3.899 ± 0.637
0.747IleCys: 0.747 ± 0.297
3.152IleAsp: 3.152 ± 0.456
3.484IleGlu: 3.484 ± 0.445
1.327IlePhe: 1.327 ± 0.364
3.982IleGly: 3.982 ± 0.564
1.078IleHis: 1.078 ± 0.279
1.991IleIle: 1.991 ± 0.426
3.484IleLys: 3.484 ± 0.451
3.318IleLeu: 3.318 ± 0.576
0.747IleMet: 0.747 ± 0.251
2.572IleAsn: 2.572 ± 0.547
1.825IlePro: 1.825 ± 0.524
1.742IleGln: 1.742 ± 0.419
2.904IleArg: 2.904 ± 0.443
2.323IleSer: 2.323 ± 0.48
3.65IleThr: 3.65 ± 0.74
4.314IleVal: 4.314 ± 0.48
0.498IleTrp: 0.498 ± 0.177
1.244IleTyr: 1.244 ± 0.287
0.0IleXaa: 0.0 ± 0.0
Lys
7.383LysAla: 7.383 ± 0.87
0.747LysCys: 0.747 ± 0.254
4.563LysAsp: 4.563 ± 0.684
4.148LysGlu: 4.148 ± 0.424
2.489LysPhe: 2.489 ± 0.551
3.733LysGly: 3.733 ± 0.541
1.41LysHis: 1.41 ± 0.422
2.157LysIle: 2.157 ± 0.479
4.48LysLys: 4.48 ± 0.864
5.309LysLeu: 5.309 ± 0.689
1.659LysMet: 1.659 ± 0.338
2.157LysAsn: 2.157 ± 0.403
2.406LysPro: 2.406 ± 0.507
2.24LysGln: 2.24 ± 0.623
4.231LysArg: 4.231 ± 0.659
4.314LysSer: 4.314 ± 0.49
3.816LysThr: 3.816 ± 0.474
5.309LysVal: 5.309 ± 0.77
1.078LysTrp: 1.078 ± 0.307
2.323LysTyr: 2.323 ± 0.376
0.0LysXaa: 0.0 ± 0.0
Leu
6.388LeuAla: 6.388 ± 0.827
0.498LeuCys: 0.498 ± 0.231
4.148LeuAsp: 4.148 ± 0.497
6.969LeuGlu: 6.969 ± 0.706
2.24LeuPhe: 2.24 ± 0.387
5.309LeuGly: 5.309 ± 0.766
0.913LeuHis: 0.913 ± 0.282
3.401LeuIle: 3.401 ± 0.587
6.886LeuLys: 6.886 ± 0.761
5.475LeuLeu: 5.475 ± 0.563
2.655LeuMet: 2.655 ± 0.476
4.729LeuAsn: 4.729 ± 0.689
3.401LeuPro: 3.401 ± 0.394
3.733LeuGln: 3.733 ± 0.599
5.226LeuArg: 5.226 ± 0.545
4.978LeuSer: 4.978 ± 0.626
5.724LeuThr: 5.724 ± 0.747
4.978LeuVal: 4.978 ± 0.604
0.913LeuTrp: 0.913 ± 0.283
2.406LeuTyr: 2.406 ± 0.619
0.0LeuXaa: 0.0 ± 0.0
Met
3.235MetAla: 3.235 ± 0.524
0.415MetCys: 0.415 ± 0.231
1.327MetAsp: 1.327 ± 0.337
1.825MetGlu: 1.825 ± 0.361
1.244MetPhe: 1.244 ± 0.379
1.991MetGly: 1.991 ± 0.424
0.415MetHis: 0.415 ± 0.185
1.161MetIle: 1.161 ± 0.212
1.078MetLys: 1.078 ± 0.297
2.655MetLeu: 2.655 ± 0.453
0.498MetMet: 0.498 ± 0.222
0.913MetAsn: 0.913 ± 0.217
0.996MetPro: 0.996 ± 0.318
0.664MetGln: 0.664 ± 0.269
1.41MetArg: 1.41 ± 0.4
2.24MetSer: 2.24 ± 0.428
1.908MetThr: 1.908 ± 0.465
2.572MetVal: 2.572 ± 0.437
0.332MetTrp: 0.332 ± 0.193
0.996MetTyr: 0.996 ± 0.276
0.0MetXaa: 0.0 ± 0.0
Asn
4.729AsnAla: 4.729 ± 0.601
0.747AsnCys: 0.747 ± 0.263
2.323AsnAsp: 2.323 ± 0.499
2.904AsnGlu: 2.904 ± 0.444
1.659AsnPhe: 1.659 ± 0.3
4.895AsnGly: 4.895 ± 0.695
0.498AsnHis: 0.498 ± 0.183
2.24AsnIle: 2.24 ± 0.376
2.406AsnLys: 2.406 ± 0.381
3.65AsnLeu: 3.65 ± 0.701
1.078AsnMet: 1.078 ± 0.272
2.406AsnAsn: 2.406 ± 0.497
2.904AsnPro: 2.904 ± 0.571
1.244AsnGln: 1.244 ± 0.291
2.655AsnArg: 2.655 ± 0.49
2.406AsnSer: 2.406 ± 0.37
1.991AsnThr: 1.991 ± 0.542
3.235AsnVal: 3.235 ± 0.538
0.166AsnTrp: 0.166 ± 0.134
1.576AsnTyr: 1.576 ± 0.345
0.0AsnXaa: 0.0 ± 0.0
Pro
2.987ProAla: 2.987 ± 0.502
0.332ProCys: 0.332 ± 0.198
2.323ProAsp: 2.323 ± 0.509
2.655ProGlu: 2.655 ± 0.593
1.244ProPhe: 1.244 ± 0.245
1.825ProGly: 1.825 ± 0.322
0.664ProHis: 0.664 ± 0.223
1.825ProIle: 1.825 ± 0.341
3.318ProLys: 3.318 ± 0.563
2.24ProLeu: 2.24 ± 0.45
0.913ProMet: 0.913 ± 0.25
2.24ProAsn: 2.24 ± 0.375
0.747ProPro: 0.747 ± 0.248
1.493ProGln: 1.493 ± 0.332
1.576ProArg: 1.576 ± 0.369
2.572ProSer: 2.572 ± 0.313
2.655ProThr: 2.655 ± 0.403
2.987ProVal: 2.987 ± 0.372
0.83ProTrp: 0.83 ± 0.23
0.913ProTyr: 0.913 ± 0.181
0.0ProXaa: 0.0 ± 0.0
Gln
3.65GlnAla: 3.65 ± 0.516
0.415GlnCys: 0.415 ± 0.187
2.655GlnAsp: 2.655 ± 0.707
2.406GlnGlu: 2.406 ± 0.419
1.825GlnPhe: 1.825 ± 0.283
2.738GlnGly: 2.738 ± 0.525
0.498GlnHis: 0.498 ± 0.22
1.244GlnIle: 1.244 ± 0.326
1.825GlnLys: 1.825 ± 0.337
3.733GlnLeu: 3.733 ± 0.619
1.41GlnMet: 1.41 ± 0.351
1.576GlnAsn: 1.576 ± 0.403
0.996GlnPro: 0.996 ± 0.313
1.742GlnGln: 1.742 ± 0.566
2.406GlnArg: 2.406 ± 0.605
2.738GlnSer: 2.738 ± 0.469
2.074GlnThr: 2.074 ± 0.394
2.821GlnVal: 2.821 ± 0.388
0.664GlnTrp: 0.664 ± 0.232
1.078GlnTyr: 1.078 ± 0.315
0.0GlnXaa: 0.0 ± 0.0
Arg
4.148ArgAla: 4.148 ± 0.802
0.581ArgCys: 0.581 ± 0.211
4.314ArgAsp: 4.314 ± 0.446
4.48ArgGlu: 4.48 ± 0.65
2.904ArgPhe: 2.904 ± 0.425
4.314ArgGly: 4.314 ± 0.464
0.913ArgHis: 0.913 ± 0.379
3.567ArgIle: 3.567 ± 0.616
3.07ArgLys: 3.07 ± 0.53
5.973ArgLeu: 5.973 ± 0.653
1.078ArgMet: 1.078 ± 0.318
2.572ArgAsn: 2.572 ± 0.486
1.742ArgPro: 1.742 ± 0.329
2.489ArgGln: 2.489 ± 0.44
2.738ArgArg: 2.738 ± 0.447
3.152ArgSer: 3.152 ± 0.509
2.157ArgThr: 2.157 ± 0.339
2.489ArgVal: 2.489 ± 0.554
0.747ArgTrp: 0.747 ± 0.22
1.576ArgTyr: 1.576 ± 0.308
0.0ArgXaa: 0.0 ± 0.0
Ser
3.982SerAla: 3.982 ± 0.547
1.244SerCys: 1.244 ± 0.466
5.061SerAsp: 5.061 ± 0.538
3.235SerGlu: 3.235 ± 0.629
2.406SerPhe: 2.406 ± 0.375
5.724SerGly: 5.724 ± 0.892
2.24SerHis: 2.24 ± 0.43
3.07SerIle: 3.07 ± 0.646
3.484SerLys: 3.484 ± 0.423
3.235SerLeu: 3.235 ± 0.572
1.659SerMet: 1.659 ± 0.356
2.904SerAsn: 2.904 ± 0.553
2.904SerPro: 2.904 ± 0.434
2.323SerGln: 2.323 ± 0.52
2.987SerArg: 2.987 ± 0.578
3.567SerSer: 3.567 ± 0.645
2.738SerThr: 2.738 ± 0.493
4.563SerVal: 4.563 ± 0.619
0.747SerTrp: 0.747 ± 0.196
2.572SerTyr: 2.572 ± 0.577
0.0SerXaa: 0.0 ± 0.0
Thr
3.401ThrAla: 3.401 ± 0.638
0.664ThrCys: 0.664 ± 0.285
2.904ThrAsp: 2.904 ± 0.452
4.895ThrGlu: 4.895 ± 0.603
2.406ThrPhe: 2.406 ± 0.467
5.724ThrGly: 5.724 ± 0.632
0.581ThrHis: 0.581 ± 0.196
4.148ThrIle: 4.148 ± 0.767
3.401ThrLys: 3.401 ± 0.462
4.978ThrLeu: 4.978 ± 0.498
1.576ThrMet: 1.576 ± 0.285
2.074ThrAsn: 2.074 ± 0.406
2.904ThrPro: 2.904 ± 0.394
2.821ThrGln: 2.821 ± 0.505
2.157ThrArg: 2.157 ± 0.464
2.655ThrSer: 2.655 ± 0.7
3.07ThrThr: 3.07 ± 0.526
4.729ThrVal: 4.729 ± 0.503
0.581ThrTrp: 0.581 ± 0.177
1.327ThrTyr: 1.327 ± 0.234
0.0ThrXaa: 0.0 ± 0.0
Val
4.812ValAla: 4.812 ± 0.666
0.498ValCys: 0.498 ± 0.231
3.401ValAsp: 3.401 ± 0.526
4.812ValGlu: 4.812 ± 0.789
2.406ValPhe: 2.406 ± 0.528
6.222ValGly: 6.222 ± 0.711
1.576ValHis: 1.576 ± 0.535
3.152ValIle: 3.152 ± 0.584
5.641ValLys: 5.641 ± 0.732
5.475ValLeu: 5.475 ± 0.847
1.825ValMet: 1.825 ± 0.539
3.567ValAsn: 3.567 ± 0.517
2.987ValPro: 2.987 ± 0.501
3.235ValGln: 3.235 ± 0.47
4.065ValArg: 4.065 ± 0.559
4.397ValSer: 4.397 ± 0.518
4.231ValThr: 4.231 ± 0.499
6.305ValVal: 6.305 ± 0.992
0.83ValTrp: 0.83 ± 0.299
2.406ValTyr: 2.406 ± 0.42
0.0ValXaa: 0.0 ± 0.0
Trp
0.498TrpAla: 0.498 ± 0.159
0.083TrpCys: 0.083 ± 0.089
0.747TrpAsp: 0.747 ± 0.242
0.913TrpGlu: 0.913 ± 0.254
0.664TrpPhe: 0.664 ± 0.243
0.83TrpGly: 0.83 ± 0.25
0.332TrpHis: 0.332 ± 0.186
0.415TrpIle: 0.415 ± 0.202
1.327TrpLys: 1.327 ± 0.328
1.991TrpLeu: 1.991 ± 0.396
0.166TrpMet: 0.166 ± 0.106
0.747TrpAsn: 0.747 ± 0.217
0.249TrpPro: 0.249 ± 0.133
0.664TrpGln: 0.664 ± 0.273
0.747TrpArg: 0.747 ± 0.263
1.161TrpSer: 1.161 ± 0.492
0.664TrpThr: 0.664 ± 0.196
1.244TrpVal: 1.244 ± 0.346
0.166TrpTrp: 0.166 ± 0.139
0.415TrpTyr: 0.415 ± 0.178
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.07TyrAla: 3.07 ± 0.59
0.249TyrCys: 0.249 ± 0.149
2.24TyrAsp: 2.24 ± 0.409
1.742TyrGlu: 1.742 ± 0.436
1.161TyrPhe: 1.161 ± 0.261
3.152TyrGly: 3.152 ± 0.471
0.581TyrHis: 0.581 ± 0.277
1.576TyrIle: 1.576 ± 0.505
1.991TyrLys: 1.991 ± 0.402
2.489TyrLeu: 2.489 ± 0.381
0.996TyrMet: 0.996 ± 0.311
1.659TyrAsn: 1.659 ± 0.436
1.161TyrPro: 1.161 ± 0.355
1.825TyrGln: 1.825 ± 0.545
2.406TyrArg: 2.406 ± 0.48
1.742TyrSer: 1.742 ± 0.333
1.825TyrThr: 1.825 ± 0.315
2.157TyrVal: 2.157 ± 0.436
0.581TyrTrp: 0.581 ± 0.193
1.327TyrTyr: 1.327 ± 0.291
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (12055 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski