Amino acid dipepetide frequency for Haemophilus phage Aaphi23

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.829AlaAla: 5.829 ± 1.062
0.812AlaCys: 0.812 ± 0.235
6.05AlaAsp: 6.05 ± 0.728
5.681AlaGlu: 5.681 ± 0.609
3.837AlaPhe: 3.837 ± 0.611
3.911AlaGly: 3.911 ± 0.549
1.476AlaHis: 1.476 ± 0.296
6.345AlaIle: 6.345 ± 0.676
6.419AlaLys: 6.419 ± 0.937
7.969AlaLeu: 7.969 ± 0.793
2.14AlaMet: 2.14 ± 0.45
5.386AlaAsn: 5.386 ± 0.767
2.14AlaPro: 2.14 ± 0.377
4.058AlaGln: 4.058 ± 0.609
3.542AlaArg: 3.542 ± 0.59
2.656AlaSer: 2.656 ± 0.404
4.796AlaThr: 4.796 ± 0.661
4.87AlaVal: 4.87 ± 0.782
1.107AlaTrp: 1.107 ± 0.301
2.73AlaTyr: 2.73 ± 0.392
0.0AlaXaa: 0.0 ± 0.0
Cys
0.738CysAla: 0.738 ± 0.259
0.0CysCys: 0.0 ± 0.0
0.59CysAsp: 0.59 ± 0.226
0.516CysGlu: 0.516 ± 0.191
0.443CysPhe: 0.443 ± 0.187
0.59CysGly: 0.59 ± 0.207
0.221CysHis: 0.221 ± 0.123
0.516CysIle: 0.516 ± 0.224
0.59CysLys: 0.59 ± 0.208
0.738CysLeu: 0.738 ± 0.195
0.074CysMet: 0.074 ± 0.074
0.295CysAsn: 0.295 ± 0.153
0.148CysPro: 0.148 ± 0.13
0.074CysGln: 0.074 ± 0.063
0.664CysArg: 0.664 ± 0.273
0.812CysSer: 0.812 ± 0.226
0.295CysThr: 0.295 ± 0.143
1.181CysVal: 1.181 ± 0.269
0.221CysTrp: 0.221 ± 0.127
0.369CysTyr: 0.369 ± 0.193
0.0CysXaa: 0.0 ± 0.0
Asp
4.575AspAla: 4.575 ± 0.722
0.516AspCys: 0.516 ± 0.171
3.025AspAsp: 3.025 ± 0.469
4.87AspGlu: 4.87 ± 0.918
3.468AspPhe: 3.468 ± 0.369
4.206AspGly: 4.206 ± 0.548
0.664AspHis: 0.664 ± 0.248
3.911AspIle: 3.911 ± 0.633
4.722AspLys: 4.722 ± 0.667
5.755AspLeu: 5.755 ± 0.68
1.181AspMet: 1.181 ± 0.281
2.582AspAsn: 2.582 ± 0.605
1.771AspPro: 1.771 ± 0.341
1.845AspGln: 1.845 ± 0.358
1.918AspArg: 1.918 ± 0.36
2.878AspSer: 2.878 ± 0.482
2.509AspThr: 2.509 ± 0.446
4.279AspVal: 4.279 ± 0.445
1.402AspTrp: 1.402 ± 0.34
1.918AspTyr: 1.918 ± 0.368
0.0AspXaa: 0.0 ± 0.0
Glu
4.427GluAla: 4.427 ± 0.659
0.812GluCys: 0.812 ± 0.272
3.247GluAsp: 3.247 ± 0.461
3.025GluGlu: 3.025 ± 0.717
2.14GluPhe: 2.14 ± 0.371
2.582GluGly: 2.582 ± 0.371
1.549GluHis: 1.549 ± 0.34
5.608GluIle: 5.608 ± 0.517
6.493GluLys: 6.493 ± 0.706
7.083GluLeu: 7.083 ± 0.733
2.361GluMet: 2.361 ± 0.496
4.279GluAsn: 4.279 ± 0.535
1.771GluPro: 1.771 ± 0.397
4.648GluGln: 4.648 ± 0.732
3.542GluArg: 3.542 ± 0.52
3.542GluSer: 3.542 ± 0.551
3.173GluThr: 3.173 ± 0.453
2.73GluVal: 2.73 ± 0.404
0.885GluTrp: 0.885 ± 0.268
2.287GluTyr: 2.287 ± 0.345
0.0GluXaa: 0.0 ± 0.0
Phe
3.099PheAla: 3.099 ± 0.749
0.369PheCys: 0.369 ± 0.162
3.763PheAsp: 3.763 ± 0.489
2.73PheGlu: 2.73 ± 0.476
1.697PhePhe: 1.697 ± 0.377
3.468PheGly: 3.468 ± 0.47
0.516PheHis: 0.516 ± 0.182
3.763PheIle: 3.763 ± 0.514
2.361PheLys: 2.361 ± 0.462
3.394PheLeu: 3.394 ± 0.516
0.664PheMet: 0.664 ± 0.218
2.951PheAsn: 2.951 ± 0.506
0.959PhePro: 0.959 ± 0.326
0.664PheGln: 0.664 ± 0.194
1.623PheArg: 1.623 ± 0.355
2.878PheSer: 2.878 ± 0.476
3.173PheThr: 3.173 ± 0.397
2.656PheVal: 2.656 ± 0.446
0.443PheTrp: 0.443 ± 0.17
1.328PheTyr: 1.328 ± 0.302
0.0PheXaa: 0.0 ± 0.0
Gly
5.165GlyAla: 5.165 ± 0.709
0.443GlyCys: 0.443 ± 0.188
3.911GlyAsp: 3.911 ± 0.575
4.353GlyGlu: 4.353 ± 0.613
3.763GlyPhe: 3.763 ± 0.537
5.165GlyGly: 5.165 ± 0.754
1.033GlyHis: 1.033 ± 0.222
3.837GlyIle: 3.837 ± 0.415
5.608GlyLys: 5.608 ± 0.686
6.198GlyLeu: 6.198 ± 0.787
1.697GlyMet: 1.697 ± 0.347
3.173GlyAsn: 3.173 ± 0.499
0.148GlyPro: 0.148 ± 0.102
2.582GlyGln: 2.582 ± 0.393
2.582GlyArg: 2.582 ± 0.422
3.173GlySer: 3.173 ± 0.475
3.099GlyThr: 3.099 ± 0.441
4.575GlyVal: 4.575 ± 0.594
0.812GlyTrp: 0.812 ± 0.217
2.361GlyTyr: 2.361 ± 0.45
0.0GlyXaa: 0.0 ± 0.0
His
1.254HisAla: 1.254 ± 0.292
0.221HisCys: 0.221 ± 0.127
0.959HisAsp: 0.959 ± 0.261
1.033HisGlu: 1.033 ± 0.301
0.59HisPhe: 0.59 ± 0.206
1.181HisGly: 1.181 ± 0.278
0.369HisHis: 0.369 ± 0.183
1.107HisIle: 1.107 ± 0.244
1.107HisLys: 1.107 ± 0.252
1.918HisLeu: 1.918 ± 0.419
0.148HisMet: 0.148 ± 0.103
0.516HisAsn: 0.516 ± 0.231
0.59HisPro: 0.59 ± 0.248
0.812HisGln: 0.812 ± 0.218
0.738HisArg: 0.738 ± 0.23
1.328HisSer: 1.328 ± 0.343
1.107HisThr: 1.107 ± 0.249
0.664HisVal: 0.664 ± 0.225
0.074HisTrp: 0.074 ± 0.075
0.295HisTyr: 0.295 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
5.312IleAla: 5.312 ± 0.637
0.516IleCys: 0.516 ± 0.206
4.501IleAsp: 4.501 ± 0.468
5.46IleGlu: 5.46 ± 0.661
2.361IlePhe: 2.361 ± 0.43
5.829IleGly: 5.829 ± 0.707
1.033IleHis: 1.033 ± 0.239
3.32IleIle: 3.32 ± 0.582
6.272IleLys: 6.272 ± 0.69
4.575IleLeu: 4.575 ± 0.462
0.885IleMet: 0.885 ± 0.25
3.689IleAsn: 3.689 ± 0.517
1.992IlePro: 1.992 ± 0.39
3.173IleGln: 3.173 ± 0.48
3.099IleArg: 3.099 ± 0.61
4.796IleSer: 4.796 ± 0.588
4.058IleThr: 4.058 ± 0.548
4.058IleVal: 4.058 ± 0.576
0.812IleTrp: 0.812 ± 0.261
2.435IleTyr: 2.435 ± 0.368
0.0IleXaa: 0.0 ± 0.0
Lys
6.862LysAla: 6.862 ± 0.749
0.959LysCys: 0.959 ± 0.29
3.763LysAsp: 3.763 ± 0.542
4.575LysGlu: 4.575 ± 0.611
2.73LysPhe: 2.73 ± 0.413
3.837LysGly: 3.837 ± 0.505
1.402LysHis: 1.402 ± 0.284
5.312LysIle: 5.312 ± 0.609
5.091LysLys: 5.091 ± 0.808
6.936LysLeu: 6.936 ± 0.82
1.918LysMet: 1.918 ± 0.418
3.984LysAsn: 3.984 ± 0.429
3.542LysPro: 3.542 ± 0.56
4.427LysGln: 4.427 ± 0.483
3.247LysArg: 3.247 ± 0.516
5.534LysSer: 5.534 ± 0.607
5.386LysThr: 5.386 ± 0.687
4.353LysVal: 4.353 ± 0.472
1.254LysTrp: 1.254 ± 0.262
2.582LysTyr: 2.582 ± 0.478
0.0LysXaa: 0.0 ± 0.0
Leu
7.747LeuAla: 7.747 ± 0.707
0.369LeuCys: 0.369 ± 0.212
5.091LeuAsp: 5.091 ± 0.702
6.714LeuGlu: 6.714 ± 0.833
2.951LeuPhe: 2.951 ± 0.422
5.681LeuGly: 5.681 ± 0.82
1.328LeuHis: 1.328 ± 0.292
5.386LeuIle: 5.386 ± 0.73
6.936LeuLys: 6.936 ± 0.816
9.371LeuLeu: 9.371 ± 0.935
1.992LeuMet: 1.992 ± 0.434
5.608LeuAsn: 5.608 ± 0.587
3.468LeuPro: 3.468 ± 0.508
3.025LeuGln: 3.025 ± 0.517
4.944LeuArg: 4.944 ± 0.575
8.411LeuSer: 8.411 ± 0.846
6.345LeuThr: 6.345 ± 0.677
4.796LeuVal: 4.796 ± 0.568
1.107LeuTrp: 1.107 ± 0.306
2.287LeuTyr: 2.287 ± 0.378
0.0LeuXaa: 0.0 ± 0.0
Met
2.361MetAla: 2.361 ± 0.445
0.0MetCys: 0.0 ± 0.0
0.885MetAsp: 0.885 ± 0.193
1.549MetGlu: 1.549 ± 0.357
1.033MetPhe: 1.033 ± 0.238
0.812MetGly: 0.812 ± 0.266
0.369MetHis: 0.369 ± 0.166
1.549MetIle: 1.549 ± 0.424
1.107MetLys: 1.107 ± 0.257
2.435MetLeu: 2.435 ± 0.361
0.221MetMet: 0.221 ± 0.134
0.959MetAsn: 0.959 ± 0.263
0.59MetPro: 0.59 ± 0.157
1.181MetGln: 1.181 ± 0.289
0.885MetArg: 0.885 ± 0.258
2.287MetSer: 2.287 ± 0.462
1.771MetThr: 1.771 ± 0.332
1.254MetVal: 1.254 ± 0.305
0.295MetTrp: 0.295 ± 0.133
0.516MetTyr: 0.516 ± 0.214
0.0MetXaa: 0.0 ± 0.0
Asn
5.165AsnAla: 5.165 ± 0.587
0.516AsnCys: 0.516 ± 0.216
2.73AsnAsp: 2.73 ± 0.412
3.025AsnGlu: 3.025 ± 0.522
1.771AsnPhe: 1.771 ± 0.357
4.353AsnGly: 4.353 ± 0.529
0.959AsnHis: 0.959 ± 0.272
3.542AsnIle: 3.542 ± 0.523
3.247AsnLys: 3.247 ± 0.45
4.944AsnLeu: 4.944 ± 0.519
1.623AsnMet: 1.623 ± 0.346
2.951AsnAsn: 2.951 ± 0.428
1.771AsnPro: 1.771 ± 0.401
3.173AsnGln: 3.173 ± 0.55
1.918AsnArg: 1.918 ± 0.423
3.468AsnSer: 3.468 ± 0.543
3.025AsnThr: 3.025 ± 0.544
2.435AsnVal: 2.435 ± 0.473
0.664AsnTrp: 0.664 ± 0.262
1.328AsnTyr: 1.328 ± 0.349
0.0AsnXaa: 0.0 ± 0.0
Pro
2.361ProAla: 2.361 ± 0.51
0.295ProCys: 0.295 ± 0.169
1.697ProAsp: 1.697 ± 0.505
2.878ProGlu: 2.878 ± 0.439
1.181ProPhe: 1.181 ± 0.349
0.516ProGly: 0.516 ± 0.157
0.664ProHis: 0.664 ± 0.227
2.361ProIle: 2.361 ± 0.408
2.951ProLys: 2.951 ± 0.392
2.804ProLeu: 2.804 ± 0.471
0.664ProMet: 0.664 ± 0.195
1.992ProAsn: 1.992 ± 0.405
0.738ProPro: 0.738 ± 0.236
1.254ProGln: 1.254 ± 0.284
1.328ProArg: 1.328 ± 0.289
1.918ProSer: 1.918 ± 0.467
1.328ProThr: 1.328 ± 0.353
1.771ProVal: 1.771 ± 0.337
0.664ProTrp: 0.664 ± 0.232
0.369ProTyr: 0.369 ± 0.168
0.0ProXaa: 0.0 ± 0.0
Gln
4.353GlnAla: 4.353 ± 0.551
0.369GlnCys: 0.369 ± 0.152
1.992GlnAsp: 1.992 ± 0.451
2.878GlnGlu: 2.878 ± 0.458
1.697GlnPhe: 1.697 ± 0.434
2.287GlnGly: 2.287 ± 0.419
0.885GlnHis: 0.885 ± 0.289
3.615GlnIle: 3.615 ± 0.552
3.025GlnLys: 3.025 ± 0.327
4.501GlnLeu: 4.501 ± 0.594
1.623GlnMet: 1.623 ± 0.371
2.14GlnAsn: 2.14 ± 0.431
1.697GlnPro: 1.697 ± 0.351
3.173GlnGln: 3.173 ± 0.629
1.771GlnArg: 1.771 ± 0.362
3.025GlnSer: 3.025 ± 0.469
2.951GlnThr: 2.951 ± 0.454
2.435GlnVal: 2.435 ± 0.512
0.738GlnTrp: 0.738 ± 0.215
1.549GlnTyr: 1.549 ± 0.423
0.0GlnXaa: 0.0 ± 0.0
Arg
2.509ArgAla: 2.509 ± 0.427
0.885ArgCys: 0.885 ± 0.232
2.804ArgAsp: 2.804 ± 0.529
2.361ArgGlu: 2.361 ± 0.47
2.951ArgPhe: 2.951 ± 0.424
3.247ArgGly: 3.247 ± 0.463
0.812ArgHis: 0.812 ± 0.246
3.32ArgIle: 3.32 ± 0.518
3.542ArgLys: 3.542 ± 0.526
4.206ArgLeu: 4.206 ± 0.628
0.959ArgMet: 0.959 ± 0.238
2.804ArgAsn: 2.804 ± 0.454
1.107ArgPro: 1.107 ± 0.26
1.549ArgGln: 1.549 ± 0.347
3.025ArgArg: 3.025 ± 0.446
2.287ArgSer: 2.287 ± 0.464
2.14ArgThr: 2.14 ± 0.462
3.394ArgVal: 3.394 ± 0.505
1.107ArgTrp: 1.107 ± 0.279
1.623ArgTyr: 1.623 ± 0.316
0.0ArgXaa: 0.0 ± 0.0
Ser
5.091SerAla: 5.091 ± 0.665
0.812SerCys: 0.812 ± 0.197
4.648SerAsp: 4.648 ± 0.524
4.279SerGlu: 4.279 ± 0.51
2.509SerPhe: 2.509 ± 0.43
5.091SerGly: 5.091 ± 0.699
1.033SerHis: 1.033 ± 0.23
3.542SerIle: 3.542 ± 0.38
4.722SerLys: 4.722 ± 0.59
5.681SerLeu: 5.681 ± 0.683
1.254SerMet: 1.254 ± 0.241
3.025SerAsn: 3.025 ± 0.482
1.918SerPro: 1.918 ± 0.353
2.287SerGln: 2.287 ± 0.395
3.173SerArg: 3.173 ± 0.418
3.247SerSer: 3.247 ± 0.654
3.173SerThr: 3.173 ± 0.491
3.984SerVal: 3.984 ± 0.581
0.885SerTrp: 0.885 ± 0.258
1.845SerTyr: 1.845 ± 0.485
0.0SerXaa: 0.0 ± 0.0
Thr
5.977ThrAla: 5.977 ± 0.704
0.074ThrCys: 0.074 ± 0.061
3.025ThrAsp: 3.025 ± 0.334
3.615ThrGlu: 3.615 ± 0.444
1.845ThrPhe: 1.845 ± 0.358
4.501ThrGly: 4.501 ± 0.51
0.443ThrHis: 0.443 ± 0.245
4.722ThrIle: 4.722 ± 0.634
4.575ThrLys: 4.575 ± 0.616
5.681ThrLeu: 5.681 ± 0.773
1.033ThrMet: 1.033 ± 0.347
1.918ThrAsn: 1.918 ± 0.363
2.656ThrPro: 2.656 ± 0.371
3.468ThrGln: 3.468 ± 0.485
2.435ThrArg: 2.435 ± 0.413
2.509ThrSer: 2.509 ± 0.492
3.099ThrThr: 3.099 ± 0.532
4.206ThrVal: 4.206 ± 0.559
0.664ThrTrp: 0.664 ± 0.217
1.845ThrTyr: 1.845 ± 0.389
0.0ThrXaa: 0.0 ± 0.0
Val
5.165ValAla: 5.165 ± 0.802
0.664ValCys: 0.664 ± 0.196
3.394ValAsp: 3.394 ± 0.452
3.911ValGlu: 3.911 ± 0.482
3.099ValPhe: 3.099 ± 0.542
3.763ValGly: 3.763 ± 0.49
0.295ValHis: 0.295 ± 0.149
4.132ValIle: 4.132 ± 0.469
5.165ValLys: 5.165 ± 0.514
5.091ValLeu: 5.091 ± 0.67
1.107ValMet: 1.107 ± 0.313
2.435ValAsn: 2.435 ± 0.467
1.549ValPro: 1.549 ± 0.267
3.025ValGln: 3.025 ± 0.494
3.025ValArg: 3.025 ± 0.484
4.132ValSer: 4.132 ± 0.508
4.279ValThr: 4.279 ± 0.618
3.394ValVal: 3.394 ± 0.476
0.369ValTrp: 0.369 ± 0.169
1.697ValTyr: 1.697 ± 0.388
0.0ValXaa: 0.0 ± 0.0
Trp
1.181TrpAla: 1.181 ± 0.265
0.148TrpCys: 0.148 ± 0.11
0.443TrpAsp: 0.443 ± 0.2
0.664TrpGlu: 0.664 ± 0.216
1.107TrpPhe: 1.107 ± 0.254
0.959TrpGly: 0.959 ± 0.342
0.221TrpHis: 0.221 ± 0.134
0.885TrpIle: 0.885 ± 0.3
1.254TrpLys: 1.254 ± 0.306
1.623TrpLeu: 1.623 ± 0.461
0.0TrpMet: 0.0 ± 0.0
0.369TrpAsn: 0.369 ± 0.144
0.0TrpPro: 0.0 ± 0.0
0.516TrpGln: 0.516 ± 0.186
1.033TrpArg: 1.033 ± 0.247
1.107TrpSer: 1.107 ± 0.305
0.516TrpThr: 0.516 ± 0.188
1.107TrpVal: 1.107 ± 0.286
0.221TrpTrp: 0.221 ± 0.142
0.738TrpTyr: 0.738 ± 0.253
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.582TyrAla: 2.582 ± 0.524
0.295TyrCys: 0.295 ± 0.154
1.476TyrAsp: 1.476 ± 0.395
2.14TyrGlu: 2.14 ± 0.425
1.328TyrPhe: 1.328 ± 0.375
1.697TyrGly: 1.697 ± 0.311
0.738TyrHis: 0.738 ± 0.195
1.402TyrIle: 1.402 ± 0.293
2.509TyrLys: 2.509 ± 0.437
2.656TyrLeu: 2.656 ± 0.465
0.443TyrMet: 0.443 ± 0.173
1.549TyrAsn: 1.549 ± 0.33
1.181TyrPro: 1.181 ± 0.267
1.771TyrGln: 1.771 ± 0.401
2.14TyrArg: 2.14 ± 0.412
2.214TyrSer: 2.214 ± 0.483
2.066TyrThr: 2.066 ± 0.451
1.549TyrVal: 1.549 ± 0.438
0.369TyrTrp: 0.369 ± 0.154
1.254TyrTyr: 1.254 ± 0.368
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 65 proteins (13554 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski