Amino acid dipepetide frequency for Enterobacteria phage 9g

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.251AlaAla: 9.251 ± 1.878
0.825AlaCys: 0.825 ± 0.219
4.949AlaAsp: 4.949 ± 0.742
5.421AlaGlu: 5.421 ± 0.715
2.593AlaPhe: 2.593 ± 0.41
5.774AlaGly: 5.774 ± 0.774
1.355AlaHis: 1.355 ± 0.37
5.951AlaIle: 5.951 ± 0.714
5.597AlaLys: 5.597 ± 0.841
5.833AlaLeu: 5.833 ± 0.763
2.946AlaMet: 2.946 ± 0.355
4.183AlaAsn: 4.183 ± 0.576
2.651AlaPro: 2.651 ± 0.366
4.949AlaGln: 4.949 ± 0.98
3.476AlaArg: 3.476 ± 0.518
4.773AlaSer: 4.773 ± 1.207
5.48AlaThr: 5.48 ± 1.017
4.655AlaVal: 4.655 ± 0.444
1.061AlaTrp: 1.061 ± 0.258
3.417AlaTyr: 3.417 ± 0.472
0.0AlaXaa: 0.0 ± 0.0
Cys
0.589CysAla: 0.589 ± 0.177
0.177CysCys: 0.177 ± 0.109
0.943CysAsp: 0.943 ± 0.21
0.884CysGlu: 0.884 ± 0.232
0.589CysPhe: 0.589 ± 0.143
1.296CysGly: 1.296 ± 0.298
0.354CysHis: 0.354 ± 0.168
1.061CysIle: 1.061 ± 0.313
1.237CysLys: 1.237 ± 0.247
0.707CysLeu: 0.707 ± 0.195
0.236CysMet: 0.236 ± 0.115
0.707CysAsn: 0.707 ± 0.234
0.648CysPro: 0.648 ± 0.225
0.412CysGln: 0.412 ± 0.131
0.589CysArg: 0.589 ± 0.178
0.53CysSer: 0.53 ± 0.159
0.825CysThr: 0.825 ± 0.257
1.119CysVal: 1.119 ± 0.263
0.177CysTrp: 0.177 ± 0.087
0.53CysTyr: 0.53 ± 0.182
0.0CysXaa: 0.0 ± 0.0
Asp
5.008AspAla: 5.008 ± 0.584
0.53AspCys: 0.53 ± 0.212
3.241AspAsp: 3.241 ± 0.451
3.948AspGlu: 3.948 ± 0.542
2.239AspPhe: 2.239 ± 0.34
5.951AspGly: 5.951 ± 0.567
1.355AspHis: 1.355 ± 0.228
4.242AspIle: 4.242 ± 0.489
3.182AspLys: 3.182 ± 0.409
4.714AspLeu: 4.714 ± 0.566
1.237AspMet: 1.237 ± 0.248
2.475AspAsn: 2.475 ± 0.397
1.709AspPro: 1.709 ± 0.303
1.885AspGln: 1.885 ± 0.282
2.003AspArg: 2.003 ± 0.329
3.358AspSer: 3.358 ± 0.519
3.476AspThr: 3.476 ± 0.503
4.301AspVal: 4.301 ± 0.38
0.766AspTrp: 0.766 ± 0.232
2.475AspTyr: 2.475 ± 0.435
0.0AspXaa: 0.0 ± 0.0
Glu
5.126GluAla: 5.126 ± 0.609
1.002GluCys: 1.002 ± 0.257
3.948GluAsp: 3.948 ± 0.453
4.949GluGlu: 4.949 ± 0.507
2.357GluPhe: 2.357 ± 0.369
4.596GluGly: 4.596 ± 0.501
0.884GluHis: 0.884 ± 0.224
4.242GluIle: 4.242 ± 0.415
3.241GluLys: 3.241 ± 0.655
5.421GluLeu: 5.421 ± 0.721
2.651GluMet: 2.651 ± 0.525
3.182GluAsn: 3.182 ± 0.398
2.121GluPro: 2.121 ± 0.444
3.83GluGln: 3.83 ± 0.67
4.714GluArg: 4.714 ± 0.449
3.653GluSer: 3.653 ± 0.5
3.123GluThr: 3.123 ± 0.47
4.301GluVal: 4.301 ± 0.682
0.53GluTrp: 0.53 ± 0.247
2.357GluTyr: 2.357 ± 0.485
0.0GluXaa: 0.0 ± 0.0
Phe
2.593PheAla: 2.593 ± 0.364
0.648PheCys: 0.648 ± 0.177
3.005PheAsp: 3.005 ± 0.478
1.709PheGlu: 1.709 ± 0.316
0.943PhePhe: 0.943 ± 0.213
2.475PheGly: 2.475 ± 0.428
1.061PheHis: 1.061 ± 0.218
2.593PheIle: 2.593 ± 0.432
2.121PheLys: 2.121 ± 0.411
2.003PheLeu: 2.003 ± 0.346
1.061PheMet: 1.061 ± 0.242
2.769PheAsn: 2.769 ± 0.441
1.473PhePro: 1.473 ± 0.286
1.061PheGln: 1.061 ± 0.262
2.121PheArg: 2.121 ± 0.415
2.121PheSer: 2.121 ± 0.409
2.062PheThr: 2.062 ± 0.303
1.885PheVal: 1.885 ± 0.35
0.295PheTrp: 0.295 ± 0.142
1.061PheTyr: 1.061 ± 0.301
0.0PheXaa: 0.0 ± 0.0
Gly
5.244GlyAla: 5.244 ± 0.969
0.825GlyCys: 0.825 ± 0.285
3.535GlyAsp: 3.535 ± 0.351
4.596GlyGlu: 4.596 ± 0.622
3.241GlyPhe: 3.241 ± 0.469
4.831GlyGly: 4.831 ± 0.572
1.709GlyHis: 1.709 ± 0.341
3.712GlyIle: 3.712 ± 0.484
4.537GlyLys: 4.537 ± 0.54
5.597GlyLeu: 5.597 ± 0.408
2.475GlyMet: 2.475 ± 0.393
2.651GlyAsn: 2.651 ± 0.411
1.473GlyPro: 1.473 ± 0.233
3.358GlyGln: 3.358 ± 0.41
3.594GlyArg: 3.594 ± 0.345
6.01GlySer: 6.01 ± 0.661
4.596GlyThr: 4.596 ± 0.517
5.833GlyVal: 5.833 ± 0.616
1.119GlyTrp: 1.119 ± 0.306
2.298GlyTyr: 2.298 ± 0.301
0.0GlyXaa: 0.0 ± 0.0
His
1.532HisAla: 1.532 ± 0.303
0.354HisCys: 0.354 ± 0.137
1.002HisAsp: 1.002 ± 0.253
1.532HisGlu: 1.532 ± 0.307
0.707HisPhe: 0.707 ± 0.235
1.473HisGly: 1.473 ± 0.301
0.412HisHis: 0.412 ± 0.119
0.943HisIle: 0.943 ± 0.238
0.943HisLys: 0.943 ± 0.283
1.178HisLeu: 1.178 ± 0.289
0.707HisMet: 0.707 ± 0.219
0.884HisAsn: 0.884 ± 0.216
0.707HisPro: 0.707 ± 0.187
0.648HisGln: 0.648 ± 0.236
0.884HisArg: 0.884 ± 0.202
1.414HisSer: 1.414 ± 0.265
1.473HisThr: 1.473 ± 0.551
1.414HisVal: 1.414 ± 0.239
0.589HisTrp: 0.589 ± 0.174
0.707HisTyr: 0.707 ± 0.235
0.0HisXaa: 0.0 ± 0.0
Ile
4.89IleAla: 4.89 ± 0.446
0.766IleCys: 0.766 ± 0.192
4.478IleAsp: 4.478 ± 0.493
3.594IleGlu: 3.594 ± 0.502
1.355IlePhe: 1.355 ± 0.331
3.83IleGly: 3.83 ± 0.547
1.119IleHis: 1.119 ± 0.195
4.183IleIle: 4.183 ± 0.543
4.714IleLys: 4.714 ± 0.479
3.948IleLeu: 3.948 ± 0.486
1.532IleMet: 1.532 ± 0.29
3.417IleAsn: 3.417 ± 0.459
2.651IlePro: 2.651 ± 0.315
2.887IleGln: 2.887 ± 0.536
3.241IleArg: 3.241 ± 0.474
3.3IleSer: 3.3 ± 0.414
3.771IleThr: 3.771 ± 0.463
3.889IleVal: 3.889 ± 0.57
0.707IleTrp: 0.707 ± 0.238
2.534IleTyr: 2.534 ± 0.315
0.0IleXaa: 0.0 ± 0.0
Lys
6.128LysAla: 6.128 ± 0.775
0.884LysCys: 0.884 ± 0.272
3.005LysAsp: 3.005 ± 0.486
4.714LysGlu: 4.714 ± 0.674
2.71LysPhe: 2.71 ± 0.419
3.182LysGly: 3.182 ± 0.423
1.296LysHis: 1.296 ± 0.251
4.007LysIle: 4.007 ± 0.453
3.653LysLys: 3.653 ± 0.522
4.478LysLeu: 4.478 ± 0.541
1.532LysMet: 1.532 ± 0.322
1.885LysAsn: 1.885 ± 0.394
2.651LysPro: 2.651 ± 0.438
2.71LysGln: 2.71 ± 0.428
3.358LysArg: 3.358 ± 0.459
3.3LysSer: 3.3 ± 0.374
3.712LysThr: 3.712 ± 0.508
3.83LysVal: 3.83 ± 0.468
1.119LysTrp: 1.119 ± 0.24
2.003LysTyr: 2.003 ± 0.387
0.0LysXaa: 0.0 ± 0.0
Leu
5.597LeuAla: 5.597 ± 0.607
1.061LeuCys: 1.061 ± 0.269
4.36LeuAsp: 4.36 ± 0.531
4.537LeuGlu: 4.537 ± 0.657
2.475LeuPhe: 2.475 ± 0.356
4.242LeuGly: 4.242 ± 0.487
1.237LeuHis: 1.237 ± 0.262
3.889LeuIle: 3.889 ± 0.469
4.36LeuLys: 4.36 ± 0.55
4.478LeuLeu: 4.478 ± 0.502
2.062LeuMet: 2.062 ± 0.385
4.36LeuAsn: 4.36 ± 0.436
3.948LeuPro: 3.948 ± 0.511
4.183LeuGln: 4.183 ± 0.487
4.655LeuArg: 4.655 ± 0.528
4.478LeuSer: 4.478 ± 0.646
5.48LeuThr: 5.48 ± 0.735
5.362LeuVal: 5.362 ± 0.588
0.825LeuTrp: 0.825 ± 0.179
1.944LeuTyr: 1.944 ± 0.387
0.0LeuXaa: 0.0 ± 0.0
Met
2.534MetAla: 2.534 ± 0.414
0.471MetCys: 0.471 ± 0.227
1.65MetAsp: 1.65 ± 0.342
2.003MetGlu: 2.003 ± 0.405
1.002MetPhe: 1.002 ± 0.279
1.591MetGly: 1.591 ± 0.291
0.589MetHis: 0.589 ± 0.169
1.65MetIle: 1.65 ± 0.283
1.709MetLys: 1.709 ± 0.324
2.769MetLeu: 2.769 ± 0.419
0.707MetMet: 0.707 ± 0.208
1.119MetAsn: 1.119 ± 0.233
1.002MetPro: 1.002 ± 0.233
1.885MetGln: 1.885 ± 0.33
2.003MetArg: 2.003 ± 0.382
2.121MetSer: 2.121 ± 0.505
1.473MetThr: 1.473 ± 0.311
1.414MetVal: 1.414 ± 0.283
0.295MetTrp: 0.295 ± 0.12
0.943MetTyr: 0.943 ± 0.296
0.0MetXaa: 0.0 ± 0.0
Asn
4.655AsnAla: 4.655 ± 0.964
0.766AsnCys: 0.766 ± 0.181
2.593AsnAsp: 2.593 ± 0.323
3.476AsnGlu: 3.476 ± 0.546
0.884AsnPhe: 0.884 ± 0.166
4.949AsnGly: 4.949 ± 0.639
1.061AsnHis: 1.061 ± 0.226
2.769AsnIle: 2.769 ± 0.356
2.946AsnLys: 2.946 ± 0.422
3.535AsnLeu: 3.535 ± 0.479
1.002AsnMet: 1.002 ± 0.281
3.123AsnAsn: 3.123 ± 0.51
1.768AsnPro: 1.768 ± 0.276
1.65AsnGln: 1.65 ± 0.377
2.062AsnArg: 2.062 ± 0.334
2.946AsnSer: 2.946 ± 0.524
2.71AsnThr: 2.71 ± 0.421
3.123AsnVal: 3.123 ± 0.434
0.648AsnTrp: 0.648 ± 0.235
2.121AsnTyr: 2.121 ± 0.434
0.0AsnXaa: 0.0 ± 0.0
Pro
2.121ProAla: 2.121 ± 0.419
0.412ProCys: 0.412 ± 0.12
2.71ProAsp: 2.71 ± 0.414
3.358ProGlu: 3.358 ± 0.456
1.827ProPhe: 1.827 ± 0.408
3.005ProGly: 3.005 ± 0.42
0.943ProHis: 0.943 ± 0.292
2.062ProIle: 2.062 ± 0.374
1.827ProLys: 1.827 ± 0.282
2.357ProLeu: 2.357 ± 0.408
1.296ProMet: 1.296 ± 0.265
1.885ProAsn: 1.885 ± 0.375
2.121ProPro: 2.121 ± 0.358
1.473ProGln: 1.473 ± 0.484
2.239ProArg: 2.239 ± 0.356
2.121ProSer: 2.121 ± 0.339
2.062ProThr: 2.062 ± 0.4
2.887ProVal: 2.887 ± 0.421
0.648ProTrp: 0.648 ± 0.179
1.355ProTyr: 1.355 ± 0.268
0.0ProXaa: 0.0 ± 0.0
Gln
3.889GlnAla: 3.889 ± 0.765
0.412GlnCys: 0.412 ± 0.179
2.357GlnAsp: 2.357 ± 0.362
3.417GlnGlu: 3.417 ± 0.471
1.296GlnPhe: 1.296 ± 0.209
2.357GlnGly: 2.357 ± 0.503
0.589GlnHis: 0.589 ± 0.229
3.005GlnIle: 3.005 ± 0.359
1.65GlnLys: 1.65 ± 0.375
3.948GlnLeu: 3.948 ± 0.425
1.532GlnMet: 1.532 ± 0.364
2.239GlnAsn: 2.239 ± 0.443
1.944GlnPro: 1.944 ± 0.719
4.066GlnGln: 4.066 ± 1.724
2.416GlnArg: 2.416 ± 0.483
2.828GlnSer: 2.828 ± 0.428
3.358GlnThr: 3.358 ± 0.475
2.651GlnVal: 2.651 ± 0.365
0.707GlnTrp: 0.707 ± 0.205
1.885GlnTyr: 1.885 ± 0.283
0.0GlnXaa: 0.0 ± 0.0
Arg
3.535ArgAla: 3.535 ± 0.495
0.766ArgCys: 0.766 ± 0.221
2.71ArgAsp: 2.71 ± 0.385
3.241ArgGlu: 3.241 ± 0.602
2.062ArgPhe: 2.062 ± 0.373
3.417ArgGly: 3.417 ± 0.399
1.119ArgHis: 1.119 ± 0.345
3.182ArgIle: 3.182 ± 0.525
3.005ArgLys: 3.005 ± 0.39
4.89ArgLeu: 4.89 ± 0.564
1.944ArgMet: 1.944 ± 0.412
2.946ArgAsn: 2.946 ± 0.482
2.18ArgPro: 2.18 ± 0.416
2.416ArgGln: 2.416 ± 0.458
2.239ArgArg: 2.239 ± 0.405
2.887ArgSer: 2.887 ± 0.353
2.946ArgThr: 2.946 ± 0.413
3.83ArgVal: 3.83 ± 0.421
0.766ArgTrp: 0.766 ± 0.26
2.239ArgTyr: 2.239 ± 0.356
0.0ArgXaa: 0.0 ± 0.0
Ser
6.246SerAla: 6.246 ± 1.332
1.296SerCys: 1.296 ± 0.319
2.534SerAsp: 2.534 ± 0.42
3.889SerGlu: 3.889 ± 0.651
2.71SerPhe: 2.71 ± 0.408
5.539SerGly: 5.539 ± 0.746
1.532SerHis: 1.532 ± 0.342
2.828SerIle: 2.828 ± 0.348
3.889SerLys: 3.889 ± 0.398
4.242SerLeu: 4.242 ± 0.471
1.65SerMet: 1.65 ± 0.32
1.944SerAsn: 1.944 ± 0.314
1.65SerPro: 1.65 ± 0.238
2.475SerGln: 2.475 ± 0.428
3.064SerArg: 3.064 ± 0.5
4.36SerSer: 4.36 ± 0.535
3.948SerThr: 3.948 ± 0.45
3.889SerVal: 3.889 ± 0.369
0.884SerTrp: 0.884 ± 0.23
2.239SerTyr: 2.239 ± 0.386
0.0SerXaa: 0.0 ± 0.0
Thr
6.069ThrAla: 6.069 ± 1.132
0.589ThrCys: 0.589 ± 0.221
3.889ThrAsp: 3.889 ± 0.534
3.476ThrGlu: 3.476 ± 0.446
2.062ThrPhe: 2.062 ± 0.335
4.89ThrGly: 4.89 ± 0.631
1.355ThrHis: 1.355 ± 0.358
3.653ThrIle: 3.653 ± 0.454
2.828ThrLys: 2.828 ± 0.496
4.773ThrLeu: 4.773 ± 0.593
1.532ThrMet: 1.532 ± 0.33
3.241ThrAsn: 3.241 ± 0.424
3.241ThrPro: 3.241 ± 0.439
1.827ThrGln: 1.827 ± 0.33
2.769ThrArg: 2.769 ± 0.484
4.183ThrSer: 4.183 ± 0.508
3.594ThrThr: 3.594 ± 0.535
4.89ThrVal: 4.89 ± 0.69
1.002ThrTrp: 1.002 ± 0.204
1.65ThrTyr: 1.65 ± 0.294
0.0ThrXaa: 0.0 ± 0.0
Val
6.01ValAla: 6.01 ± 0.657
1.002ValCys: 1.002 ± 0.231
3.594ValAsp: 3.594 ± 0.503
4.419ValGlu: 4.419 ± 0.52
2.18ValPhe: 2.18 ± 0.311
4.596ValGly: 4.596 ± 0.592
0.825ValHis: 0.825 ± 0.215
3.771ValIle: 3.771 ± 0.429
5.362ValLys: 5.362 ± 0.624
4.831ValLeu: 4.831 ± 0.53
1.532ValMet: 1.532 ± 0.332
3.417ValAsn: 3.417 ± 0.637
2.887ValPro: 2.887 ± 0.376
3.3ValGln: 3.3 ± 0.587
3.653ValArg: 3.653 ± 0.363
3.889ValSer: 3.889 ± 0.445
4.478ValThr: 4.478 ± 0.558
5.774ValVal: 5.774 ± 0.521
1.296ValTrp: 1.296 ± 0.284
2.239ValTyr: 2.239 ± 0.472
0.0ValXaa: 0.0 ± 0.0
Trp
1.532TrpAla: 1.532 ± 0.288
0.295TrpCys: 0.295 ± 0.136
1.296TrpAsp: 1.296 ± 0.232
0.766TrpGlu: 0.766 ± 0.246
0.648TrpPhe: 0.648 ± 0.181
0.648TrpGly: 0.648 ± 0.149
0.118TrpHis: 0.118 ± 0.077
0.766TrpIle: 0.766 ± 0.224
0.943TrpLys: 0.943 ± 0.239
1.355TrpLeu: 1.355 ± 0.307
0.177TrpMet: 0.177 ± 0.093
0.825TrpAsn: 0.825 ± 0.217
0.412TrpPro: 0.412 ± 0.129
0.412TrpGln: 0.412 ± 0.19
1.061TrpArg: 1.061 ± 0.255
0.707TrpSer: 0.707 ± 0.194
0.648TrpThr: 0.648 ± 0.17
0.884TrpVal: 0.884 ± 0.25
0.236TrpTrp: 0.236 ± 0.108
0.53TrpTyr: 0.53 ± 0.168
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.593TyrAla: 2.593 ± 0.361
0.53TyrCys: 0.53 ± 0.149
2.298TyrAsp: 2.298 ± 0.39
2.298TyrGlu: 2.298 ± 0.392
1.178TyrPhe: 1.178 ± 0.276
2.651TyrGly: 2.651 ± 0.442
0.589TyrHis: 0.589 ± 0.207
2.298TyrIle: 2.298 ± 0.308
2.416TyrLys: 2.416 ± 0.343
2.475TyrLeu: 2.475 ± 0.47
1.061TyrMet: 1.061 ± 0.256
1.532TyrAsn: 1.532 ± 0.259
1.473TyrPro: 1.473 ± 0.314
1.119TyrGln: 1.119 ± 0.257
2.18TyrArg: 2.18 ± 0.365
1.827TyrSer: 1.827 ± 0.381
2.298TyrThr: 2.298 ± 0.386
3.123TyrVal: 3.123 ± 0.397
0.589TyrTrp: 0.589 ± 0.162
1.178TyrTyr: 1.178 ± 0.321
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (16973 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski