Amino acid dipeptide frequency for Klebsiella phage SopranoGao

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
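As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides and reports them in per mille, together with the uniform expectation of 1/400. It is only a sketch under stated assumptions: the function name `dipeptide_per_mille`, the toy sequences, the use of one-letter codes, and the choice to count pairs within each protein only are all hypothetical and are not taken from the Proteome-pI pipeline.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns (Xaa omitted).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for the 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(AMINO_ACIDS, repeat=2)}


# Hypothetical toy sequences, not taken from the phage proteome.
toy = ["MKKLA", "AAKL"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation: 1 / (20 * 20) = 0.25% = 2.5 per mille.
expected = 1000.0 / len(AMINO_ACIDS) ** 2

over = sorted(p for p, f in freqs.items() if f > expected)
print(f"expected per dipeptide: {expected} per mille")
print("overrepresented in toy data:", over)
```

On real data, the overrepresented and underrepresented sets would correspond to the red and blue cells of the table.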

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
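The unit conversion can be illustrated with a short Python sketch using the AlaAla value from the table. The helper name `per_mille_to_fraction` is hypothetical, and the comparison against 2.5‰ assumes the uniform expectation of 0.25% mentioned above.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


ala_ala = 14.792  # AlaAla frequency from the table, in per mille

fraction = per_mille_to_fraction(ala_ala)  # roughly 0.0148
percent = fraction * 100                   # roughly 1.48 %
fold = ala_ala / 2.5                       # ratio to the uniform 2.5 per mille

print(f"fraction={fraction:.6f} percent={percent:.4f} fold={fold:.2f}")
```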

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
14.792AlaAla: 14.792 ± 1.327
0.833AlaCys: 0.833 ± 0.294
7.188AlaAsp: 7.188 ± 0.571
8.125AlaGlu: 8.125 ± 0.847
3.177AlaPhe: 3.177 ± 0.432
8.594AlaGly: 8.594 ± 0.659
1.458AlaHis: 1.458 ± 0.271
5.208AlaIle: 5.208 ± 0.452
6.875AlaLys: 6.875 ± 0.653
8.073AlaLeu: 8.073 ± 0.614
3.542AlaMet: 3.542 ± 0.4
4.01AlaAsn: 4.01 ± 0.433
3.229AlaPro: 3.229 ± 0.436
5.312AlaGln: 5.312 ± 0.54
6.979AlaArg: 6.979 ± 0.85
5.417AlaSer: 5.417 ± 0.505
5.208AlaThr: 5.208 ± 0.46
5.99AlaVal: 5.99 ± 0.516
1.562AlaTrp: 1.562 ± 0.256
2.76AlaTyr: 2.76 ± 0.309
0.0AlaXaa: 0.0 ± 0.0
Cys
0.885CysAla: 0.885 ± 0.265
0.104CysCys: 0.104 ± 0.084
0.417CysAsp: 0.417 ± 0.185
0.417CysGlu: 0.417 ± 0.184
0.104CysPhe: 0.104 ± 0.083
0.729CysGly: 0.729 ± 0.32
0.312CysHis: 0.312 ± 0.151
0.417CysIle: 0.417 ± 0.151
0.521CysLys: 0.521 ± 0.203
0.677CysLeu: 0.677 ± 0.257
0.26CysMet: 0.26 ± 0.135
0.208CysAsn: 0.208 ± 0.123
0.417CysPro: 0.417 ± 0.253
0.417CysGln: 0.417 ± 0.183
0.521CysArg: 0.521 ± 0.207
0.365CysSer: 0.365 ± 0.161
0.573CysThr: 0.573 ± 0.219
0.26CysVal: 0.26 ± 0.13
0.26CysTrp: 0.26 ± 0.17
0.365CysTyr: 0.365 ± 0.161
0.0CysXaa: 0.0 ± 0.0
Asp
6.927AspAla: 6.927 ± 0.696
0.469AspCys: 0.469 ± 0.185
3.646AspAsp: 3.646 ± 0.43
4.844AspGlu: 4.844 ± 0.537
1.615AspPhe: 1.615 ± 0.252
6.615AspGly: 6.615 ± 0.593
1.302AspHis: 1.302 ± 0.329
3.75AspIle: 3.75 ± 0.405
2.917AspLys: 2.917 ± 0.31
4.948AspLeu: 4.948 ± 0.536
1.771AspMet: 1.771 ± 0.282
2.292AspAsn: 2.292 ± 0.344
2.448AspPro: 2.448 ± 0.419
2.135AspGln: 2.135 ± 0.308
3.333AspArg: 3.333 ± 0.401
3.125AspSer: 3.125 ± 0.352
2.604AspThr: 2.604 ± 0.286
3.698AspVal: 3.698 ± 0.363
1.51AspTrp: 1.51 ± 0.333
1.667AspTyr: 1.667 ± 0.247
0.0AspXaa: 0.0 ± 0.0
Glu
6.719GluAla: 6.719 ± 0.663
0.573GluCys: 0.573 ± 0.215
3.49GluAsp: 3.49 ± 0.596
6.198GluGlu: 6.198 ± 1.048
1.771GluPhe: 1.771 ± 0.335
4.792GluGly: 4.792 ± 0.518
0.99GluHis: 0.99 ± 0.198
3.802GluIle: 3.802 ± 0.369
4.844GluLys: 4.844 ± 0.521
6.198GluLeu: 6.198 ± 0.611
1.51GluMet: 1.51 ± 0.358
2.865GluAsn: 2.865 ± 0.428
2.396GluPro: 2.396 ± 0.382
4.74GluGln: 4.74 ± 0.77
5.521GluArg: 5.521 ± 0.664
3.438GluSer: 3.438 ± 0.386
2.708GluThr: 2.708 ± 0.414
3.438GluVal: 3.438 ± 0.41
1.615GluTrp: 1.615 ± 0.277
2.031GluTyr: 2.031 ± 0.248
0.0GluXaa: 0.0 ± 0.0
Phe
2.917PheAla: 2.917 ± 0.355
0.417PheCys: 0.417 ± 0.158
2.292PheAsp: 2.292 ± 0.314
2.292PheGlu: 2.292 ± 0.341
0.885PhePhe: 0.885 ± 0.255
2.5PheGly: 2.5 ± 0.461
0.938PheHis: 0.938 ± 0.196
1.771PheIle: 1.771 ± 0.381
1.458PheLys: 1.458 ± 0.316
1.615PheLeu: 1.615 ± 0.257
0.833PheMet: 0.833 ± 0.235
1.615PheAsn: 1.615 ± 0.22
1.458PhePro: 1.458 ± 0.251
1.146PheGln: 1.146 ± 0.23
1.667PheArg: 1.667 ± 0.34
2.24PheSer: 2.24 ± 0.353
1.667PheThr: 1.667 ± 0.226
1.823PheVal: 1.823 ± 0.231
0.469PheTrp: 0.469 ± 0.182
1.094PheTyr: 1.094 ± 0.236
0.0PheXaa: 0.0 ± 0.0
Gly
6.979GlyAla: 6.979 ± 0.773
0.365GlyCys: 0.365 ± 0.166
4.844GlyAsp: 4.844 ± 0.433
4.479GlyGlu: 4.479 ± 0.508
2.76GlyPhe: 2.76 ± 0.419
6.042GlyGly: 6.042 ± 0.649
0.521GlyHis: 0.521 ± 0.192
4.271GlyIle: 4.271 ± 0.387
5.052GlyLys: 5.052 ± 0.568
5.365GlyLeu: 5.365 ± 0.518
2.604GlyMet: 2.604 ± 0.329
2.76GlyAsn: 2.76 ± 0.332
2.188GlyPro: 2.188 ± 0.328
3.698GlyGln: 3.698 ± 0.484
4.323GlyArg: 4.323 ± 0.607
5.0GlySer: 5.0 ± 0.646
5.052GlyThr: 5.052 ± 0.559
4.531GlyVal: 4.531 ± 0.495
2.031GlyTrp: 2.031 ± 0.359
3.438GlyTyr: 3.438 ± 0.494
0.0GlyXaa: 0.0 ± 0.0
His
1.406HisAla: 1.406 ± 0.254
0.312HisCys: 0.312 ± 0.172
1.25HisAsp: 1.25 ± 0.283
0.99HisGlu: 0.99 ± 0.202
0.729HisPhe: 0.729 ± 0.2
1.094HisGly: 1.094 ± 0.194
0.781HisHis: 0.781 ± 0.247
0.938HisIle: 0.938 ± 0.237
0.469HisLys: 0.469 ± 0.256
1.406HisLeu: 1.406 ± 0.227
0.208HisMet: 0.208 ± 0.093
0.625HisAsn: 0.625 ± 0.204
0.833HisPro: 0.833 ± 0.273
0.885HisGln: 0.885 ± 0.156
1.198HisArg: 1.198 ± 0.304
0.833HisSer: 0.833 ± 0.16
1.094HisThr: 1.094 ± 0.298
0.885HisVal: 0.885 ± 0.211
0.521HisTrp: 0.521 ± 0.193
0.573HisTyr: 0.573 ± 0.172
0.0HisXaa: 0.0 ± 0.0
Ile
4.688IleAla: 4.688 ± 0.566
0.469IleCys: 0.469 ± 0.209
3.698IleAsp: 3.698 ± 0.508
4.115IleGlu: 4.115 ± 0.432
1.094IlePhe: 1.094 ± 0.265
4.167IleGly: 4.167 ± 0.531
0.938IleHis: 0.938 ± 0.191
2.396IleIle: 2.396 ± 0.393
1.875IleLys: 1.875 ± 0.385
3.229IleLeu: 3.229 ± 0.374
1.094IleMet: 1.094 ± 0.199
2.135IleAsn: 2.135 ± 0.433
2.76IlePro: 2.76 ± 0.473
1.615IleGln: 1.615 ± 0.29
2.812IleArg: 2.812 ± 0.396
3.75IleSer: 3.75 ± 0.547
3.698IleThr: 3.698 ± 0.359
3.073IleVal: 3.073 ± 0.366
0.573IleTrp: 0.573 ± 0.141
1.51IleTyr: 1.51 ± 0.321
0.0IleXaa: 0.0 ± 0.0
Lys
7.396LysAla: 7.396 ± 0.665
0.417LysCys: 0.417 ± 0.166
2.917LysAsp: 2.917 ± 0.369
3.542LysGlu: 3.542 ± 0.508
1.771LysPhe: 1.771 ± 0.301
3.646LysGly: 3.646 ± 0.489
0.885LysHis: 0.885 ± 0.283
2.24LysIle: 2.24 ± 0.352
3.177LysLys: 3.177 ± 0.519
4.219LysLeu: 4.219 ± 0.466
1.823LysMet: 1.823 ± 0.378
2.344LysAsn: 2.344 ± 0.416
3.438LysPro: 3.438 ± 0.417
2.552LysGln: 2.552 ± 0.303
4.375LysArg: 4.375 ± 0.824
1.615LysSer: 1.615 ± 0.311
3.177LysThr: 3.177 ± 0.334
2.76LysVal: 2.76 ± 0.379
0.677LysTrp: 0.677 ± 0.137
1.771LysTyr: 1.771 ± 0.325
0.0LysXaa: 0.0 ± 0.0
Leu
8.594LeuAla: 8.594 ± 0.669
0.469LeuCys: 0.469 ± 0.174
4.948LeuAsp: 4.948 ± 0.371
4.479LeuGlu: 4.479 ± 0.613
1.979LeuPhe: 1.979 ± 0.295
5.365LeuGly: 5.365 ± 0.516
1.458LeuHis: 1.458 ± 0.288
2.865LeuIle: 2.865 ± 0.413
3.854LeuLys: 3.854 ± 0.39
6.042LeuLeu: 6.042 ± 0.732
1.719LeuMet: 1.719 ± 0.32
2.76LeuAsn: 2.76 ± 0.354
3.958LeuPro: 3.958 ± 0.466
3.021LeuGln: 3.021 ± 0.38
5.833LeuArg: 5.833 ± 0.543
4.844LeuSer: 4.844 ± 0.42
4.844LeuThr: 4.844 ± 0.465
4.323LeuVal: 4.323 ± 0.453
0.99LeuTrp: 0.99 ± 0.336
2.396LeuTyr: 2.396 ± 0.376
0.0LeuXaa: 0.0 ± 0.0
Met
2.76MetAla: 2.76 ± 0.376
0.469MetCys: 0.469 ± 0.194
1.458MetAsp: 1.458 ± 0.329
1.25MetGlu: 1.25 ± 0.27
0.938MetPhe: 0.938 ± 0.212
1.927MetGly: 1.927 ± 0.42
0.312MetHis: 0.312 ± 0.118
1.25MetIle: 1.25 ± 0.215
1.719MetLys: 1.719 ± 0.287
1.719MetLeu: 1.719 ± 0.372
0.729MetMet: 0.729 ± 0.201
1.458MetAsn: 1.458 ± 0.236
1.302MetPro: 1.302 ± 0.3
1.667MetGln: 1.667 ± 0.369
1.771MetArg: 1.771 ± 0.363
1.771MetSer: 1.771 ± 0.277
1.615MetThr: 1.615 ± 0.271
1.354MetVal: 1.354 ± 0.325
0.625MetTrp: 0.625 ± 0.179
0.417MetTyr: 0.417 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
4.375AsnAla: 4.375 ± 0.52
0.26AsnCys: 0.26 ± 0.158
1.771AsnAsp: 1.771 ± 0.344
3.073AsnGlu: 3.073 ± 0.425
1.094AsnPhe: 1.094 ± 0.274
3.958AsnGly: 3.958 ± 0.554
0.677AsnHis: 0.677 ± 0.168
1.198AsnIle: 1.198 ± 0.189
2.135AsnLys: 2.135 ± 0.337
2.604AsnLeu: 2.604 ± 0.281
1.25AsnMet: 1.25 ± 0.234
1.615AsnAsn: 1.615 ± 0.352
2.917AsnPro: 2.917 ± 0.39
1.562AsnGln: 1.562 ± 0.403
2.604AsnArg: 2.604 ± 0.497
1.51AsnSer: 1.51 ± 0.211
2.292AsnThr: 2.292 ± 0.337
2.396AsnVal: 2.396 ± 0.315
0.625AsnTrp: 0.625 ± 0.168
1.146AsnTyr: 1.146 ± 0.215
0.0AsnXaa: 0.0 ± 0.0
Pro
4.792ProAla: 4.792 ± 0.569
0.417ProCys: 0.417 ± 0.153
3.438ProAsp: 3.438 ± 0.631
4.167ProGlu: 4.167 ± 0.64
1.51ProPhe: 1.51 ± 0.246
4.219ProGly: 4.219 ± 0.465
0.938ProHis: 0.938 ± 0.262
1.615ProIle: 1.615 ± 0.291
2.188ProLys: 2.188 ± 0.32
2.5ProLeu: 2.5 ± 0.286
0.625ProMet: 0.625 ± 0.196
1.51ProAsn: 1.51 ± 0.221
2.031ProPro: 2.031 ± 0.336
2.396ProGln: 2.396 ± 0.378
2.708ProArg: 2.708 ± 0.324
3.75ProSer: 3.75 ± 0.371
2.552ProThr: 2.552 ± 0.332
4.062ProVal: 4.062 ± 0.687
0.521ProTrp: 0.521 ± 0.167
1.302ProTyr: 1.302 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
6.146GlnAla: 6.146 ± 1.033
0.521GlnCys: 0.521 ± 0.227
2.031GlnAsp: 2.031 ± 0.283
3.281GlnGlu: 3.281 ± 0.515
1.719GlnPhe: 1.719 ± 0.357
2.812GlnGly: 2.812 ± 0.51
0.729GlnHis: 0.729 ± 0.219
2.292GlnIle: 2.292 ± 0.343
2.969GlnLys: 2.969 ± 0.309
4.583GlnLeu: 4.583 ± 0.468
1.042GlnMet: 1.042 ± 0.211
1.823GlnAsn: 1.823 ± 0.244
3.177GlnPro: 3.177 ± 0.553
6.354GlnGln: 6.354 ± 1.513
3.229GlnArg: 3.229 ± 0.343
2.396GlnSer: 2.396 ± 0.493
2.292GlnThr: 2.292 ± 0.278
2.344GlnVal: 2.344 ± 0.299
0.885GlnTrp: 0.885 ± 0.175
1.615GlnTyr: 1.615 ± 0.287
0.0GlnXaa: 0.0 ± 0.0
Arg
6.667ArgAla: 6.667 ± 0.749
0.521ArgCys: 0.521 ± 0.238
4.844ArgAsp: 4.844 ± 0.725
4.74ArgGlu: 4.74 ± 0.549
2.083ArgPhe: 2.083 ± 0.318
3.854ArgGly: 3.854 ± 0.467
1.302ArgHis: 1.302 ± 0.352
3.958ArgIle: 3.958 ± 0.434
3.906ArgLys: 3.906 ± 0.465
4.844ArgLeu: 4.844 ± 0.596
2.292ArgMet: 2.292 ± 0.342
2.604ArgAsn: 2.604 ± 0.338
2.396ArgPro: 2.396 ± 0.315
3.906ArgGln: 3.906 ± 0.473
4.375ArgArg: 4.375 ± 0.563
2.604ArgSer: 2.604 ± 0.34
3.906ArgThr: 3.906 ± 0.413
3.75ArgVal: 3.75 ± 0.382
1.198ArgTrp: 1.198 ± 0.316
1.875ArgTyr: 1.875 ± 0.29
0.0ArgXaa: 0.0 ± 0.0
Ser
5.469SerAla: 5.469 ± 0.472
0.365SerCys: 0.365 ± 0.172
3.49SerAsp: 3.49 ± 0.294
4.062SerGlu: 4.062 ± 0.451
2.031SerPhe: 2.031 ± 0.267
4.896SerGly: 4.896 ± 0.731
1.042SerHis: 1.042 ± 0.206
3.125SerIle: 3.125 ± 0.417
2.292SerLys: 2.292 ± 0.38
4.74SerLeu: 4.74 ± 0.644
1.615SerMet: 1.615 ± 0.229
1.823SerAsn: 1.823 ± 0.221
3.073SerPro: 3.073 ± 0.356
2.708SerGln: 2.708 ± 0.442
2.812SerArg: 2.812 ± 0.493
3.438SerSer: 3.438 ± 0.649
3.333SerThr: 3.333 ± 0.48
3.177SerVal: 3.177 ± 0.419
0.938SerTrp: 0.938 ± 0.232
1.51SerTyr: 1.51 ± 0.313
0.0SerXaa: 0.0 ± 0.0
Thr
6.51ThrAla: 6.51 ± 0.598
0.26ThrCys: 0.26 ± 0.142
3.333ThrAsp: 3.333 ± 0.355
4.115ThrGlu: 4.115 ± 0.468
2.083ThrPhe: 2.083 ± 0.289
4.115ThrGly: 4.115 ± 0.489
0.781ThrHis: 0.781 ± 0.179
2.656ThrIle: 2.656 ± 0.426
2.812ThrLys: 2.812 ± 0.377
4.427ThrLeu: 4.427 ± 0.547
1.042ThrMet: 1.042 ± 0.181
2.292ThrAsn: 2.292 ± 0.361
3.438ThrPro: 3.438 ± 0.352
2.76ThrGln: 2.76 ± 0.387
2.917ThrArg: 2.917 ± 0.475
3.594ThrSer: 3.594 ± 0.414
3.229ThrThr: 3.229 ± 0.487
3.385ThrVal: 3.385 ± 0.499
1.042ThrTrp: 1.042 ± 0.228
1.146ThrTyr: 1.146 ± 0.282
0.0ThrXaa: 0.0 ± 0.0
Val
5.99ValAla: 5.99 ± 0.567
0.469ValCys: 0.469 ± 0.18
3.125ValAsp: 3.125 ± 0.323
3.281ValGlu: 3.281 ± 0.363
1.667ValPhe: 1.667 ± 0.217
3.49ValGly: 3.49 ± 0.487
0.677ValHis: 0.677 ± 0.168
3.333ValIle: 3.333 ± 0.516
4.115ValLys: 4.115 ± 0.418
3.698ValLeu: 3.698 ± 0.513
1.562ValMet: 1.562 ± 0.287
2.292ValAsn: 2.292 ± 0.337
3.385ValPro: 3.385 ± 0.45
2.708ValGln: 2.708 ± 0.39
4.427ValArg: 4.427 ± 0.558
3.385ValSer: 3.385 ± 0.35
3.698ValThr: 3.698 ± 0.455
3.802ValVal: 3.802 ± 0.49
1.042ValTrp: 1.042 ± 0.225
1.927ValTyr: 1.927 ± 0.275
0.0ValXaa: 0.0 ± 0.0
Trp
1.354TrpAla: 1.354 ± 0.256
0.26TrpCys: 0.26 ± 0.138
1.146TrpAsp: 1.146 ± 0.238
0.833TrpGlu: 0.833 ± 0.221
0.938TrpPhe: 0.938 ± 0.211
1.146TrpGly: 1.146 ± 0.314
0.312TrpHis: 0.312 ± 0.118
1.198TrpIle: 1.198 ± 0.373
0.573TrpLys: 0.573 ± 0.174
1.771TrpLeu: 1.771 ± 0.42
0.469TrpMet: 0.469 ± 0.161
0.625TrpAsn: 0.625 ± 0.158
0.625TrpPro: 0.625 ± 0.159
1.406TrpGln: 1.406 ± 0.309
1.562TrpArg: 1.562 ± 0.301
1.094TrpSer: 1.094 ± 0.209
0.885TrpThr: 0.885 ± 0.263
1.146TrpVal: 1.146 ± 0.245
0.365TrpTrp: 0.365 ± 0.144
0.521TrpTyr: 0.521 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.969TyrAla: 2.969 ± 0.442
0.26TyrCys: 0.26 ± 0.131
2.292TyrAsp: 2.292 ± 0.261
1.354TyrGlu: 1.354 ± 0.333
1.146TyrPhe: 1.146 ± 0.2
2.344TyrGly: 2.344 ± 0.328
0.677TyrHis: 0.677 ± 0.245
1.406TyrIle: 1.406 ± 0.267
1.042TyrLys: 1.042 ± 0.247
2.083TyrLeu: 2.083 ± 0.292
0.469TyrMet: 0.469 ± 0.12
1.406TyrAsn: 1.406 ± 0.208
1.771TyrPro: 1.771 ± 0.316
1.25TyrGln: 1.25 ± 0.289
2.656TyrArg: 2.656 ± 0.371
1.771TyrSer: 1.771 ± 0.343
1.51TyrThr: 1.51 ± 0.356
1.875TyrVal: 1.875 ± 0.305
0.781TyrTrp: 0.781 ± 0.174
0.677TyrTyr: 0.677 ± 0.222
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (19201 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski