Amino acid dipepetide frequency for Gordonia phage Vivi2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.774AlaAla: 15.774 ± 1.1
0.853AlaCys: 0.853 ± 0.263
8.26AlaAsp: 8.26 ± 0.886
9.059AlaGlu: 9.059 ± 1.034
2.878AlaPhe: 2.878 ± 0.506
11.617AlaGly: 11.617 ± 1.011
2.292AlaHis: 2.292 ± 0.444
6.075AlaIle: 6.075 ± 1.009
3.73AlaLys: 3.73 ± 0.539
9.059AlaLeu: 9.059 ± 0.919
2.665AlaMet: 2.665 ± 0.397
2.718AlaAsn: 2.718 ± 0.44
6.608AlaPro: 6.608 ± 0.622
4.263AlaGln: 4.263 ± 0.421
8.633AlaArg: 8.633 ± 0.931
5.436AlaSer: 5.436 ± 0.612
7.354AlaThr: 7.354 ± 0.739
7.514AlaVal: 7.514 ± 0.715
2.558AlaTrp: 2.558 ± 0.383
2.398AlaTyr: 2.398 ± 0.366
0.0AlaXaa: 0.0 ± 0.0
Cys
1.013CysAla: 1.013 ± 0.274
0.053CysCys: 0.053 ± 0.05
0.906CysAsp: 0.906 ± 0.257
0.426CysGlu: 0.426 ± 0.15
0.213CysPhe: 0.213 ± 0.084
1.066CysGly: 1.066 ± 0.381
0.426CysHis: 0.426 ± 0.157
0.0CysIle: 0.0 ± 0.0
0.16CysLys: 0.16 ± 0.096
0.266CysLeu: 0.266 ± 0.13
0.16CysMet: 0.16 ± 0.1
0.107CysAsn: 0.107 ± 0.07
0.799CysPro: 0.799 ± 0.229
0.16CysGln: 0.16 ± 0.103
0.533CysArg: 0.533 ± 0.204
0.426CysSer: 0.426 ± 0.166
0.533CysThr: 0.533 ± 0.18
0.853CysVal: 0.853 ± 0.305
0.213CysTrp: 0.213 ± 0.118
0.053CysTyr: 0.053 ± 0.058
0.0CysXaa: 0.0 ± 0.0
Asp
8.473AspAla: 8.473 ± 0.732
0.426AspCys: 0.426 ± 0.18
8.047AspAsp: 8.047 ± 1.18
6.075AspGlu: 6.075 ± 0.836
1.172AspPhe: 1.172 ± 0.303
7.248AspGly: 7.248 ± 0.796
1.812AspHis: 1.812 ± 0.364
2.238AspIle: 2.238 ± 0.417
1.226AspLys: 1.226 ± 0.228
4.53AspLeu: 4.53 ± 0.521
1.812AspMet: 1.812 ± 0.348
2.238AspAsn: 2.238 ± 0.389
7.088AspPro: 7.088 ± 0.849
2.505AspGln: 2.505 ± 0.468
5.489AspArg: 5.489 ± 0.624
2.078AspSer: 2.078 ± 0.281
3.624AspThr: 3.624 ± 0.45
6.661AspVal: 6.661 ± 0.676
1.865AspTrp: 1.865 ± 0.398
1.652AspTyr: 1.652 ± 0.313
0.0AspXaa: 0.0 ± 0.0
Glu
5.596GluAla: 5.596 ± 0.517
0.533GluCys: 0.533 ± 0.152
3.304GluAsp: 3.304 ± 0.442
1.492GluGlu: 1.492 ± 0.323
1.972GluPhe: 1.972 ± 0.348
4.05GluGly: 4.05 ± 0.473
2.238GluHis: 2.238 ± 0.408
2.345GluIle: 2.345 ± 0.288
1.226GluLys: 1.226 ± 0.359
5.222GluLeu: 5.222 ± 0.754
0.959GluMet: 0.959 ± 0.232
1.865GluAsn: 1.865 ± 0.265
3.144GluPro: 3.144 ± 0.546
3.73GluGln: 3.73 ± 0.496
4.317GluArg: 4.317 ± 0.636
2.824GluSer: 2.824 ± 0.574
2.771GluThr: 2.771 ± 0.457
5.222GluVal: 5.222 ± 0.572
1.545GluTrp: 1.545 ± 0.296
1.759GluTyr: 1.759 ± 0.297
0.0GluXaa: 0.0 ± 0.0
Phe
3.038PheAla: 3.038 ± 0.466
0.32PheCys: 0.32 ± 0.161
2.665PheAsp: 2.665 ± 0.373
1.705PheGlu: 1.705 ± 0.366
0.639PhePhe: 0.639 ± 0.225
2.665PheGly: 2.665 ± 0.301
0.48PheHis: 0.48 ± 0.141
1.279PheIle: 1.279 ± 0.288
0.586PheLys: 0.586 ± 0.177
1.545PheLeu: 1.545 ± 0.347
0.48PheMet: 0.48 ± 0.148
0.799PheAsn: 0.799 ± 0.227
1.119PhePro: 1.119 ± 0.289
0.639PheGln: 0.639 ± 0.218
1.119PheArg: 1.119 ± 0.265
1.066PheSer: 1.066 ± 0.272
1.918PheThr: 1.918 ± 0.287
2.345PheVal: 2.345 ± 0.261
0.213PheTrp: 0.213 ± 0.113
0.266PheTyr: 0.266 ± 0.139
0.0PheXaa: 0.0 ± 0.0
Gly
8.9GlyAla: 8.9 ± 0.979
0.799GlyCys: 0.799 ± 0.263
6.768GlyAsp: 6.768 ± 0.797
4.903GlyGlu: 4.903 ± 0.577
1.918GlyPhe: 1.918 ± 0.347
7.514GlyGly: 7.514 ± 0.972
1.865GlyHis: 1.865 ± 0.284
3.997GlyIle: 3.997 ± 0.471
3.197GlyLys: 3.197 ± 0.507
5.915GlyLeu: 5.915 ± 0.8
1.812GlyMet: 1.812 ± 0.238
2.665GlyAsn: 2.665 ± 0.475
3.944GlyPro: 3.944 ± 0.517
3.357GlyGln: 3.357 ± 0.441
6.715GlyArg: 6.715 ± 0.675
5.063GlySer: 5.063 ± 0.623
5.702GlyThr: 5.702 ± 0.814
7.567GlyVal: 7.567 ± 0.639
1.492GlyTrp: 1.492 ± 0.288
2.718GlyTyr: 2.718 ± 0.428
0.0GlyXaa: 0.0 ± 0.0
His
2.025HisAla: 2.025 ± 0.365
0.053HisCys: 0.053 ± 0.052
2.025HisAsp: 2.025 ± 0.357
1.492HisGlu: 1.492 ± 0.295
0.426HisPhe: 0.426 ± 0.148
1.599HisGly: 1.599 ± 0.285
0.213HisHis: 0.213 ± 0.116
0.853HisIle: 0.853 ± 0.237
0.266HisLys: 0.266 ± 0.133
1.918HisLeu: 1.918 ± 0.329
0.639HisMet: 0.639 ± 0.192
0.586HisAsn: 0.586 ± 0.19
1.279HisPro: 1.279 ± 0.275
0.853HisGln: 0.853 ± 0.211
2.185HisArg: 2.185 ± 0.376
0.853HisSer: 0.853 ± 0.225
1.599HisThr: 1.599 ± 0.278
2.025HisVal: 2.025 ± 0.369
0.32HisTrp: 0.32 ± 0.131
0.426HisTyr: 0.426 ± 0.143
0.0HisXaa: 0.0 ± 0.0
Ile
6.235IleAla: 6.235 ± 0.591
0.213IleCys: 0.213 ± 0.115
4.103IleAsp: 4.103 ± 0.531
3.517IleGlu: 3.517 ± 0.405
0.586IlePhe: 0.586 ± 0.215
5.009IleGly: 5.009 ± 1.188
0.746IleHis: 0.746 ± 0.217
2.078IleIle: 2.078 ± 0.446
0.799IleLys: 0.799 ± 0.42
2.345IleLeu: 2.345 ± 0.355
0.586IleMet: 0.586 ± 0.188
0.906IleAsn: 0.906 ± 0.252
2.611IlePro: 2.611 ± 0.343
1.599IleGln: 1.599 ± 0.364
3.091IleArg: 3.091 ± 0.396
1.759IleSer: 1.759 ± 0.318
3.304IleThr: 3.304 ± 0.448
3.517IleVal: 3.517 ± 0.369
0.746IleTrp: 0.746 ± 0.215
1.226IleTyr: 1.226 ± 0.285
0.0IleXaa: 0.0 ± 0.0
Lys
2.505LysAla: 2.505 ± 0.341
0.266LysCys: 0.266 ± 0.138
0.959LysAsp: 0.959 ± 0.231
0.746LysGlu: 0.746 ± 0.189
1.013LysPhe: 1.013 ± 0.234
1.439LysGly: 1.439 ± 0.32
0.586LysHis: 0.586 ± 0.162
1.172LysIle: 1.172 ± 0.262
0.586LysLys: 0.586 ± 0.172
2.078LysLeu: 2.078 ± 0.379
0.266LysMet: 0.266 ± 0.137
0.906LysAsn: 0.906 ± 0.261
1.492LysPro: 1.492 ± 0.334
0.799LysGln: 0.799 ± 0.224
2.025LysArg: 2.025 ± 0.329
1.759LysSer: 1.759 ± 0.406
1.865LysThr: 1.865 ± 0.333
2.345LysVal: 2.345 ± 0.354
0.266LysTrp: 0.266 ± 0.11
0.799LysTyr: 0.799 ± 0.226
0.0LysXaa: 0.0 ± 0.0
Leu
9.113LeuAla: 9.113 ± 0.806
0.639LeuCys: 0.639 ± 0.188
5.809LeuAsp: 5.809 ± 0.634
2.398LeuGlu: 2.398 ± 0.427
1.865LeuPhe: 1.865 ± 0.417
6.981LeuGly: 6.981 ± 1.097
1.652LeuHis: 1.652 ± 0.365
3.091LeuIle: 3.091 ± 0.39
1.066LeuLys: 1.066 ± 0.25
5.489LeuLeu: 5.489 ± 0.646
1.386LeuMet: 1.386 ± 0.232
2.451LeuAsn: 2.451 ± 0.317
4.476LeuPro: 4.476 ± 0.443
3.091LeuGln: 3.091 ± 0.372
6.128LeuArg: 6.128 ± 0.465
3.624LeuSer: 3.624 ± 0.426
5.862LeuThr: 5.862 ± 0.496
5.542LeuVal: 5.542 ± 0.555
1.865LeuTrp: 1.865 ± 0.322
1.279LeuTyr: 1.279 ± 0.266
0.0LeuXaa: 0.0 ± 0.0
Met
1.865MetAla: 1.865 ± 0.297
0.16MetCys: 0.16 ± 0.087
0.959MetAsp: 0.959 ± 0.221
0.639MetGlu: 0.639 ± 0.217
0.586MetPhe: 0.586 ± 0.2
1.119MetGly: 1.119 ± 0.299
0.32MetHis: 0.32 ± 0.109
0.853MetIle: 0.853 ± 0.21
0.746MetLys: 0.746 ± 0.208
2.025MetLeu: 2.025 ± 0.364
0.266MetMet: 0.266 ± 0.132
0.693MetAsn: 0.693 ± 0.204
1.812MetPro: 1.812 ± 0.315
0.853MetGln: 0.853 ± 0.354
1.812MetArg: 1.812 ± 0.27
1.386MetSer: 1.386 ± 0.268
3.624MetThr: 3.624 ± 0.432
1.439MetVal: 1.439 ± 0.29
0.48MetTrp: 0.48 ± 0.157
0.213MetTyr: 0.213 ± 0.115
0.0MetXaa: 0.0 ± 0.0
Asn
3.73AsnAla: 3.73 ± 0.607
0.266AsnCys: 0.266 ± 0.13
2.292AsnAsp: 2.292 ± 0.389
1.439AsnGlu: 1.439 ± 0.305
0.693AsnPhe: 0.693 ± 0.21
2.878AsnGly: 2.878 ± 0.427
0.533AsnHis: 0.533 ± 0.146
0.746AsnIle: 0.746 ± 0.162
0.426AsnLys: 0.426 ± 0.208
2.238AsnLeu: 2.238 ± 0.495
0.799AsnMet: 0.799 ± 0.189
0.853AsnAsn: 0.853 ± 0.19
2.292AsnPro: 2.292 ± 0.452
1.013AsnGln: 1.013 ± 0.309
2.238AsnArg: 2.238 ± 0.406
0.906AsnSer: 0.906 ± 0.289
1.652AsnThr: 1.652 ± 0.288
1.972AsnVal: 1.972 ± 0.327
0.746AsnTrp: 0.746 ± 0.226
0.533AsnTyr: 0.533 ± 0.178
0.0AsnXaa: 0.0 ± 0.0
Pro
8.153ProAla: 8.153 ± 0.716
0.213ProCys: 0.213 ± 0.107
5.436ProAsp: 5.436 ± 0.734
3.251ProGlu: 3.251 ± 0.481
1.705ProPhe: 1.705 ± 0.308
4.636ProGly: 4.636 ± 0.511
1.279ProHis: 1.279 ± 0.308
3.73ProIle: 3.73 ± 0.413
1.439ProLys: 1.439 ± 0.292
3.997ProLeu: 3.997 ± 0.379
1.652ProMet: 1.652 ± 0.289
2.078ProAsn: 2.078 ± 0.301
4.103ProPro: 4.103 ± 0.62
2.132ProGln: 2.132 ± 0.365
2.984ProArg: 2.984 ± 0.42
3.091ProSer: 3.091 ± 0.446
4.103ProThr: 4.103 ± 0.565
4.956ProVal: 4.956 ± 0.615
1.119ProTrp: 1.119 ± 0.277
1.332ProTyr: 1.332 ± 0.249
0.0ProXaa: 0.0 ± 0.0
Gln
5.063GlnAla: 5.063 ± 0.601
0.266GlnCys: 0.266 ± 0.111
1.172GlnAsp: 1.172 ± 0.302
0.959GlnGlu: 0.959 ± 0.247
1.439GlnPhe: 1.439 ± 0.243
2.025GlnGly: 2.025 ± 0.307
0.853GlnHis: 0.853 ± 0.239
1.865GlnIle: 1.865 ± 0.443
0.746GlnLys: 0.746 ± 0.176
3.197GlnLeu: 3.197 ± 0.37
1.013GlnMet: 1.013 ± 0.258
0.959GlnAsn: 0.959 ± 0.267
2.611GlnPro: 2.611 ± 0.369
2.398GlnGln: 2.398 ± 0.368
4.157GlnArg: 4.157 ± 0.501
1.652GlnSer: 1.652 ± 0.267
2.238GlnThr: 2.238 ± 0.346
2.931GlnVal: 2.931 ± 0.397
1.013GlnTrp: 1.013 ± 0.226
0.746GlnTyr: 0.746 ± 0.159
0.0GlnXaa: 0.0 ± 0.0
Arg
9.912ArgAla: 9.912 ± 0.933
0.639ArgCys: 0.639 ± 0.185
5.063ArgAsp: 5.063 ± 0.535
3.517ArgGlu: 3.517 ± 0.512
2.025ArgPhe: 2.025 ± 0.375
4.743ArgGly: 4.743 ± 0.599
1.652ArgHis: 1.652 ± 0.326
4.05ArgIle: 4.05 ± 0.538
1.972ArgLys: 1.972 ± 0.402
5.596ArgLeu: 5.596 ± 0.589
2.238ArgMet: 2.238 ± 0.302
2.505ArgAsn: 2.505 ± 0.332
4.157ArgPro: 4.157 ± 0.554
2.451ArgGln: 2.451 ± 0.394
8.527ArgArg: 8.527 ± 1.197
3.357ArgSer: 3.357 ± 0.417
4.263ArgThr: 4.263 ± 0.419
6.128ArgVal: 6.128 ± 0.833
1.759ArgTrp: 1.759 ± 0.314
1.918ArgTyr: 1.918 ± 0.325
0.0ArgXaa: 0.0 ± 0.0
Ser
4.956SerAla: 4.956 ± 0.6
0.426SerCys: 0.426 ± 0.184
3.304SerAsp: 3.304 ± 0.403
2.505SerGlu: 2.505 ± 0.444
1.013SerPhe: 1.013 ± 0.258
5.542SerGly: 5.542 ± 0.783
0.693SerHis: 0.693 ± 0.188
2.292SerIle: 2.292 ± 0.52
1.172SerLys: 1.172 ± 0.25
3.411SerLeu: 3.411 ± 0.527
1.066SerMet: 1.066 ± 0.251
1.119SerAsn: 1.119 ± 0.245
2.451SerPro: 2.451 ± 0.4
1.279SerGln: 1.279 ± 0.295
3.464SerArg: 3.464 ± 0.46
2.718SerSer: 2.718 ± 0.587
3.997SerThr: 3.997 ± 0.576
3.837SerVal: 3.837 ± 0.408
0.853SerTrp: 0.853 ± 0.193
0.906SerTyr: 0.906 ± 0.211
0.0SerXaa: 0.0 ± 0.0
Thr
9.166ThrAla: 9.166 ± 0.675
1.013ThrCys: 1.013 ± 0.325
5.063ThrAsp: 5.063 ± 0.529
3.517ThrGlu: 3.517 ± 0.402
1.332ThrPhe: 1.332 ± 0.277
6.555ThrGly: 6.555 ± 0.786
1.332ThrHis: 1.332 ± 0.334
2.771ThrIle: 2.771 ± 0.426
1.918ThrLys: 1.918 ± 0.286
5.063ThrLeu: 5.063 ± 0.485
1.652ThrMet: 1.652 ± 0.269
1.599ThrAsn: 1.599 ± 0.267
4.956ThrPro: 4.956 ± 0.433
1.705ThrGln: 1.705 ± 0.287
3.89ThrArg: 3.89 ± 0.374
3.357ThrSer: 3.357 ± 0.37
5.009ThrThr: 5.009 ± 0.652
4.636ThrVal: 4.636 ± 0.684
1.279ThrTrp: 1.279 ± 0.28
1.599ThrTyr: 1.599 ± 0.297
0.0ThrXaa: 0.0 ± 0.0
Val
9.219ValAla: 9.219 ± 0.838
0.853ValCys: 0.853 ± 0.239
7.301ValAsp: 7.301 ± 0.652
5.222ValGlu: 5.222 ± 0.608
2.025ValPhe: 2.025 ± 0.27
6.661ValGly: 6.661 ± 0.493
1.705ValHis: 1.705 ± 0.267
3.784ValIle: 3.784 ± 0.443
1.599ValLys: 1.599 ± 0.263
5.755ValLeu: 5.755 ± 0.531
1.119ValMet: 1.119 ± 0.203
2.505ValAsn: 2.505 ± 0.373
3.997ValPro: 3.997 ± 0.488
2.824ValGln: 2.824 ± 0.379
5.542ValArg: 5.542 ± 0.637
3.517ValSer: 3.517 ± 0.442
5.596ValThr: 5.596 ± 0.582
5.969ValVal: 5.969 ± 0.621
2.132ValTrp: 2.132 ± 0.298
2.292ValTyr: 2.292 ± 0.423
0.0ValXaa: 0.0 ± 0.0
Trp
2.238TrpAla: 2.238 ± 0.348
0.266TrpCys: 0.266 ± 0.134
1.013TrpAsp: 1.013 ± 0.219
1.172TrpGlu: 1.172 ± 0.254
0.746TrpPhe: 0.746 ± 0.205
1.226TrpGly: 1.226 ± 0.237
0.426TrpHis: 0.426 ± 0.144
0.959TrpIle: 0.959 ± 0.314
0.639TrpLys: 0.639 ± 0.221
2.505TrpLeu: 2.505 ± 0.322
0.746TrpMet: 0.746 ± 0.196
0.426TrpAsn: 0.426 ± 0.173
1.332TrpPro: 1.332 ± 0.261
1.119TrpGln: 1.119 ± 0.162
1.652TrpArg: 1.652 ± 0.314
1.119TrpSer: 1.119 ± 0.241
1.013TrpThr: 1.013 ± 0.251
1.865TrpVal: 1.865 ± 0.29
0.373TrpTrp: 0.373 ± 0.175
0.426TrpTyr: 0.426 ± 0.152
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.771TyrAla: 2.771 ± 0.505
0.107TyrCys: 0.107 ± 0.081
1.812TyrAsp: 1.812 ± 0.316
2.078TyrGlu: 2.078 ± 0.366
0.639TyrPhe: 0.639 ± 0.172
2.238TyrGly: 2.238 ± 0.307
0.533TyrHis: 0.533 ± 0.168
0.799TyrIle: 0.799 ± 0.203
0.373TyrLys: 0.373 ± 0.15
1.705TyrLeu: 1.705 ± 0.303
0.48TyrMet: 0.48 ± 0.151
0.266TyrAsn: 0.266 ± 0.112
1.119TyrPro: 1.119 ± 0.308
0.533TyrGln: 0.533 ± 0.152
1.972TyrArg: 1.972 ± 0.371
1.013TyrSer: 1.013 ± 0.196
1.386TyrThr: 1.386 ± 0.297
2.292TyrVal: 2.292 ± 0.366
0.426TyrTrp: 0.426 ± 0.149
0.426TyrTyr: 0.426 ± 0.146
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 89 proteins (18766 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski