Amino acid dipepetide frequency for Pelagibacter phage HTVC019P

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.428AlaAla: 4.428 ± 0.858
0.458AlaCys: 0.458 ± 0.209
5.345AlaAsp: 5.345 ± 0.543
4.352AlaGlu: 4.352 ± 0.764
2.52AlaPhe: 2.52 ± 0.467
5.574AlaGly: 5.574 ± 0.983
0.611AlaHis: 0.611 ± 0.223
5.497AlaIle: 5.497 ± 0.632
6.719AlaLys: 6.719 ± 0.766
6.108AlaLeu: 6.108 ± 0.549
2.596AlaMet: 2.596 ± 0.356
6.261AlaAsn: 6.261 ± 1.21
2.214AlaPro: 2.214 ± 0.455
2.672AlaGln: 2.672 ± 0.399
2.062AlaArg: 2.062 ± 0.381
5.421AlaSer: 5.421 ± 0.656
5.574AlaThr: 5.574 ± 0.831
4.81AlaVal: 4.81 ± 0.886
0.687AlaTrp: 0.687 ± 0.25
2.825AlaTyr: 2.825 ± 0.472
0.0AlaXaa: 0.0 ± 0.0
Cys
0.458CysAla: 0.458 ± 0.203
0.076CysCys: 0.076 ± 0.072
0.611CysAsp: 0.611 ± 0.22
0.611CysGlu: 0.611 ± 0.189
0.382CysPhe: 0.382 ± 0.153
0.458CysGly: 0.458 ± 0.184
0.229CysHis: 0.229 ± 0.108
0.687CysIle: 0.687 ± 0.315
0.382CysLys: 0.382 ± 0.184
1.145CysLeu: 1.145 ± 0.284
0.153CysMet: 0.153 ± 0.1
0.687CysAsn: 0.687 ± 0.232
0.153CysPro: 0.153 ± 0.115
0.305CysGln: 0.305 ± 0.117
0.611CysArg: 0.611 ± 0.247
0.305CysSer: 0.305 ± 0.143
0.382CysThr: 0.382 ± 0.148
0.534CysVal: 0.534 ± 0.183
0.153CysTrp: 0.153 ± 0.104
0.305CysTyr: 0.305 ± 0.153
0.0CysXaa: 0.0 ± 0.0
Asp
4.581AspAla: 4.581 ± 0.533
0.382AspCys: 0.382 ± 0.15
3.741AspAsp: 3.741 ± 0.513
3.665AspGlu: 3.665 ± 0.641
2.52AspPhe: 2.52 ± 0.35
4.352AspGly: 4.352 ± 0.632
0.534AspHis: 0.534 ± 0.161
4.887AspIle: 4.887 ± 0.595
4.505AspLys: 4.505 ± 0.677
5.65AspLeu: 5.65 ± 0.617
1.451AspMet: 1.451 ± 0.347
2.978AspAsn: 2.978 ± 0.412
1.909AspPro: 1.909 ± 0.446
1.298AspGln: 1.298 ± 0.294
2.672AspArg: 2.672 ± 0.535
3.589AspSer: 3.589 ± 0.84
4.505AspThr: 4.505 ± 0.876
3.589AspVal: 3.589 ± 0.636
0.611AspTrp: 0.611 ± 0.193
2.901AspTyr: 2.901 ± 0.422
0.0AspXaa: 0.0 ± 0.0
Glu
4.428GluAla: 4.428 ± 0.664
0.84GluCys: 0.84 ± 0.326
3.589GluAsp: 3.589 ± 0.498
4.199GluGlu: 4.199 ± 0.706
2.749GluPhe: 2.749 ± 0.528
3.818GluGly: 3.818 ± 0.48
1.069GluHis: 1.069 ± 0.306
4.352GluIle: 4.352 ± 0.447
5.497GluLys: 5.497 ± 1.008
5.497GluLeu: 5.497 ± 0.661
2.214GluMet: 2.214 ± 0.444
3.818GluAsn: 3.818 ± 0.486
1.527GluPro: 1.527 ± 0.401
2.596GluGln: 2.596 ± 0.689
2.901GluArg: 2.901 ± 0.587
2.291GluSer: 2.291 ± 0.487
4.734GluThr: 4.734 ± 0.624
3.741GluVal: 3.741 ± 0.488
0.916GluTrp: 0.916 ± 0.271
2.52GluTyr: 2.52 ± 0.55
0.0GluXaa: 0.0 ± 0.0
Phe
2.443PheAla: 2.443 ± 0.437
0.382PheCys: 0.382 ± 0.211
2.825PheAsp: 2.825 ± 0.533
2.367PheGlu: 2.367 ± 0.469
1.222PhePhe: 1.222 ± 0.342
2.291PheGly: 2.291 ± 0.707
0.764PheHis: 0.764 ± 0.297
2.52PheIle: 2.52 ± 0.378
4.123PheLys: 4.123 ± 0.445
2.825PheLeu: 2.825 ± 0.398
1.374PheMet: 1.374 ± 0.316
2.214PheAsn: 2.214 ± 0.451
0.84PhePro: 0.84 ± 0.307
1.985PheGln: 1.985 ± 0.344
1.298PheArg: 1.298 ± 0.228
2.367PheSer: 2.367 ± 0.419
2.978PheThr: 2.978 ± 0.579
2.901PheVal: 2.901 ± 0.43
0.611PheTrp: 0.611 ± 0.263
1.374PheTyr: 1.374 ± 0.23
0.0PheXaa: 0.0 ± 0.0
Gly
3.894GlyAla: 3.894 ± 0.655
0.458GlyCys: 0.458 ± 0.175
4.276GlyAsp: 4.276 ± 0.543
3.436GlyGlu: 3.436 ± 0.473
3.283GlyPhe: 3.283 ± 0.47
4.505GlyGly: 4.505 ± 0.814
1.145GlyHis: 1.145 ± 0.245
4.581GlyIle: 4.581 ± 0.795
5.803GlyLys: 5.803 ± 0.762
4.658GlyLeu: 4.658 ± 0.606
1.603GlyMet: 1.603 ± 0.314
3.97GlyAsn: 3.97 ± 0.617
0.0GlyPro: 0.0 ± 0.0
2.062GlyGln: 2.062 ± 0.469
2.291GlyArg: 2.291 ± 0.51
4.581GlySer: 4.581 ± 0.721
5.574GlyThr: 5.574 ± 0.617
4.047GlyVal: 4.047 ± 0.535
0.993GlyTrp: 0.993 ± 0.378
2.901GlyTyr: 2.901 ± 0.421
0.0GlyXaa: 0.0 ± 0.0
His
1.374HisAla: 1.374 ± 0.291
0.305HisCys: 0.305 ± 0.163
0.687HisAsp: 0.687 ± 0.22
0.687HisGlu: 0.687 ± 0.286
1.222HisPhe: 1.222 ± 0.335
1.145HisGly: 1.145 ± 0.26
0.305HisHis: 0.305 ± 0.16
1.298HisIle: 1.298 ± 0.298
1.069HisLys: 1.069 ± 0.327
1.832HisLeu: 1.832 ± 0.386
0.687HisMet: 0.687 ± 0.201
0.84HisAsn: 0.84 ± 0.219
0.764HisPro: 0.764 ± 0.201
0.458HisGln: 0.458 ± 0.209
0.534HisArg: 0.534 ± 0.194
1.069HisSer: 1.069 ± 0.31
1.298HisThr: 1.298 ± 0.29
0.687HisVal: 0.687 ± 0.213
0.305HisTrp: 0.305 ± 0.141
0.458HisTyr: 0.458 ± 0.162
0.0HisXaa: 0.0 ± 0.0
Ile
5.879IleAla: 5.879 ± 0.589
0.458IleCys: 0.458 ± 0.18
4.658IleAsp: 4.658 ± 0.607
4.505IleGlu: 4.505 ± 0.811
1.756IlePhe: 1.756 ± 0.354
4.963IleGly: 4.963 ± 0.591
1.298IleHis: 1.298 ± 0.354
4.199IleIle: 4.199 ± 0.708
6.49IleLys: 6.49 ± 0.74
4.123IleLeu: 4.123 ± 0.468
1.451IleMet: 1.451 ± 0.391
3.436IleAsn: 3.436 ± 0.529
2.825IlePro: 2.825 ± 0.406
2.367IleGln: 2.367 ± 0.444
2.443IleArg: 2.443 ± 0.41
4.658IleSer: 4.658 ± 0.93
4.658IleThr: 4.658 ± 0.621
3.283IleVal: 3.283 ± 0.535
0.458IleTrp: 0.458 ± 0.206
2.291IleTyr: 2.291 ± 0.446
0.0IleXaa: 0.0 ± 0.0
Lys
5.727LysAla: 5.727 ± 0.765
0.382LysCys: 0.382 ± 0.18
4.581LysAsp: 4.581 ± 0.63
7.177LysGlu: 7.177 ± 0.882
3.207LysPhe: 3.207 ± 0.504
4.047LysGly: 4.047 ± 0.506
1.68LysHis: 1.68 ± 0.314
5.574LysIle: 5.574 ± 0.693
7.635LysLys: 7.635 ± 1.107
7.177LysLeu: 7.177 ± 0.664
1.756LysMet: 1.756 ± 0.343
4.199LysAsn: 4.199 ± 0.741
2.367LysPro: 2.367 ± 0.371
3.207LysGln: 3.207 ± 0.627
4.352LysArg: 4.352 ± 0.761
6.185LysSer: 6.185 ± 0.83
5.192LysThr: 5.192 ± 0.508
4.581LysVal: 4.581 ± 0.782
0.534LysTrp: 0.534 ± 0.215
3.207LysTyr: 3.207 ± 0.559
0.0LysXaa: 0.0 ± 0.0
Leu
6.719LeuAla: 6.719 ± 0.741
0.84LeuCys: 0.84 ± 0.326
5.039LeuAsp: 5.039 ± 0.542
5.65LeuGlu: 5.65 ± 0.842
2.672LeuPhe: 2.672 ± 0.466
3.818LeuGly: 3.818 ± 0.676
1.603LeuHis: 1.603 ± 0.371
4.123LeuIle: 4.123 ± 0.438
7.635LeuLys: 7.635 ± 0.726
6.719LeuLeu: 6.719 ± 0.686
1.603LeuMet: 1.603 ± 0.409
4.581LeuAsn: 4.581 ± 0.461
3.207LeuPro: 3.207 ± 0.5
3.436LeuGln: 3.436 ± 0.486
3.207LeuArg: 3.207 ± 0.515
5.727LeuSer: 5.727 ± 0.605
5.497LeuThr: 5.497 ± 0.621
4.734LeuVal: 4.734 ± 0.53
0.84LeuTrp: 0.84 ± 0.348
2.596LeuTyr: 2.596 ± 0.433
0.0LeuXaa: 0.0 ± 0.0
Met
2.901MetAla: 2.901 ± 0.617
0.229MetCys: 0.229 ± 0.124
1.222MetAsp: 1.222 ± 0.276
1.374MetGlu: 1.374 ± 0.349
0.916MetPhe: 0.916 ± 0.257
1.756MetGly: 1.756 ± 0.374
0.382MetHis: 0.382 ± 0.166
1.069MetIle: 1.069 ± 0.315
2.291MetLys: 2.291 ± 0.563
2.443MetLeu: 2.443 ± 0.65
0.764MetMet: 0.764 ± 0.259
1.069MetAsn: 1.069 ± 0.243
1.527MetPro: 1.527 ± 0.359
0.993MetGln: 0.993 ± 0.239
1.527MetArg: 1.527 ± 0.373
2.749MetSer: 2.749 ± 0.34
1.222MetThr: 1.222 ± 0.285
1.222MetVal: 1.222 ± 0.388
0.382MetTrp: 0.382 ± 0.167
0.687MetTyr: 0.687 ± 0.243
0.0MetXaa: 0.0 ± 0.0
Asn
4.963AsnAla: 4.963 ± 0.853
0.611AsnCys: 0.611 ± 0.201
3.054AsnAsp: 3.054 ± 0.558
3.283AsnGlu: 3.283 ± 0.416
2.978AsnPhe: 2.978 ± 0.413
4.428AsnGly: 4.428 ± 0.787
0.458AsnHis: 0.458 ± 0.182
4.505AsnIle: 4.505 ± 0.85
4.047AsnLys: 4.047 ± 0.532
4.428AsnLeu: 4.428 ± 0.401
1.451AsnMet: 1.451 ± 0.392
3.665AsnAsn: 3.665 ± 0.523
2.214AsnPro: 2.214 ± 0.447
2.062AsnGln: 2.062 ± 0.437
2.52AsnArg: 2.52 ± 0.438
4.963AsnSer: 4.963 ± 1.159
5.345AsnThr: 5.345 ± 0.77
3.13AsnVal: 3.13 ± 0.751
0.611AsnTrp: 0.611 ± 0.232
2.443AsnTyr: 2.443 ± 0.391
0.0AsnXaa: 0.0 ± 0.0
Pro
1.909ProAla: 1.909 ± 0.287
0.229ProCys: 0.229 ± 0.118
1.756ProAsp: 1.756 ± 0.407
2.749ProGlu: 2.749 ± 0.56
1.527ProPhe: 1.527 ± 0.273
0.0ProGly: 0.0 ± 0.0
0.458ProHis: 0.458 ± 0.153
1.909ProIle: 1.909 ± 0.383
2.291ProLys: 2.291 ± 0.505
2.291ProLeu: 2.291 ± 0.389
0.84ProMet: 0.84 ± 0.258
1.832ProAsn: 1.832 ± 0.4
0.916ProPro: 0.916 ± 0.278
1.222ProGln: 1.222 ± 0.337
0.382ProArg: 0.382 ± 0.151
3.13ProSer: 3.13 ± 0.367
2.825ProThr: 2.825 ± 0.416
1.451ProVal: 1.451 ± 0.339
0.0ProTrp: 0.0 ± 0.0
0.916ProTyr: 0.916 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
2.52GlnAla: 2.52 ± 0.571
0.305GlnCys: 0.305 ± 0.179
2.138GlnAsp: 2.138 ± 0.315
2.825GlnGlu: 2.825 ± 0.567
1.756GlnPhe: 1.756 ± 0.326
1.832GlnGly: 1.832 ± 0.339
0.534GlnHis: 0.534 ± 0.221
2.367GlnIle: 2.367 ± 0.447
3.13GlnLys: 3.13 ± 0.615
3.207GlnLeu: 3.207 ± 0.621
1.374GlnMet: 1.374 ± 0.46
1.603GlnAsn: 1.603 ± 0.357
0.916GlnPro: 0.916 ± 0.234
1.756GlnGln: 1.756 ± 0.313
1.68GlnArg: 1.68 ± 0.395
2.443GlnSer: 2.443 ± 0.416
2.978GlnThr: 2.978 ± 0.428
2.291GlnVal: 2.291 ± 0.489
0.458GlnTrp: 0.458 ± 0.177
1.145GlnTyr: 1.145 ± 0.319
0.0GlnXaa: 0.0 ± 0.0
Arg
2.825ArgAla: 2.825 ± 0.454
0.0ArgCys: 0.0 ± 0.0
3.054ArgAsp: 3.054 ± 0.589
3.13ArgGlu: 3.13 ± 0.532
1.527ArgPhe: 1.527 ± 0.339
2.672ArgGly: 2.672 ± 0.396
0.916ArgHis: 0.916 ± 0.301
2.062ArgIle: 2.062 ± 0.352
2.52ArgLys: 2.52 ± 0.534
3.665ArgLeu: 3.665 ± 0.738
1.222ArgMet: 1.222 ± 0.37
1.909ArgAsn: 1.909 ± 0.395
1.069ArgPro: 1.069 ± 0.265
1.527ArgGln: 1.527 ± 0.346
1.603ArgArg: 1.603 ± 0.345
2.138ArgSer: 2.138 ± 0.387
1.603ArgThr: 1.603 ± 0.346
1.756ArgVal: 1.756 ± 0.436
0.687ArgTrp: 0.687 ± 0.236
1.756ArgTyr: 1.756 ± 0.342
0.0ArgXaa: 0.0 ± 0.0
Ser
6.108SerAla: 6.108 ± 1.112
0.687SerCys: 0.687 ± 0.27
3.894SerAsp: 3.894 ± 0.627
3.741SerGlu: 3.741 ± 0.586
2.901SerPhe: 2.901 ± 0.411
6.261SerGly: 6.261 ± 1.026
1.374SerHis: 1.374 ± 0.311
5.268SerIle: 5.268 ± 0.585
5.268SerLys: 5.268 ± 0.535
5.421SerLeu: 5.421 ± 0.727
1.832SerMet: 1.832 ± 0.341
4.658SerAsn: 4.658 ± 0.669
0.916SerPro: 0.916 ± 0.218
2.672SerGln: 2.672 ± 0.381
1.832SerArg: 1.832 ± 0.437
5.116SerSer: 5.116 ± 0.72
5.803SerThr: 5.803 ± 1.039
4.505SerVal: 4.505 ± 0.73
1.069SerTrp: 1.069 ± 0.293
2.443SerTyr: 2.443 ± 0.451
0.0SerXaa: 0.0 ± 0.0
Thr
6.795ThrAla: 6.795 ± 0.96
0.611ThrCys: 0.611 ± 0.226
4.047ThrAsp: 4.047 ± 0.737
4.199ThrGlu: 4.199 ± 0.505
3.13ThrPhe: 3.13 ± 0.649
5.116ThrGly: 5.116 ± 0.551
1.222ThrHis: 1.222 ± 0.248
4.276ThrIle: 4.276 ± 0.63
4.658ThrLys: 4.658 ± 0.51
5.192ThrLeu: 5.192 ± 0.652
1.832ThrMet: 1.832 ± 0.379
5.192ThrAsn: 5.192 ± 1.116
2.749ThrPro: 2.749 ± 0.42
2.443ThrGln: 2.443 ± 0.559
1.985ThrArg: 1.985 ± 0.411
6.49ThrSer: 6.49 ± 0.912
6.872ThrThr: 6.872 ± 1.29
4.81ThrVal: 4.81 ± 1.231
0.382ThrTrp: 0.382 ± 0.181
2.138ThrTyr: 2.138 ± 0.382
0.0ThrXaa: 0.0 ± 0.0
Val
5.574ValAla: 5.574 ± 1.43
0.611ValCys: 0.611 ± 0.237
2.825ValAsp: 2.825 ± 0.47
3.665ValGlu: 3.665 ± 0.55
2.214ValPhe: 2.214 ± 0.404
3.894ValGly: 3.894 ± 0.481
1.603ValHis: 1.603 ± 0.423
3.894ValIle: 3.894 ± 0.515
4.352ValLys: 4.352 ± 0.604
3.283ValLeu: 3.283 ± 0.437
1.68ValMet: 1.68 ± 0.404
5.116ValAsn: 5.116 ± 0.764
1.832ValPro: 1.832 ± 0.426
1.68ValGln: 1.68 ± 0.265
2.291ValArg: 2.291 ± 0.455
5.116ValSer: 5.116 ± 0.686
4.047ValThr: 4.047 ± 0.818
2.672ValVal: 2.672 ± 0.402
0.458ValTrp: 0.458 ± 0.143
1.145ValTyr: 1.145 ± 0.25
0.0ValXaa: 0.0 ± 0.0
Trp
0.687TrpAla: 0.687 ± 0.244
0.076TrpCys: 0.076 ± 0.078
0.611TrpAsp: 0.611 ± 0.168
0.534TrpGlu: 0.534 ± 0.176
0.229TrpPhe: 0.229 ± 0.134
0.382TrpGly: 0.382 ± 0.163
0.076TrpHis: 0.076 ± 0.072
0.687TrpIle: 0.687 ± 0.247
1.145TrpLys: 1.145 ± 0.342
1.68TrpLeu: 1.68 ± 0.388
0.0TrpMet: 0.0 ± 0.0
0.764TrpAsn: 0.764 ± 0.198
0.0TrpPro: 0.0 ± 0.0
0.611TrpGln: 0.611 ± 0.212
0.534TrpArg: 0.534 ± 0.245
0.84TrpSer: 0.84 ± 0.225
0.993TrpThr: 0.993 ± 0.295
0.687TrpVal: 0.687 ± 0.224
0.076TrpTrp: 0.076 ± 0.087
0.076TrpTyr: 0.076 ± 0.083
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.52TyrAla: 2.52 ± 0.384
0.687TyrCys: 0.687 ± 0.231
2.367TyrAsp: 2.367 ± 0.358
1.145TyrGlu: 1.145 ± 0.236
0.993TyrPhe: 0.993 ± 0.325
2.749TyrGly: 2.749 ± 0.519
0.764TyrHis: 0.764 ± 0.242
2.52TyrIle: 2.52 ± 0.437
3.207TyrLys: 3.207 ± 0.551
2.901TyrLeu: 2.901 ± 0.575
0.764TyrMet: 0.764 ± 0.227
2.367TyrAsn: 2.367 ± 0.403
0.687TyrPro: 0.687 ± 0.186
1.909TyrGln: 1.909 ± 0.384
0.916TyrArg: 0.916 ± 0.294
2.596TyrSer: 2.596 ± 0.371
2.062TyrThr: 2.062 ± 0.356
2.52TyrVal: 2.52 ± 0.333
0.458TyrTrp: 0.458 ± 0.178
0.916TyrTyr: 0.916 ± 0.189
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (13098 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski