Amino acid dipepetide frequency for Pelagibacter phage HTVC011P

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.162AlaAla: 0.162 ± 0.187
0.162AlaCys: 0.162 ± 0.107
4.953AlaAsp: 4.953 ± 0.885
4.71AlaGlu: 4.71 ± 0.917
3.004AlaPhe: 3.004 ± 0.431
4.06AlaGly: 4.06 ± 0.727
0.568AlaHis: 0.568 ± 0.175
5.197AlaIle: 5.197 ± 0.659
4.304AlaLys: 4.304 ± 0.527
4.629AlaLeu: 4.629 ± 0.677
2.111AlaMet: 2.111 ± 0.346
4.791AlaAsn: 4.791 ± 0.77
2.111AlaPro: 2.111 ± 0.384
2.598AlaGln: 2.598 ± 0.476
3.167AlaArg: 3.167 ± 0.515
6.09AlaSer: 6.09 ± 0.797
5.928AlaThr: 5.928 ± 0.988
3.492AlaVal: 3.492 ± 0.702
0.568AlaTrp: 0.568 ± 0.219
2.436AlaTyr: 2.436 ± 0.57
0.0AlaXaa: 0.0 ± 0.0
Cys
0.325CysAla: 0.325 ± 0.149
0.081CysCys: 0.081 ± 0.087
0.162CysAsp: 0.162 ± 0.125
0.487CysGlu: 0.487 ± 0.227
0.406CysPhe: 0.406 ± 0.152
0.487CysGly: 0.487 ± 0.178
0.081CysHis: 0.081 ± 0.081
0.731CysIle: 0.731 ± 0.274
0.65CysLys: 0.65 ± 0.249
0.731CysLeu: 0.731 ± 0.276
0.244CysMet: 0.244 ± 0.19
0.244CysAsn: 0.244 ± 0.169
0.244CysPro: 0.244 ± 0.143
0.244CysGln: 0.244 ± 0.194
0.325CysArg: 0.325 ± 0.151
0.568CysSer: 0.568 ± 0.255
0.487CysThr: 0.487 ± 0.185
1.056CysVal: 1.056 ± 0.335
0.081CysTrp: 0.081 ± 0.087
0.325CysTyr: 0.325 ± 0.155
0.0CysXaa: 0.0 ± 0.0
Asp
4.304AspAla: 4.304 ± 0.647
0.812AspCys: 0.812 ± 0.351
3.41AspAsp: 3.41 ± 0.616
3.816AspGlu: 3.816 ± 0.52
3.898AspPhe: 3.898 ± 0.582
4.791AspGly: 4.791 ± 0.656
0.487AspHis: 0.487 ± 0.264
4.304AspIle: 4.304 ± 0.515
4.791AspLys: 4.791 ± 0.698
4.385AspLeu: 4.385 ± 0.634
1.38AspMet: 1.38 ± 0.327
3.329AspAsn: 3.329 ± 0.4
2.111AspPro: 2.111 ± 0.484
1.056AspGln: 1.056 ± 0.295
1.705AspArg: 1.705 ± 0.364
2.598AspSer: 2.598 ± 0.515
4.466AspThr: 4.466 ± 0.552
3.329AspVal: 3.329 ± 0.598
0.406AspTrp: 0.406 ± 0.176
2.923AspTyr: 2.923 ± 0.577
0.0AspXaa: 0.0 ± 0.0
Glu
4.953GluAla: 4.953 ± 0.668
0.162GluCys: 0.162 ± 0.134
3.573GluAsp: 3.573 ± 0.476
3.329GluGlu: 3.329 ± 0.631
2.111GluPhe: 2.111 ± 0.305
4.141GluGly: 4.141 ± 0.928
0.812GluHis: 0.812 ± 0.332
4.466GluIle: 4.466 ± 0.71
4.791GluLys: 4.791 ± 0.585
6.09GluLeu: 6.09 ± 0.887
1.38GluMet: 1.38 ± 0.306
3.654GluAsn: 3.654 ± 0.613
1.543GluPro: 1.543 ± 0.338
3.248GluGln: 3.248 ± 0.676
1.868GluArg: 1.868 ± 0.404
2.355GluSer: 2.355 ± 0.434
4.222GluThr: 4.222 ± 0.613
4.71GluVal: 4.71 ± 0.653
0.487GluTrp: 0.487 ± 0.216
2.355GluTyr: 2.355 ± 0.629
0.0GluXaa: 0.0 ± 0.0
Phe
2.436PheAla: 2.436 ± 0.378
0.325PheCys: 0.325 ± 0.21
3.41PheAsp: 3.41 ± 0.487
1.543PheGlu: 1.543 ± 0.322
1.868PhePhe: 1.868 ± 0.409
3.086PheGly: 3.086 ± 0.515
0.65PheHis: 0.65 ± 0.202
3.573PheIle: 3.573 ± 0.531
3.654PheLys: 3.654 ± 0.859
2.517PheLeu: 2.517 ± 0.378
0.487PheMet: 0.487 ± 0.167
3.573PheAsn: 3.573 ± 0.395
1.299PhePro: 1.299 ± 0.28
1.462PheGln: 1.462 ± 0.388
2.192PheArg: 2.192 ± 0.417
3.492PheSer: 3.492 ± 0.656
2.274PheThr: 2.274 ± 0.499
1.786PheVal: 1.786 ± 0.3
0.162PheTrp: 0.162 ± 0.103
1.38PheTyr: 1.38 ± 0.303
0.0PheXaa: 0.0 ± 0.0
Gly
3.735GlyAla: 3.735 ± 0.575
0.325GlyCys: 0.325 ± 0.159
4.629GlyAsp: 4.629 ± 0.585
2.923GlyGlu: 2.923 ± 0.524
3.004GlyPhe: 3.004 ± 0.534
3.816GlyGly: 3.816 ± 0.608
1.38GlyHis: 1.38 ± 0.464
3.979GlyIle: 3.979 ± 0.572
5.197GlyLys: 5.197 ± 0.89
5.359GlyLeu: 5.359 ± 0.64
1.218GlyMet: 1.218 ± 0.284
3.816GlyAsn: 3.816 ± 0.539
0.0GlyPro: 0.0 ± 0.0
2.436GlyGln: 2.436 ± 0.287
2.192GlyArg: 2.192 ± 0.46
5.928GlySer: 5.928 ± 1.049
5.847GlyThr: 5.847 ± 0.98
4.953GlyVal: 4.953 ± 0.884
0.974GlyTrp: 0.974 ± 0.278
2.274GlyTyr: 2.274 ± 0.39
0.0GlyXaa: 0.0 ± 0.0
His
0.812HisAla: 0.812 ± 0.276
0.406HisCys: 0.406 ± 0.252
0.65HisAsp: 0.65 ± 0.267
0.974HisGlu: 0.974 ± 0.366
0.568HisPhe: 0.568 ± 0.228
0.325HisGly: 0.325 ± 0.164
0.325HisHis: 0.325 ± 0.21
1.056HisIle: 1.056 ± 0.248
1.705HisLys: 1.705 ± 0.384
2.111HisLeu: 2.111 ± 0.498
0.325HisMet: 0.325 ± 0.168
0.731HisAsn: 0.731 ± 0.199
0.325HisPro: 0.325 ± 0.192
0.487HisGln: 0.487 ± 0.22
0.731HisArg: 0.731 ± 0.258
1.462HisSer: 1.462 ± 0.493
0.893HisThr: 0.893 ± 0.248
1.218HisVal: 1.218 ± 0.244
0.081HisTrp: 0.081 ± 0.079
0.568HisTyr: 0.568 ± 0.231
0.0HisXaa: 0.0 ± 0.0
Ile
5.116IleAla: 5.116 ± 0.662
0.568IleCys: 0.568 ± 0.24
4.141IleAsp: 4.141 ± 0.504
3.654IleGlu: 3.654 ± 0.587
2.111IlePhe: 2.111 ± 0.378
3.979IleGly: 3.979 ± 0.535
1.38IleHis: 1.38 ± 0.497
4.791IleIle: 4.791 ± 0.691
6.659IleLys: 6.659 ± 0.89
5.441IleLeu: 5.441 ± 0.704
1.299IleMet: 1.299 ± 0.397
5.278IleAsn: 5.278 ± 0.584
2.274IlePro: 2.274 ± 0.531
2.598IleGln: 2.598 ± 0.458
3.167IleArg: 3.167 ± 0.467
5.522IleSer: 5.522 ± 1.084
5.359IleThr: 5.359 ± 0.822
3.816IleVal: 3.816 ± 0.522
0.65IleTrp: 0.65 ± 0.265
2.03IleTyr: 2.03 ± 0.454
0.0IleXaa: 0.0 ± 0.0
Lys
4.953LysAla: 4.953 ± 0.911
0.65LysCys: 0.65 ± 0.244
4.953LysAsp: 4.953 ± 0.741
5.278LysGlu: 5.278 ± 0.929
3.41LysPhe: 3.41 ± 0.651
4.222LysGly: 4.222 ± 0.498
1.299LysHis: 1.299 ± 0.385
6.74LysIle: 6.74 ± 0.799
7.633LysLys: 7.633 ± 1.336
7.877LysLeu: 7.877 ± 0.983
2.355LysMet: 2.355 ± 0.415
4.385LysAsn: 4.385 ± 0.691
3.167LysPro: 3.167 ± 0.681
3.492LysGln: 3.492 ± 0.57
3.573LysArg: 3.573 ± 0.517
6.334LysSer: 6.334 ± 0.805
5.278LysThr: 5.278 ± 0.791
4.141LysVal: 4.141 ± 0.559
1.218LysTrp: 1.218 ± 0.311
3.735LysTyr: 3.735 ± 0.648
0.0LysXaa: 0.0 ± 0.0
Leu
5.765LeuAla: 5.765 ± 0.534
0.812LeuCys: 0.812 ± 0.319
5.359LeuAsp: 5.359 ± 0.655
6.09LeuGlu: 6.09 ± 0.953
2.517LeuPhe: 2.517 ± 0.341
5.116LeuGly: 5.116 ± 0.56
1.543LeuHis: 1.543 ± 0.428
4.791LeuIle: 4.791 ± 0.641
7.633LeuLys: 7.633 ± 0.877
5.116LeuLeu: 5.116 ± 0.756
2.111LeuMet: 2.111 ± 0.369
6.171LeuAsn: 6.171 ± 0.583
3.573LeuPro: 3.573 ± 0.642
3.735LeuGln: 3.735 ± 0.72
3.735LeuArg: 3.735 ± 0.574
6.171LeuSer: 6.171 ± 0.727
4.953LeuThr: 4.953 ± 0.61
3.654LeuVal: 3.654 ± 0.552
0.568LeuTrp: 0.568 ± 0.202
1.299LeuTyr: 1.299 ± 0.253
0.0LeuXaa: 0.0 ± 0.0
Met
1.786MetAla: 1.786 ± 0.365
0.406MetCys: 0.406 ± 0.168
0.487MetAsp: 0.487 ± 0.171
1.218MetGlu: 1.218 ± 0.373
0.812MetPhe: 0.812 ± 0.302
1.218MetGly: 1.218 ± 0.349
0.162MetHis: 0.162 ± 0.115
1.299MetIle: 1.299 ± 0.304
2.355MetLys: 2.355 ± 0.657
2.03MetLeu: 2.03 ± 0.434
0.325MetMet: 0.325 ± 0.143
1.299MetAsn: 1.299 ± 0.297
0.893MetPro: 0.893 ± 0.387
0.974MetGln: 0.974 ± 0.341
1.056MetArg: 1.056 ± 0.311
1.868MetSer: 1.868 ± 0.444
1.624MetThr: 1.624 ± 0.304
1.056MetVal: 1.056 ± 0.299
0.162MetTrp: 0.162 ± 0.111
0.325MetTyr: 0.325 ± 0.216
0.0MetXaa: 0.0 ± 0.0
Asn
3.816AsnAla: 3.816 ± 0.618
0.244AsnCys: 0.244 ± 0.141
2.842AsnAsp: 2.842 ± 0.35
3.735AsnGlu: 3.735 ± 0.541
3.492AsnPhe: 3.492 ± 0.547
5.197AsnGly: 5.197 ± 1.118
1.218AsnHis: 1.218 ± 0.36
4.872AsnIle: 4.872 ± 0.828
5.847AsnLys: 5.847 ± 0.801
5.116AsnLeu: 5.116 ± 0.612
1.137AsnMet: 1.137 ± 0.214
4.466AsnAsn: 4.466 ± 1.04
2.842AsnPro: 2.842 ± 0.462
2.355AsnGln: 2.355 ± 0.465
2.111AsnArg: 2.111 ± 0.347
4.466AsnSer: 4.466 ± 0.589
6.253AsnThr: 6.253 ± 0.912
3.41AsnVal: 3.41 ± 0.572
1.137AsnTrp: 1.137 ± 0.346
2.355AsnTyr: 2.355 ± 0.431
0.0AsnXaa: 0.0 ± 0.0
Pro
1.299ProAla: 1.299 ± 0.287
0.244ProCys: 0.244 ± 0.121
1.949ProAsp: 1.949 ± 0.414
2.761ProGlu: 2.761 ± 0.517
1.462ProPhe: 1.462 ± 0.423
0.0ProGly: 0.0 ± 0.0
0.487ProHis: 0.487 ± 0.26
2.111ProIle: 2.111 ± 0.499
2.68ProLys: 2.68 ± 0.628
1.868ProLeu: 1.868 ± 0.473
0.325ProMet: 0.325 ± 0.183
2.111ProAsn: 2.111 ± 0.355
0.731ProPro: 0.731 ± 0.285
1.624ProGln: 1.624 ± 0.476
1.137ProArg: 1.137 ± 0.255
2.517ProSer: 2.517 ± 0.438
2.274ProThr: 2.274 ± 0.521
2.192ProVal: 2.192 ± 0.338
0.081ProTrp: 0.081 ± 0.073
1.543ProTyr: 1.543 ± 0.322
0.0ProXaa: 0.0 ± 0.0
Gln
3.004GlnAla: 3.004 ± 0.441
0.325GlnCys: 0.325 ± 0.164
1.624GlnAsp: 1.624 ± 0.328
2.598GlnGlu: 2.598 ± 0.366
2.274GlnPhe: 2.274 ± 0.481
2.923GlnGly: 2.923 ± 0.54
0.487GlnHis: 0.487 ± 0.222
2.192GlnIle: 2.192 ± 0.445
3.573GlnLys: 3.573 ± 0.655
3.086GlnLeu: 3.086 ± 0.594
1.38GlnMet: 1.38 ± 0.404
1.949GlnAsn: 1.949 ± 0.382
1.056GlnPro: 1.056 ± 0.26
1.868GlnGln: 1.868 ± 0.638
1.786GlnArg: 1.786 ± 0.353
2.274GlnSer: 2.274 ± 0.393
3.004GlnThr: 3.004 ± 0.51
2.355GlnVal: 2.355 ± 0.46
0.65GlnTrp: 0.65 ± 0.248
1.38GlnTyr: 1.38 ± 0.301
0.0GlnXaa: 0.0 ± 0.0
Arg
1.705ArgAla: 1.705 ± 0.302
0.325ArgCys: 0.325 ± 0.16
2.436ArgAsp: 2.436 ± 0.388
2.517ArgGlu: 2.517 ± 0.581
1.38ArgPhe: 1.38 ± 0.33
1.949ArgGly: 1.949 ± 0.37
0.65ArgHis: 0.65 ± 0.246
3.004ArgIle: 3.004 ± 0.534
3.898ArgLys: 3.898 ± 0.616
4.71ArgLeu: 4.71 ± 0.656
0.568ArgMet: 0.568 ± 0.226
1.949ArgAsn: 1.949 ± 0.492
0.65ArgPro: 0.65 ± 0.177
1.624ArgGln: 1.624 ± 0.34
1.462ArgArg: 1.462 ± 0.395
2.03ArgSer: 2.03 ± 0.513
2.517ArgThr: 2.517 ± 0.54
2.274ArgVal: 2.274 ± 0.444
0.487ArgTrp: 0.487 ± 0.183
1.705ArgTyr: 1.705 ± 0.353
0.0ArgXaa: 0.0 ± 0.0
Ser
5.847SerAla: 5.847 ± 0.881
0.162SerCys: 0.162 ± 0.119
4.466SerAsp: 4.466 ± 0.594
3.816SerGlu: 3.816 ± 0.6
2.842SerPhe: 2.842 ± 0.461
6.577SerGly: 6.577 ± 0.841
0.812SerHis: 0.812 ± 0.332
4.953SerIle: 4.953 ± 0.74
5.197SerLys: 5.197 ± 0.729
5.035SerLeu: 5.035 ± 0.622
1.624SerMet: 1.624 ± 0.406
5.928SerAsn: 5.928 ± 0.722
1.949SerPro: 1.949 ± 0.348
2.436SerGln: 2.436 ± 0.564
2.842SerArg: 2.842 ± 0.513
6.902SerSer: 6.902 ± 1.206
7.877SerThr: 7.877 ± 1.566
3.979SerVal: 3.979 ± 0.818
0.568SerTrp: 0.568 ± 0.179
2.355SerTyr: 2.355 ± 0.375
0.0SerXaa: 0.0 ± 0.0
Thr
6.334ThrAla: 6.334 ± 0.946
0.568ThrCys: 0.568 ± 0.242
4.304ThrAsp: 4.304 ± 0.689
4.629ThrGlu: 4.629 ± 0.62
2.842ThrPhe: 2.842 ± 0.559
6.496ThrGly: 6.496 ± 1.053
1.137ThrHis: 1.137 ± 0.228
5.603ThrIle: 5.603 ± 1.038
4.385ThrLys: 4.385 ± 0.409
5.847ThrLeu: 5.847 ± 0.693
1.056ThrMet: 1.056 ± 0.289
5.928ThrAsn: 5.928 ± 1.172
2.274ThrPro: 2.274 ± 0.386
2.517ThrGln: 2.517 ± 0.366
1.543ThrArg: 1.543 ± 0.421
6.415ThrSer: 6.415 ± 1.012
5.197ThrThr: 5.197 ± 0.986
4.466ThrVal: 4.466 ± 1.381
0.812ThrTrp: 0.812 ± 0.232
3.492ThrTyr: 3.492 ± 0.514
0.0ThrXaa: 0.0 ± 0.0
Val
5.928ValAla: 5.928 ± 1.493
0.731ValCys: 0.731 ± 0.26
2.598ValAsp: 2.598 ± 0.519
3.654ValGlu: 3.654 ± 0.732
1.705ValPhe: 1.705 ± 0.372
3.248ValGly: 3.248 ± 0.455
1.218ValHis: 1.218 ± 0.286
2.923ValIle: 2.923 ± 0.449
5.603ValLys: 5.603 ± 0.824
4.222ValLeu: 4.222 ± 0.515
1.056ValMet: 1.056 ± 0.24
3.41ValAsn: 3.41 ± 0.886
1.218ValPro: 1.218 ± 0.269
3.654ValGln: 3.654 ± 0.495
1.786ValArg: 1.786 ± 0.366
5.035ValSer: 5.035 ± 0.841
3.898ValThr: 3.898 ± 0.906
3.167ValVal: 3.167 ± 0.552
0.731ValTrp: 0.731 ± 0.193
2.03ValTyr: 2.03 ± 0.47
0.0ValXaa: 0.0 ± 0.0
Trp
0.487TrpAla: 0.487 ± 0.231
0.162TrpCys: 0.162 ± 0.114
0.244TrpAsp: 0.244 ± 0.114
0.731TrpGlu: 0.731 ± 0.238
0.406TrpPhe: 0.406 ± 0.215
0.568TrpGly: 0.568 ± 0.236
0.244TrpHis: 0.244 ± 0.144
0.731TrpIle: 0.731 ± 0.25
0.974TrpLys: 0.974 ± 0.247
0.974TrpLeu: 0.974 ± 0.316
0.0TrpMet: 0.0 ± 0.0
0.731TrpAsn: 0.731 ± 0.272
0.0TrpPro: 0.0 ± 0.0
0.325TrpGln: 0.325 ± 0.172
0.244TrpArg: 0.244 ± 0.125
1.056TrpSer: 1.056 ± 0.35
1.218TrpThr: 1.218 ± 0.454
0.65TrpVal: 0.65 ± 0.309
0.325TrpTrp: 0.325 ± 0.182
0.487TrpTyr: 0.487 ± 0.254
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.274TyrAla: 2.274 ± 0.424
0.487TyrCys: 0.487 ± 0.216
2.111TyrAsp: 2.111 ± 0.519
1.786TyrGlu: 1.786 ± 0.368
1.056TyrPhe: 1.056 ± 0.294
1.786TyrGly: 1.786 ± 0.447
0.812TyrHis: 0.812 ± 0.282
2.517TyrIle: 2.517 ± 0.352
3.167TyrLys: 3.167 ± 0.903
3.573TyrLeu: 3.573 ± 0.458
0.893TyrMet: 0.893 ± 0.231
3.248TyrAsn: 3.248 ± 0.484
1.137TyrPro: 1.137 ± 0.36
1.056TyrGln: 1.056 ± 0.23
1.218TyrArg: 1.218 ± 0.297
3.086TyrSer: 3.086 ± 0.592
2.274TyrThr: 2.274 ± 0.529
2.111TyrVal: 2.111 ± 0.389
0.406TyrTrp: 0.406 ± 0.196
1.624TyrTyr: 1.624 ± 0.343
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 45 proteins (12316 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski