Amino acid dipepetide frequency for Podoviridae sp. ctQNx1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.265AlaAla: 11.265 ± 1.41
0.593AlaCys: 0.593 ± 0.205
4.659AlaAsp: 4.659 ± 0.632
7.03AlaGlu: 7.03 ± 0.87
3.303AlaPhe: 3.303 ± 0.502
7.793AlaGly: 7.793 ± 1.139
1.863AlaHis: 1.863 ± 0.506
6.437AlaIle: 6.437 ± 0.828
4.828AlaLys: 4.828 ± 0.566
8.47AlaLeu: 8.47 ± 0.935
3.134AlaMet: 3.134 ± 0.43
4.405AlaAsn: 4.405 ± 0.628
2.965AlaPro: 2.965 ± 0.52
5.082AlaGln: 5.082 ± 0.982
4.997AlaArg: 4.997 ± 0.538
5.082AlaSer: 5.082 ± 0.891
6.183AlaThr: 6.183 ± 0.875
4.913AlaVal: 4.913 ± 0.656
0.847AlaTrp: 0.847 ± 0.277
2.88AlaTyr: 2.88 ± 0.481
0.0AlaXaa: 0.0 ± 0.0
Cys
0.932CysAla: 0.932 ± 0.273
0.847CysCys: 0.847 ± 0.319
0.678CysAsp: 0.678 ± 0.326
0.762CysGlu: 0.762 ± 0.289
0.508CysPhe: 0.508 ± 0.196
1.44CysGly: 1.44 ± 0.326
0.762CysHis: 0.762 ± 0.223
0.508CysIle: 0.508 ± 0.216
1.101CysLys: 1.101 ± 0.35
1.101CysLeu: 1.101 ± 0.385
0.424CysMet: 0.424 ± 0.219
0.678CysAsn: 0.678 ± 0.275
0.678CysPro: 0.678 ± 0.29
0.508CysGln: 0.508 ± 0.193
0.762CysArg: 0.762 ± 0.273
1.271CysSer: 1.271 ± 0.342
0.847CysThr: 0.847 ± 0.301
0.762CysVal: 0.762 ± 0.281
0.424CysTrp: 0.424 ± 0.199
1.101CysTyr: 1.101 ± 0.392
0.0CysXaa: 0.0 ± 0.0
Asp
3.219AspAla: 3.219 ± 0.556
0.847AspCys: 0.847 ± 0.275
3.219AspAsp: 3.219 ± 0.39
4.066AspGlu: 4.066 ± 0.626
2.033AspPhe: 2.033 ± 0.415
3.981AspGly: 3.981 ± 0.842
1.016AspHis: 1.016 ± 0.3
3.219AspIle: 3.219 ± 0.476
2.033AspLys: 2.033 ± 0.387
3.558AspLeu: 3.558 ± 0.532
1.609AspMet: 1.609 ± 0.315
1.44AspAsn: 1.44 ± 0.424
2.795AspPro: 2.795 ± 0.574
1.609AspGln: 1.609 ± 0.451
3.049AspArg: 3.049 ± 0.453
3.219AspSer: 3.219 ± 0.41
2.541AspThr: 2.541 ± 0.405
2.456AspVal: 2.456 ± 0.52
0.678AspTrp: 0.678 ± 0.235
1.948AspTyr: 1.948 ± 0.358
0.0AspXaa: 0.0 ± 0.0
Glu
5.252GluAla: 5.252 ± 0.68
1.016GluCys: 1.016 ± 0.381
2.287GluAsp: 2.287 ± 0.405
4.489GluGlu: 4.489 ± 0.849
2.033GluPhe: 2.033 ± 0.434
3.727GluGly: 3.727 ± 0.733
2.372GluHis: 2.372 ± 0.464
3.388GluIle: 3.388 ± 0.479
4.743GluLys: 4.743 ± 0.75
5.252GluLeu: 5.252 ± 0.674
2.456GluMet: 2.456 ± 0.498
3.134GluAsn: 3.134 ± 0.439
2.202GluPro: 2.202 ± 0.422
4.405GluGln: 4.405 ± 0.868
4.743GluArg: 4.743 ± 0.665
2.287GluSer: 2.287 ± 0.433
3.134GluThr: 3.134 ± 0.431
4.743GluVal: 4.743 ± 0.686
1.016GluTrp: 1.016 ± 0.306
2.202GluTyr: 2.202 ± 0.302
0.0GluXaa: 0.0 ± 0.0
Phe
3.388PheAla: 3.388 ± 0.391
0.593PheCys: 0.593 ± 0.228
2.372PheAsp: 2.372 ± 0.428
1.863PheGlu: 1.863 ± 0.485
1.186PhePhe: 1.186 ± 0.351
2.71PheGly: 2.71 ± 0.421
0.847PheHis: 0.847 ± 0.272
1.694PheIle: 1.694 ± 0.35
1.694PheLys: 1.694 ± 0.374
3.134PheLeu: 3.134 ± 0.454
0.847PheMet: 0.847 ± 0.269
2.202PheAsn: 2.202 ± 0.439
1.779PhePro: 1.779 ± 0.455
1.271PheGln: 1.271 ± 0.329
1.355PheArg: 1.355 ± 0.298
1.779PheSer: 1.779 ± 0.513
1.948PheThr: 1.948 ± 0.354
2.118PheVal: 2.118 ± 0.451
0.424PheTrp: 0.424 ± 0.173
1.609PheTyr: 1.609 ± 0.389
0.0PheXaa: 0.0 ± 0.0
Gly
6.522GlyAla: 6.522 ± 0.842
0.932GlyCys: 0.932 ± 0.34
3.473GlyAsp: 3.473 ± 0.502
4.828GlyGlu: 4.828 ± 0.61
1.779GlyPhe: 1.779 ± 0.347
6.776GlyGly: 6.776 ± 0.993
1.355GlyHis: 1.355 ± 0.303
5.167GlyIle: 5.167 ± 0.789
4.32GlyLys: 4.32 ± 0.538
5.506GlyLeu: 5.506 ± 0.567
3.727GlyMet: 3.727 ± 0.503
3.727GlyAsn: 3.727 ± 0.514
1.186GlyPro: 1.186 ± 0.297
2.71GlyGln: 2.71 ± 0.579
4.066GlyArg: 4.066 ± 0.733
3.896GlySer: 3.896 ± 0.615
4.997GlyThr: 4.997 ± 0.872
4.32GlyVal: 4.32 ± 0.552
1.186GlyTrp: 1.186 ± 0.285
2.287GlyTyr: 2.287 ± 0.483
0.0GlyXaa: 0.0 ± 0.0
His
2.202HisAla: 2.202 ± 0.455
0.762HisCys: 0.762 ± 0.244
0.678HisAsp: 0.678 ± 0.204
1.525HisGlu: 1.525 ± 0.312
0.593HisPhe: 0.593 ± 0.191
1.694HisGly: 1.694 ± 0.453
0.847HisHis: 0.847 ± 0.285
1.694HisIle: 1.694 ± 0.338
1.355HisLys: 1.355 ± 0.411
1.863HisLeu: 1.863 ± 0.375
0.847HisMet: 0.847 ± 0.285
0.508HisAsn: 0.508 ± 0.212
1.186HisPro: 1.186 ± 0.345
0.762HisGln: 0.762 ± 0.239
1.609HisArg: 1.609 ± 0.373
1.271HisSer: 1.271 ± 0.275
0.847HisThr: 0.847 ± 0.282
1.609HisVal: 1.609 ± 0.444
0.508HisTrp: 0.508 ± 0.216
0.762HisTyr: 0.762 ± 0.207
0.0HisXaa: 0.0 ± 0.0
Ile
5.59IleAla: 5.59 ± 0.767
1.016IleCys: 1.016 ± 0.315
3.896IleAsp: 3.896 ± 0.596
4.235IleGlu: 4.235 ± 0.621
2.287IlePhe: 2.287 ± 0.352
3.981IleGly: 3.981 ± 0.671
1.186IleHis: 1.186 ± 0.305
2.965IleIle: 2.965 ± 0.48
3.134IleLys: 3.134 ± 0.516
3.981IleLeu: 3.981 ± 0.479
1.186IleMet: 1.186 ± 0.303
2.965IleAsn: 2.965 ± 0.401
3.303IlePro: 3.303 ± 0.452
2.456IleGln: 2.456 ± 0.405
3.812IleArg: 3.812 ± 0.6
2.965IleSer: 2.965 ± 0.727
3.388IleThr: 3.388 ± 0.555
3.049IleVal: 3.049 ± 0.579
0.254IleTrp: 0.254 ± 0.15
1.525IleTyr: 1.525 ± 0.327
0.0IleXaa: 0.0 ± 0.0
Lys
3.896LysAla: 3.896 ± 0.792
0.508LysCys: 0.508 ± 0.188
2.456LysAsp: 2.456 ± 0.584
4.235LysGlu: 4.235 ± 0.706
1.863LysPhe: 1.863 ± 0.511
2.795LysGly: 2.795 ± 0.432
1.101LysHis: 1.101 ± 0.316
2.965LysIle: 2.965 ± 0.623
3.558LysLys: 3.558 ± 0.616
5.421LysLeu: 5.421 ± 0.686
2.118LysMet: 2.118 ± 0.468
3.049LysAsn: 3.049 ± 0.551
2.202LysPro: 2.202 ± 0.462
2.626LysGln: 2.626 ± 0.64
2.965LysArg: 2.965 ± 0.566
2.372LysSer: 2.372 ± 0.433
2.965LysThr: 2.965 ± 0.497
4.235LysVal: 4.235 ± 0.55
0.678LysTrp: 0.678 ± 0.265
2.71LysTyr: 2.71 ± 0.386
0.0LysXaa: 0.0 ± 0.0
Leu
7.369LeuAla: 7.369 ± 0.788
1.779LeuCys: 1.779 ± 0.396
4.32LeuAsp: 4.32 ± 0.627
5.844LeuGlu: 5.844 ± 0.658
2.626LeuPhe: 2.626 ± 0.606
4.997LeuGly: 4.997 ± 0.671
1.779LeuHis: 1.779 ± 0.439
3.727LeuIle: 3.727 ± 0.558
3.134LeuLys: 3.134 ± 0.589
5.421LeuLeu: 5.421 ± 0.635
2.202LeuMet: 2.202 ± 0.47
3.812LeuAsn: 3.812 ± 0.558
3.134LeuPro: 3.134 ± 0.553
3.134LeuGln: 3.134 ± 0.621
4.659LeuArg: 4.659 ± 0.735
5.844LeuSer: 5.844 ± 0.561
6.607LeuThr: 6.607 ± 0.708
4.15LeuVal: 4.15 ± 0.508
1.016LeuTrp: 1.016 ± 0.281
2.033LeuTyr: 2.033 ± 0.328
0.0LeuXaa: 0.0 ± 0.0
Met
4.405MetAla: 4.405 ± 0.518
0.508MetCys: 0.508 ± 0.217
1.948MetAsp: 1.948 ± 0.408
2.118MetGlu: 2.118 ± 0.4
0.847MetPhe: 0.847 ± 0.271
1.948MetGly: 1.948 ± 0.372
0.762MetHis: 0.762 ± 0.23
1.948MetIle: 1.948 ± 0.397
2.202MetLys: 2.202 ± 0.553
2.541MetLeu: 2.541 ± 0.433
1.271MetMet: 1.271 ± 0.34
1.016MetAsn: 1.016 ± 0.257
1.101MetPro: 1.101 ± 0.266
1.779MetGln: 1.779 ± 0.415
1.186MetArg: 1.186 ± 0.331
2.033MetSer: 2.033 ± 0.366
1.609MetThr: 1.609 ± 0.3
1.525MetVal: 1.525 ± 0.328
0.339MetTrp: 0.339 ± 0.169
0.508MetTyr: 0.508 ± 0.202
0.0MetXaa: 0.0 ± 0.0
Asn
4.489AsnAla: 4.489 ± 0.538
1.186AsnCys: 1.186 ± 0.355
1.948AsnAsp: 1.948 ± 0.429
3.388AsnGlu: 3.388 ± 0.546
1.101AsnPhe: 1.101 ± 0.309
3.981AsnGly: 3.981 ± 0.646
1.271AsnHis: 1.271 ± 0.342
3.219AsnIle: 3.219 ± 0.636
2.456AsnLys: 2.456 ± 0.408
3.558AsnLeu: 3.558 ± 0.554
0.847AsnMet: 0.847 ± 0.254
2.287AsnAsn: 2.287 ± 0.538
2.456AsnPro: 2.456 ± 0.567
2.118AsnGln: 2.118 ± 0.506
2.71AsnArg: 2.71 ± 0.578
3.049AsnSer: 3.049 ± 0.623
1.779AsnThr: 1.779 ± 0.362
2.965AsnVal: 2.965 ± 0.617
0.678AsnTrp: 0.678 ± 0.252
1.948AsnTyr: 1.948 ± 0.371
0.0AsnXaa: 0.0 ± 0.0
Pro
3.558ProAla: 3.558 ± 0.587
0.932ProCys: 0.932 ± 0.315
2.88ProAsp: 2.88 ± 0.554
2.626ProGlu: 2.626 ± 0.452
2.033ProPhe: 2.033 ± 0.334
2.372ProGly: 2.372 ± 0.411
1.016ProHis: 1.016 ± 0.265
2.033ProIle: 2.033 ± 0.424
2.71ProLys: 2.71 ± 0.481
3.219ProLeu: 3.219 ± 0.591
1.186ProMet: 1.186 ± 0.257
1.694ProAsn: 1.694 ± 0.344
1.779ProPro: 1.779 ± 0.412
1.948ProGln: 1.948 ± 0.349
1.863ProArg: 1.863 ± 0.315
3.134ProSer: 3.134 ± 0.476
2.202ProThr: 2.202 ± 0.59
3.134ProVal: 3.134 ± 0.544
0.593ProTrp: 0.593 ± 0.241
1.355ProTyr: 1.355 ± 0.383
0.0ProXaa: 0.0 ± 0.0
Gln
5.675GlnAla: 5.675 ± 0.955
0.762GlnCys: 0.762 ± 0.289
1.016GlnAsp: 1.016 ± 0.308
3.727GlnGlu: 3.727 ± 0.683
1.948GlnPhe: 1.948 ± 0.314
2.626GlnGly: 2.626 ± 0.448
0.847GlnHis: 0.847 ± 0.285
2.372GlnIle: 2.372 ± 0.426
2.541GlnLys: 2.541 ± 0.351
3.473GlnLeu: 3.473 ± 0.698
1.525GlnMet: 1.525 ± 0.349
2.202GlnAsn: 2.202 ± 0.567
2.118GlnPro: 2.118 ± 0.668
4.997GlnGln: 4.997 ± 1.672
3.473GlnArg: 3.473 ± 0.663
2.795GlnSer: 2.795 ± 0.576
2.033GlnThr: 2.033 ± 0.434
3.049GlnVal: 3.049 ± 0.572
0.847GlnTrp: 0.847 ± 0.266
1.525GlnTyr: 1.525 ± 0.32
0.0GlnXaa: 0.0 ± 0.0
Arg
4.913ArgAla: 4.913 ± 0.495
0.847ArgCys: 0.847 ± 0.261
3.049ArgAsp: 3.049 ± 0.455
3.473ArgGlu: 3.473 ± 0.596
2.118ArgPhe: 2.118 ± 0.45
4.574ArgGly: 4.574 ± 0.698
1.016ArgHis: 1.016 ± 0.283
3.558ArgIle: 3.558 ± 0.554
3.981ArgLys: 3.981 ± 0.721
3.981ArgLeu: 3.981 ± 0.567
2.541ArgMet: 2.541 ± 0.516
2.456ArgAsn: 2.456 ± 0.459
1.863ArgPro: 1.863 ± 0.382
3.049ArgGln: 3.049 ± 0.564
3.727ArgArg: 3.727 ± 0.659
3.134ArgSer: 3.134 ± 0.428
2.626ArgThr: 2.626 ± 0.512
3.473ArgVal: 3.473 ± 0.529
1.355ArgTrp: 1.355 ± 0.381
2.202ArgTyr: 2.202 ± 0.503
0.0ArgXaa: 0.0 ± 0.0
Ser
7.2SerAla: 7.2 ± 1.179
0.678SerCys: 0.678 ± 0.208
2.118SerAsp: 2.118 ± 0.427
2.71SerGlu: 2.71 ± 0.475
2.541SerPhe: 2.541 ± 0.515
4.743SerGly: 4.743 ± 0.767
0.932SerHis: 0.932 ± 0.334
2.372SerIle: 2.372 ± 0.452
3.134SerLys: 3.134 ± 0.581
4.828SerLeu: 4.828 ± 0.657
1.694SerMet: 1.694 ± 0.463
2.626SerAsn: 2.626 ± 0.557
2.965SerPro: 2.965 ± 0.477
3.134SerGln: 3.134 ± 0.539
4.15SerArg: 4.15 ± 0.544
2.71SerSer: 2.71 ± 0.631
3.219SerThr: 3.219 ± 0.513
3.642SerVal: 3.642 ± 0.532
0.678SerTrp: 0.678 ± 0.266
1.355SerTyr: 1.355 ± 0.394
0.0SerXaa: 0.0 ± 0.0
Thr
5.844ThrAla: 5.844 ± 0.694
0.508ThrCys: 0.508 ± 0.193
2.118ThrAsp: 2.118 ± 0.387
2.626ThrGlu: 2.626 ± 0.513
1.525ThrPhe: 1.525 ± 0.446
5.082ThrGly: 5.082 ± 0.575
1.101ThrHis: 1.101 ± 0.384
3.219ThrIle: 3.219 ± 0.381
3.049ThrLys: 3.049 ± 0.546
4.743ThrLeu: 4.743 ± 0.572
1.609ThrMet: 1.609 ± 0.264
2.71ThrAsn: 2.71 ± 0.466
3.388ThrPro: 3.388 ± 0.463
2.456ThrGln: 2.456 ± 0.513
2.456ThrArg: 2.456 ± 0.366
3.303ThrSer: 3.303 ± 0.566
2.88ThrThr: 2.88 ± 0.518
4.659ThrVal: 4.659 ± 1.121
1.186ThrTrp: 1.186 ± 0.344
1.863ThrTyr: 1.863 ± 0.349
0.0ThrXaa: 0.0 ± 0.0
Val
5.844ValAla: 5.844 ± 0.839
0.932ValCys: 0.932 ± 0.325
2.965ValAsp: 2.965 ± 0.532
2.626ValGlu: 2.626 ± 0.441
2.202ValPhe: 2.202 ± 0.422
4.066ValGly: 4.066 ± 0.532
1.694ValHis: 1.694 ± 0.387
4.15ValIle: 4.15 ± 0.596
2.626ValLys: 2.626 ± 0.442
3.812ValLeu: 3.812 ± 0.573
1.355ValMet: 1.355 ± 0.399
3.896ValAsn: 3.896 ± 0.612
2.541ValPro: 2.541 ± 0.389
3.134ValGln: 3.134 ± 0.679
3.134ValArg: 3.134 ± 0.541
4.489ValSer: 4.489 ± 0.732
4.235ValThr: 4.235 ± 0.666
5.421ValVal: 5.421 ± 0.858
1.186ValTrp: 1.186 ± 0.34
2.287ValTyr: 2.287 ± 0.463
0.0ValXaa: 0.0 ± 0.0
Trp
1.44TrpAla: 1.44 ± 0.408
0.169TrpCys: 0.169 ± 0.132
0.508TrpAsp: 0.508 ± 0.179
0.678TrpGlu: 0.678 ± 0.232
1.016TrpPhe: 1.016 ± 0.301
0.339TrpGly: 0.339 ± 0.148
0.339TrpHis: 0.339 ± 0.18
0.593TrpIle: 0.593 ± 0.202
0.762TrpLys: 0.762 ± 0.26
1.694TrpLeu: 1.694 ± 0.333
0.508TrpMet: 0.508 ± 0.178
0.593TrpAsn: 0.593 ± 0.184
0.678TrpPro: 0.678 ± 0.281
0.593TrpGln: 0.593 ± 0.224
1.355TrpArg: 1.355 ± 0.363
0.847TrpSer: 0.847 ± 0.368
0.339TrpThr: 0.339 ± 0.174
0.847TrpVal: 0.847 ± 0.241
0.508TrpTrp: 0.508 ± 0.215
0.678TrpTyr: 0.678 ± 0.236
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.727TyrAla: 3.727 ± 0.546
0.678TyrCys: 0.678 ± 0.277
1.948TyrAsp: 1.948 ± 0.427
2.033TyrGlu: 2.033 ± 0.393
1.271TyrPhe: 1.271 ± 0.34
3.134TyrGly: 3.134 ± 0.56
1.016TyrHis: 1.016 ± 0.34
1.948TyrIle: 1.948 ± 0.327
1.355TyrLys: 1.355 ± 0.351
2.033TyrLeu: 2.033 ± 0.487
0.424TyrMet: 0.424 ± 0.176
2.033TyrAsn: 2.033 ± 0.462
1.948TyrPro: 1.948 ± 0.333
1.779TyrGln: 1.779 ± 0.388
1.863TyrArg: 1.863 ± 0.375
1.948TyrSer: 1.948 ± 0.455
1.948TyrThr: 1.948 ± 0.363
1.609TyrVal: 1.609 ± 0.369
0.085TyrTrp: 0.085 ± 0.075
1.186TyrTyr: 1.186 ± 0.317
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (11807 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski