Amino acid dipepetide frequency for Podoviridae sp. ctKoA10

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.385AlaAla: 12.385 ± 1.99
0.488AlaCys: 0.488 ± 0.227
5.851AlaAsp: 5.851 ± 0.649
6.729AlaGlu: 6.729 ± 0.827
3.121AlaPhe: 3.121 ± 0.468
7.217AlaGly: 7.217 ± 1.02
1.17AlaHis: 1.17 ± 0.441
4.779AlaIle: 4.779 ± 0.699
5.851AlaLys: 5.851 ± 0.921
8.192AlaLeu: 8.192 ± 0.979
3.316AlaMet: 3.316 ± 0.596
4.876AlaAsn: 4.876 ± 0.952
4.876AlaPro: 4.876 ± 0.672
5.071AlaGln: 5.071 ± 0.745
3.803AlaArg: 3.803 ± 0.521
6.144AlaSer: 6.144 ± 0.802
6.827AlaThr: 6.827 ± 0.949
6.924AlaVal: 6.924 ± 0.959
1.073AlaTrp: 1.073 ± 0.317
2.536AlaTyr: 2.536 ± 0.65
0.0AlaXaa: 0.0 ± 0.0
Cys
0.878CysAla: 0.878 ± 0.369
0.39CysCys: 0.39 ± 0.21
0.195CysAsp: 0.195 ± 0.143
0.975CysGlu: 0.975 ± 0.356
0.39CysPhe: 0.39 ± 0.232
0.878CysGly: 0.878 ± 0.374
0.098CysHis: 0.098 ± 0.094
0.78CysIle: 0.78 ± 0.239
0.39CysLys: 0.39 ± 0.232
1.365CysLeu: 1.365 ± 0.388
0.293CysMet: 0.293 ± 0.188
0.293CysAsn: 0.293 ± 0.162
0.683CysPro: 0.683 ± 0.278
0.683CysGln: 0.683 ± 0.253
0.488CysArg: 0.488 ± 0.286
0.585CysSer: 0.585 ± 0.337
0.39CysThr: 0.39 ± 0.165
0.585CysVal: 0.585 ± 0.205
0.098CysTrp: 0.098 ± 0.086
0.683CysTyr: 0.683 ± 0.271
0.0CysXaa: 0.0 ± 0.0
Asp
5.071AspAla: 5.071 ± 0.542
0.78AspCys: 0.78 ± 0.352
3.901AspAsp: 3.901 ± 0.723
4.193AspGlu: 4.193 ± 0.748
2.438AspPhe: 2.438 ± 0.491
4.779AspGly: 4.779 ± 0.564
0.975AspHis: 0.975 ± 0.299
3.121AspIle: 3.121 ± 0.481
3.413AspLys: 3.413 ± 0.605
6.046AspLeu: 6.046 ± 0.608
1.658AspMet: 1.658 ± 0.332
2.731AspAsn: 2.731 ± 0.502
1.755AspPro: 1.755 ± 0.453
2.341AspGln: 2.341 ± 0.444
1.17AspArg: 1.17 ± 0.318
4.681AspSer: 4.681 ± 0.753
3.023AspThr: 3.023 ± 0.606
3.023AspVal: 3.023 ± 0.544
1.073AspTrp: 1.073 ± 0.338
1.755AspTyr: 1.755 ± 0.536
0.0AspXaa: 0.0 ± 0.0
Glu
4.779GluAla: 4.779 ± 0.577
1.073GluCys: 1.073 ± 0.378
2.048GluAsp: 2.048 ± 0.529
2.633GluGlu: 2.633 ± 0.748
2.048GluPhe: 2.048 ± 0.507
2.926GluGly: 2.926 ± 0.598
0.683GluHis: 0.683 ± 0.257
3.316GluIle: 3.316 ± 0.564
3.608GluLys: 3.608 ± 0.705
7.217GluLeu: 7.217 ± 1.186
1.658GluMet: 1.658 ± 0.521
2.243GluAsn: 2.243 ± 0.446
1.56GluPro: 1.56 ± 0.598
3.901GluGln: 3.901 ± 0.818
3.511GluArg: 3.511 ± 0.666
4.876GluSer: 4.876 ± 0.542
2.926GluThr: 2.926 ± 0.504
2.828GluVal: 2.828 ± 0.439
0.975GluTrp: 0.975 ± 0.314
3.218GluTyr: 3.218 ± 0.568
0.0GluXaa: 0.0 ± 0.0
Phe
3.998PheAla: 3.998 ± 0.777
0.39PheCys: 0.39 ± 0.184
3.023PheAsp: 3.023 ± 0.411
2.341PheGlu: 2.341 ± 0.491
0.878PhePhe: 0.878 ± 0.237
3.901PheGly: 3.901 ± 0.73
0.293PheHis: 0.293 ± 0.152
1.853PheIle: 1.853 ± 0.467
2.926PheLys: 2.926 ± 0.641
2.341PheLeu: 2.341 ± 0.471
0.78PheMet: 0.78 ± 0.303
1.268PheAsn: 1.268 ± 0.296
0.585PhePro: 0.585 ± 0.191
1.365PheGln: 1.365 ± 0.425
1.17PheArg: 1.17 ± 0.394
2.536PheSer: 2.536 ± 0.561
2.731PheThr: 2.731 ± 0.49
2.438PheVal: 2.438 ± 0.557
0.683PheTrp: 0.683 ± 0.251
1.365PheTyr: 1.365 ± 0.408
0.0PheXaa: 0.0 ± 0.0
Gly
8.289GlyAla: 8.289 ± 0.861
0.78GlyCys: 0.78 ± 0.295
4.096GlyAsp: 4.096 ± 0.602
3.511GlyGlu: 3.511 ± 0.616
3.218GlyPhe: 3.218 ± 0.457
8.387GlyGly: 8.387 ± 1.548
0.975GlyHis: 0.975 ± 0.344
2.828GlyIle: 2.828 ± 0.409
4.291GlyLys: 4.291 ± 0.642
5.754GlyLeu: 5.754 ± 0.87
1.56GlyMet: 1.56 ± 0.473
2.926GlyAsn: 2.926 ± 0.603
1.073GlyPro: 1.073 ± 0.329
2.828GlyGln: 2.828 ± 0.545
3.413GlyArg: 3.413 ± 0.764
4.584GlySer: 4.584 ± 0.76
6.046GlyThr: 6.046 ± 1.176
6.534GlyVal: 6.534 ± 0.865
0.878GlyTrp: 0.878 ± 0.275
3.608GlyTyr: 3.608 ± 0.745
0.0GlyXaa: 0.0 ± 0.0
His
1.755HisAla: 1.755 ± 0.681
0.195HisCys: 0.195 ± 0.188
1.658HisAsp: 1.658 ± 0.416
0.585HisGlu: 0.585 ± 0.275
0.585HisPhe: 0.585 ± 0.23
0.488HisGly: 0.488 ± 0.252
0.195HisHis: 0.195 ± 0.14
1.17HisIle: 1.17 ± 0.411
0.878HisLys: 0.878 ± 0.316
1.268HisLeu: 1.268 ± 0.46
0.098HisMet: 0.098 ± 0.094
0.975HisAsn: 0.975 ± 0.332
0.683HisPro: 0.683 ± 0.258
0.195HisGln: 0.195 ± 0.122
0.585HisArg: 0.585 ± 0.23
0.975HisSer: 0.975 ± 0.347
0.488HisThr: 0.488 ± 0.225
0.683HisVal: 0.683 ± 0.315
0.683HisTrp: 0.683 ± 0.22
0.683HisTyr: 0.683 ± 0.27
0.0HisXaa: 0.0 ± 0.0
Ile
7.119IleAla: 7.119 ± 0.67
0.585IleCys: 0.585 ± 0.223
3.803IleAsp: 3.803 ± 0.512
3.121IleGlu: 3.121 ± 0.587
1.268IlePhe: 1.268 ± 0.31
4.291IleGly: 4.291 ± 0.568
0.39IleHis: 0.39 ± 0.165
3.316IleIle: 3.316 ± 0.591
3.608IleLys: 3.608 ± 0.683
3.998IleLeu: 3.998 ± 0.503
0.878IleMet: 0.878 ± 0.289
3.218IleAsn: 3.218 ± 0.629
2.243IlePro: 2.243 ± 0.489
2.438IleGln: 2.438 ± 0.528
1.268IleArg: 1.268 ± 0.395
4.291IleSer: 4.291 ± 0.714
4.193IleThr: 4.193 ± 0.602
3.706IleVal: 3.706 ± 0.548
0.39IleTrp: 0.39 ± 0.193
2.048IleTyr: 2.048 ± 0.537
0.0IleXaa: 0.0 ± 0.0
Lys
7.119LysAla: 7.119 ± 1.04
0.878LysCys: 0.878 ± 0.341
2.828LysAsp: 2.828 ± 0.545
3.218LysGlu: 3.218 ± 0.63
2.828LysPhe: 2.828 ± 0.43
3.413LysGly: 3.413 ± 0.519
1.073LysHis: 1.073 ± 0.401
3.218LysIle: 3.218 ± 0.533
3.511LysLys: 3.511 ± 0.765
3.998LysLeu: 3.998 ± 0.62
1.755LysMet: 1.755 ± 0.468
1.17LysAsn: 1.17 ± 0.384
2.536LysPro: 2.536 ± 0.526
4.291LysGln: 4.291 ± 0.883
3.218LysArg: 3.218 ± 0.561
3.121LysSer: 3.121 ± 0.593
4.486LysThr: 4.486 ± 0.801
2.828LysVal: 2.828 ± 0.529
1.365LysTrp: 1.365 ± 0.415
2.048LysTyr: 2.048 ± 0.501
0.0LysXaa: 0.0 ± 0.0
Leu
7.802LeuAla: 7.802 ± 0.703
0.878LeuCys: 0.878 ± 0.308
4.584LeuAsp: 4.584 ± 0.665
4.096LeuGlu: 4.096 ± 0.678
2.926LeuPhe: 2.926 ± 0.565
5.461LeuGly: 5.461 ± 0.628
1.365LeuHis: 1.365 ± 0.502
3.511LeuIle: 3.511 ± 0.79
4.193LeuLys: 4.193 ± 0.63
7.217LeuLeu: 7.217 ± 0.963
1.755LeuMet: 1.755 ± 0.405
4.193LeuAsn: 4.193 ± 0.528
4.096LeuPro: 4.096 ± 0.679
3.803LeuGln: 3.803 ± 0.529
4.096LeuArg: 4.096 ± 0.542
7.412LeuSer: 7.412 ± 1.342
6.437LeuThr: 6.437 ± 0.872
4.974LeuVal: 4.974 ± 0.595
0.683LeuTrp: 0.683 ± 0.273
1.853LeuTyr: 1.853 ± 0.397
0.0LeuXaa: 0.0 ± 0.0
Met
2.731MetAla: 2.731 ± 0.433
0.293MetCys: 0.293 ± 0.146
1.17MetAsp: 1.17 ± 0.317
1.073MetGlu: 1.073 ± 0.309
0.585MetPhe: 0.585 ± 0.21
0.878MetGly: 0.878 ± 0.329
0.098MetHis: 0.098 ± 0.096
2.048MetIle: 2.048 ± 0.579
1.755MetLys: 1.755 ± 0.582
1.853MetLeu: 1.853 ± 0.445
0.878MetMet: 0.878 ± 0.281
1.365MetAsn: 1.365 ± 0.365
1.17MetPro: 1.17 ± 0.486
1.658MetGln: 1.658 ± 0.417
1.073MetArg: 1.073 ± 0.32
2.048MetSer: 2.048 ± 0.45
1.853MetThr: 1.853 ± 0.428
1.56MetVal: 1.56 ± 0.423
0.39MetTrp: 0.39 ± 0.185
0.585MetTyr: 0.585 ± 0.234
0.0MetXaa: 0.0 ± 0.0
Asn
4.193AsnAla: 4.193 ± 0.705
0.488AsnCys: 0.488 ± 0.216
1.658AsnAsp: 1.658 ± 0.403
2.146AsnGlu: 2.146 ± 0.497
1.365AsnPhe: 1.365 ± 0.405
3.511AsnGly: 3.511 ± 0.892
0.488AsnHis: 0.488 ± 0.207
2.731AsnIle: 2.731 ± 0.571
2.828AsnLys: 2.828 ± 0.417
3.608AsnLeu: 3.608 ± 0.751
1.463AsnMet: 1.463 ± 0.404
1.56AsnAsn: 1.56 ± 0.595
2.146AsnPro: 2.146 ± 0.386
2.341AsnGln: 2.341 ± 0.571
2.243AsnArg: 2.243 ± 0.447
3.316AsnSer: 3.316 ± 0.465
2.243AsnThr: 2.243 ± 0.541
2.536AsnVal: 2.536 ± 0.475
0.585AsnTrp: 0.585 ± 0.204
2.341AsnTyr: 2.341 ± 0.464
0.0AsnXaa: 0.0 ± 0.0
Pro
3.803ProAla: 3.803 ± 0.598
0.195ProCys: 0.195 ± 0.149
3.023ProAsp: 3.023 ± 0.531
3.901ProGlu: 3.901 ± 0.721
1.853ProPhe: 1.853 ± 0.387
0.878ProGly: 0.878 ± 0.291
0.78ProHis: 0.78 ± 0.27
1.755ProIle: 1.755 ± 0.728
1.56ProLys: 1.56 ± 0.33
2.731ProLeu: 2.731 ± 0.49
0.78ProMet: 0.78 ± 0.263
1.268ProAsn: 1.268 ± 0.29
1.463ProPro: 1.463 ± 0.316
2.438ProGln: 2.438 ± 0.789
1.17ProArg: 1.17 ± 0.357
2.243ProSer: 2.243 ± 0.446
2.048ProThr: 2.048 ± 0.423
3.316ProVal: 3.316 ± 0.5
0.293ProTrp: 0.293 ± 0.172
0.683ProTyr: 0.683 ± 0.319
0.0ProXaa: 0.0 ± 0.0
Gln
4.681GlnAla: 4.681 ± 0.908
0.39GlnCys: 0.39 ± 0.174
1.95GlnAsp: 1.95 ± 0.411
2.926GlnGlu: 2.926 ± 0.47
2.243GlnPhe: 2.243 ± 0.531
3.121GlnGly: 3.121 ± 0.611
1.56GlnHis: 1.56 ± 0.481
2.926GlnIle: 2.926 ± 0.519
3.511GlnLys: 3.511 ± 0.652
4.779GlnLeu: 4.779 ± 0.895
1.463GlnMet: 1.463 ± 0.356
1.56GlnAsn: 1.56 ± 0.307
1.658GlnPro: 1.658 ± 0.52
5.949GlnGln: 5.949 ± 1.311
3.511GlnArg: 3.511 ± 0.522
2.438GlnSer: 2.438 ± 0.433
2.341GlnThr: 2.341 ± 0.445
3.023GlnVal: 3.023 ± 0.567
0.78GlnTrp: 0.78 ± 0.304
1.853GlnTyr: 1.853 ± 0.408
0.0GlnXaa: 0.0 ± 0.0
Arg
3.121ArgAla: 3.121 ± 0.545
0.683ArgCys: 0.683 ± 0.241
2.731ArgAsp: 2.731 ± 0.753
2.048ArgGlu: 2.048 ± 0.608
1.755ArgPhe: 1.755 ± 0.419
2.341ArgGly: 2.341 ± 0.605
0.585ArgHis: 0.585 ± 0.255
2.828ArgIle: 2.828 ± 0.484
3.316ArgLys: 3.316 ± 0.834
3.316ArgLeu: 3.316 ± 0.638
1.463ArgMet: 1.463 ± 0.352
2.536ArgAsn: 2.536 ± 0.514
1.365ArgPro: 1.365 ± 0.297
3.316ArgGln: 3.316 ± 0.688
1.853ArgArg: 1.853 ± 0.597
2.243ArgSer: 2.243 ± 0.373
2.146ArgThr: 2.146 ± 0.57
3.023ArgVal: 3.023 ± 0.461
0.683ArgTrp: 0.683 ± 0.279
1.463ArgTyr: 1.463 ± 0.366
0.0ArgXaa: 0.0 ± 0.0
Ser
7.314SerAla: 7.314 ± 0.832
0.585SerCys: 0.585 ± 0.238
5.071SerAsp: 5.071 ± 0.862
3.706SerGlu: 3.706 ± 0.672
2.341SerPhe: 2.341 ± 0.458
7.509SerGly: 7.509 ± 1.399
0.585SerHis: 0.585 ± 0.256
4.193SerIle: 4.193 ± 0.636
3.706SerLys: 3.706 ± 0.659
4.779SerLeu: 4.779 ± 0.848
1.365SerMet: 1.365 ± 0.316
3.608SerAsn: 3.608 ± 0.595
1.853SerPro: 1.853 ± 0.455
2.828SerGln: 2.828 ± 0.487
2.926SerArg: 2.926 ± 0.53
4.193SerSer: 4.193 ± 0.696
5.266SerThr: 5.266 ± 0.803
4.584SerVal: 4.584 ± 0.741
0.585SerTrp: 0.585 ± 0.252
1.853SerTyr: 1.853 ± 0.541
0.0SerXaa: 0.0 ± 0.0
Thr
6.729ThrAla: 6.729 ± 0.926
0.39ThrCys: 0.39 ± 0.227
3.023ThrAsp: 3.023 ± 0.448
3.608ThrGlu: 3.608 ± 0.776
1.95ThrPhe: 1.95 ± 0.444
7.217ThrGly: 7.217 ± 0.803
1.073ThrHis: 1.073 ± 0.376
3.608ThrIle: 3.608 ± 0.862
3.901ThrLys: 3.901 ± 0.555
4.584ThrLeu: 4.584 ± 0.857
1.56ThrMet: 1.56 ± 0.305
2.536ThrAsn: 2.536 ± 0.541
2.146ThrPro: 2.146 ± 0.627
2.341ThrGln: 2.341 ± 0.469
2.633ThrArg: 2.633 ± 0.457
3.511ThrSer: 3.511 ± 0.65
3.803ThrThr: 3.803 ± 0.788
5.364ThrVal: 5.364 ± 0.854
1.268ThrTrp: 1.268 ± 0.438
2.243ThrTyr: 2.243 ± 0.357
0.0ThrXaa: 0.0 ± 0.0
Val
5.656ValAla: 5.656 ± 0.648
1.17ValCys: 1.17 ± 0.368
4.096ValAsp: 4.096 ± 0.741
4.193ValGlu: 4.193 ± 0.71
2.633ValPhe: 2.633 ± 0.594
5.071ValGly: 5.071 ± 0.776
1.17ValHis: 1.17 ± 0.301
4.974ValIle: 4.974 ± 0.586
3.316ValLys: 3.316 ± 0.691
4.486ValLeu: 4.486 ± 0.725
1.17ValMet: 1.17 ± 0.278
3.218ValAsn: 3.218 ± 0.563
2.536ValPro: 2.536 ± 0.469
1.853ValGln: 1.853 ± 0.373
2.438ValArg: 2.438 ± 0.534
5.071ValSer: 5.071 ± 0.865
4.291ValThr: 4.291 ± 0.708
4.584ValVal: 4.584 ± 0.823
0.488ValTrp: 0.488 ± 0.225
2.926ValTyr: 2.926 ± 0.719
0.0ValXaa: 0.0 ± 0.0
Trp
0.683TrpAla: 0.683 ± 0.281
0.0TrpCys: 0.0 ± 0.0
1.073TrpAsp: 1.073 ± 0.29
0.488TrpGlu: 0.488 ± 0.215
0.683TrpPhe: 0.683 ± 0.238
0.878TrpGly: 0.878 ± 0.322
0.39TrpHis: 0.39 ± 0.209
0.78TrpIle: 0.78 ± 0.25
0.683TrpLys: 0.683 ± 0.238
1.853TrpLeu: 1.853 ± 0.411
0.195TrpMet: 0.195 ± 0.145
0.293TrpAsn: 0.293 ± 0.172
0.878TrpPro: 0.878 ± 0.322
1.17TrpGln: 1.17 ± 0.331
0.78TrpArg: 0.78 ± 0.297
1.073TrpSer: 1.073 ± 0.279
0.683TrpThr: 0.683 ± 0.243
0.488TrpVal: 0.488 ± 0.211
0.098TrpTrp: 0.098 ± 0.086
0.683TrpTyr: 0.683 ± 0.225
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.121TyrAla: 3.121 ± 0.491
0.488TyrCys: 0.488 ± 0.218
2.146TyrAsp: 2.146 ± 0.507
2.536TyrGlu: 2.536 ± 0.496
1.755TyrPhe: 1.755 ± 0.331
2.438TyrGly: 2.438 ± 0.466
0.878TyrHis: 0.878 ± 0.31
2.438TyrIle: 2.438 ± 0.754
1.658TyrLys: 1.658 ± 0.348
1.95TyrLeu: 1.95 ± 0.435
0.878TyrMet: 0.878 ± 0.276
1.95TyrAsn: 1.95 ± 0.452
0.878TyrPro: 0.878 ± 0.259
1.853TyrGln: 1.853 ± 0.444
1.463TyrArg: 1.463 ± 0.416
3.608TyrSer: 3.608 ± 0.746
1.268TyrThr: 1.268 ± 0.414
2.341TyrVal: 2.341 ± 0.537
0.78TyrTrp: 0.78 ± 0.327
1.56TyrTyr: 1.56 ± 0.477
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 50 proteins (10255 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski