Amino acid dipepetide frequency for Pediococcus phage cIP1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.849AlaAla: 9.849 ± 1.68
0.346AlaCys: 0.346 ± 0.169
6.393AlaAsp: 6.393 ± 0.691
4.579AlaGlu: 4.579 ± 0.736
3.629AlaPhe: 3.629 ± 0.475
7.171AlaGly: 7.171 ± 1.032
1.641AlaHis: 1.641 ± 0.317
6.998AlaIle: 6.998 ± 0.973
4.665AlaLys: 4.665 ± 0.711
5.875AlaLeu: 5.875 ± 0.707
4.06AlaMet: 4.06 ± 0.874
4.147AlaAsn: 4.147 ± 0.521
2.592AlaPro: 2.592 ± 0.649
3.715AlaGln: 3.715 ± 0.665
4.406AlaArg: 4.406 ± 0.611
4.752AlaSer: 4.752 ± 0.686
6.393AlaThr: 6.393 ± 0.929
6.307AlaVal: 6.307 ± 0.762
0.95AlaTrp: 0.95 ± 0.236
3.197AlaTyr: 3.197 ± 0.57
0.0AlaXaa: 0.0 ± 0.0
Cys
0.346CysAla: 0.346 ± 0.18
0.086CysCys: 0.086 ± 0.078
0.086CysAsp: 0.086 ± 0.074
0.173CysGlu: 0.173 ± 0.121
0.086CysPhe: 0.086 ± 0.075
0.605CysGly: 0.605 ± 0.239
0.432CysHis: 0.432 ± 0.188
0.086CysIle: 0.086 ± 0.095
0.259CysLys: 0.259 ± 0.149
0.173CysLeu: 0.173 ± 0.123
0.086CysMet: 0.086 ± 0.074
0.259CysAsn: 0.259 ± 0.14
0.259CysPro: 0.259 ± 0.149
0.086CysGln: 0.086 ± 0.093
0.605CysArg: 0.605 ± 0.199
0.086CysSer: 0.086 ± 0.089
0.173CysThr: 0.173 ± 0.125
0.346CysVal: 0.346 ± 0.196
0.0CysTrp: 0.0 ± 0.0
0.778CysTyr: 0.778 ± 0.25
0.0CysXaa: 0.0 ± 0.0
Asp
5.702AspAla: 5.702 ± 0.53
0.346AspCys: 0.346 ± 0.19
5.443AspAsp: 5.443 ± 0.857
5.097AspGlu: 5.097 ± 0.721
1.987AspPhe: 1.987 ± 0.385
6.739AspGly: 6.739 ± 0.813
0.778AspHis: 0.778 ± 0.269
4.32AspIle: 4.32 ± 0.605
4.579AspLys: 4.579 ± 0.586
4.752AspLeu: 4.752 ± 0.533
2.678AspMet: 2.678 ± 0.443
4.924AspAsn: 4.924 ± 0.566
2.246AspPro: 2.246 ± 0.47
1.987AspGln: 1.987 ± 0.368
2.333AspArg: 2.333 ± 0.506
4.233AspSer: 4.233 ± 0.611
4.406AspThr: 4.406 ± 0.735
5.27AspVal: 5.27 ± 0.789
0.432AspTrp: 0.432 ± 0.227
2.851AspTyr: 2.851 ± 0.516
0.0AspXaa: 0.0 ± 0.0
Glu
3.024GluAla: 3.024 ± 0.547
0.432GluCys: 0.432 ± 0.183
3.283GluAsp: 3.283 ± 0.575
2.505GluGlu: 2.505 ± 0.589
2.073GluPhe: 2.073 ± 0.377
1.901GluGly: 1.901 ± 0.371
1.296GluHis: 1.296 ± 0.291
4.06GluIle: 4.06 ± 0.683
3.801GluLys: 3.801 ± 0.64
5.961GluLeu: 5.961 ± 0.819
1.814GluMet: 1.814 ± 0.464
2.419GluAsn: 2.419 ± 0.404
1.901GluPro: 1.901 ± 0.471
2.246GluGln: 2.246 ± 0.48
3.715GluArg: 3.715 ± 0.705
3.369GluSer: 3.369 ± 0.511
3.283GluThr: 3.283 ± 0.501
3.542GluVal: 3.542 ± 0.662
0.346GluTrp: 0.346 ± 0.175
3.283GluTyr: 3.283 ± 0.648
0.0GluXaa: 0.0 ± 0.0
Phe
1.814PheAla: 1.814 ± 0.422
0.086PheCys: 0.086 ± 0.091
2.678PheAsp: 2.678 ± 0.49
1.901PheGlu: 1.901 ± 0.417
1.382PhePhe: 1.382 ± 0.318
3.283PheGly: 3.283 ± 0.532
0.691PheHis: 0.691 ± 0.275
2.16PheIle: 2.16 ± 0.741
3.11PheLys: 3.11 ± 0.524
1.901PheLeu: 1.901 ± 0.414
1.123PheMet: 1.123 ± 0.268
3.024PheAsn: 3.024 ± 0.446
0.778PhePro: 0.778 ± 0.28
0.691PheGln: 0.691 ± 0.253
1.555PheArg: 1.555 ± 0.316
1.987PheSer: 1.987 ± 0.36
4.406PheThr: 4.406 ± 0.525
2.592PheVal: 2.592 ± 0.409
0.346PheTrp: 0.346 ± 0.174
1.814PheTyr: 1.814 ± 0.375
0.0PheXaa: 0.0 ± 0.0
Gly
5.961GlyAla: 5.961 ± 1.435
0.086GlyCys: 0.086 ± 0.095
5.011GlyAsp: 5.011 ± 0.711
4.406GlyGlu: 4.406 ± 0.635
2.937GlyPhe: 2.937 ± 0.495
4.147GlyGly: 4.147 ± 0.719
0.95GlyHis: 0.95 ± 0.282
4.752GlyIle: 4.752 ± 0.801
7.343GlyLys: 7.343 ± 0.741
4.838GlyLeu: 4.838 ± 0.85
2.592GlyMet: 2.592 ± 0.612
3.801GlyAsn: 3.801 ± 0.542
1.469GlyPro: 1.469 ± 0.326
2.333GlyGln: 2.333 ± 0.408
3.629GlyArg: 3.629 ± 0.644
4.924GlySer: 4.924 ± 0.563
4.665GlyThr: 4.665 ± 0.617
6.134GlyVal: 6.134 ± 0.888
0.518GlyTrp: 0.518 ± 0.22
4.492GlyTyr: 4.492 ± 0.672
0.0GlyXaa: 0.0 ± 0.0
His
1.469HisAla: 1.469 ± 0.297
0.173HisCys: 0.173 ± 0.116
2.16HisAsp: 2.16 ± 0.477
0.691HisGlu: 0.691 ± 0.252
0.518HisPhe: 0.518 ± 0.28
2.073HisGly: 2.073 ± 0.432
0.346HisHis: 0.346 ± 0.155
0.778HisIle: 0.778 ± 0.214
0.864HisLys: 0.864 ± 0.254
0.518HisLeu: 0.518 ± 0.178
0.518HisMet: 0.518 ± 0.213
0.864HisAsn: 0.864 ± 0.261
0.691HisPro: 0.691 ± 0.233
0.173HisGln: 0.173 ± 0.129
0.95HisArg: 0.95 ± 0.257
0.864HisSer: 0.864 ± 0.264
1.21HisThr: 1.21 ± 0.317
0.691HisVal: 0.691 ± 0.223
0.432HisTrp: 0.432 ± 0.176
0.95HisTyr: 0.95 ± 0.279
0.0HisXaa: 0.0 ± 0.0
Ile
6.393IleAla: 6.393 ± 1.057
0.259IleCys: 0.259 ± 0.152
4.838IleAsp: 4.838 ± 0.502
4.233IleGlu: 4.233 ± 0.619
1.814IlePhe: 1.814 ± 0.382
4.32IleGly: 4.32 ± 1.099
0.778IleHis: 0.778 ± 0.272
3.801IleIle: 3.801 ± 0.566
4.838IleLys: 4.838 ± 0.733
3.542IleLeu: 3.542 ± 0.456
2.419IleMet: 2.419 ± 0.425
3.11IleAsn: 3.11 ± 0.485
1.641IlePro: 1.641 ± 0.313
1.901IleGln: 1.901 ± 0.421
2.333IleArg: 2.333 ± 0.495
3.974IleSer: 3.974 ± 0.64
4.147IleThr: 4.147 ± 0.747
4.406IleVal: 4.406 ± 0.731
0.259IleTrp: 0.259 ± 0.14
2.333IleTyr: 2.333 ± 0.472
0.0IleXaa: 0.0 ± 0.0
Lys
5.529LysAla: 5.529 ± 0.642
0.086LysCys: 0.086 ± 0.091
4.665LysAsp: 4.665 ± 0.582
3.11LysGlu: 3.11 ± 0.614
2.333LysPhe: 2.333 ± 0.447
4.32LysGly: 4.32 ± 0.676
1.641LysHis: 1.641 ± 0.39
3.197LysIle: 3.197 ± 0.68
3.629LysLys: 3.629 ± 0.519
6.134LysLeu: 6.134 ± 0.757
2.419LysMet: 2.419 ± 0.419
3.369LysAsn: 3.369 ± 0.657
2.419LysPro: 2.419 ± 0.53
2.333LysGln: 2.333 ± 0.32
3.369LysArg: 3.369 ± 0.649
3.715LysSer: 3.715 ± 0.49
4.233LysThr: 4.233 ± 0.536
4.579LysVal: 4.579 ± 0.723
0.95LysTrp: 0.95 ± 0.297
2.505LysTyr: 2.505 ± 0.576
0.0LysXaa: 0.0 ± 0.0
Leu
7.171LeuAla: 7.171 ± 0.616
0.432LeuCys: 0.432 ± 0.175
5.27LeuAsp: 5.27 ± 0.695
3.888LeuGlu: 3.888 ± 0.647
2.592LeuPhe: 2.592 ± 0.414
4.665LeuGly: 4.665 ± 0.595
0.864LeuHis: 0.864 ± 0.329
4.06LeuIle: 4.06 ± 0.53
4.924LeuLys: 4.924 ± 0.704
4.492LeuLeu: 4.492 ± 0.733
2.16LeuMet: 2.16 ± 0.431
4.06LeuAsn: 4.06 ± 0.601
3.024LeuPro: 3.024 ± 0.482
2.765LeuGln: 2.765 ± 0.453
3.11LeuArg: 3.11 ± 0.601
4.752LeuSer: 4.752 ± 0.574
3.197LeuThr: 3.197 ± 0.441
5.27LeuVal: 5.27 ± 0.776
0.864LeuTrp: 0.864 ± 0.306
3.024LeuTyr: 3.024 ± 0.657
0.0LeuXaa: 0.0 ± 0.0
Met
3.283MetAla: 3.283 ± 0.577
0.259MetCys: 0.259 ± 0.145
1.814MetAsp: 1.814 ± 0.419
1.641MetGlu: 1.641 ± 0.348
1.901MetPhe: 1.901 ± 0.482
1.296MetGly: 1.296 ± 0.615
0.432MetHis: 0.432 ± 0.185
2.592MetIle: 2.592 ± 0.492
2.937MetLys: 2.937 ± 0.519
2.678MetLeu: 2.678 ± 0.715
1.123MetMet: 1.123 ± 0.282
1.21MetAsn: 1.21 ± 0.285
0.432MetPro: 0.432 ± 0.206
1.382MetGln: 1.382 ± 0.316
1.296MetArg: 1.296 ± 0.377
2.419MetSer: 2.419 ± 0.556
2.678MetThr: 2.678 ± 0.448
1.123MetVal: 1.123 ± 0.275
0.173MetTrp: 0.173 ± 0.127
1.555MetTyr: 1.555 ± 0.342
0.0MetXaa: 0.0 ± 0.0
Asn
6.22AsnAla: 6.22 ± 0.885
0.173AsnCys: 0.173 ± 0.128
4.32AsnAsp: 4.32 ± 0.545
2.765AsnGlu: 2.765 ± 0.539
1.123AsnPhe: 1.123 ± 0.294
6.22AsnGly: 6.22 ± 0.789
0.95AsnHis: 0.95 ± 0.205
2.505AsnIle: 2.505 ± 0.49
2.419AsnLys: 2.419 ± 0.608
4.06AsnLeu: 4.06 ± 0.578
1.641AsnMet: 1.641 ± 0.35
3.283AsnAsn: 3.283 ± 0.739
1.469AsnPro: 1.469 ± 0.374
1.469AsnGln: 1.469 ± 0.406
2.937AsnArg: 2.937 ± 0.477
2.505AsnSer: 2.505 ± 0.391
2.592AsnThr: 2.592 ± 0.446
3.542AsnVal: 3.542 ± 0.473
0.95AsnTrp: 0.95 ± 0.278
2.678AsnTyr: 2.678 ± 0.509
0.0AsnXaa: 0.0 ± 0.0
Pro
3.024ProAla: 3.024 ± 0.609
0.173ProCys: 0.173 ± 0.119
2.851ProAsp: 2.851 ± 0.552
1.728ProGlu: 1.728 ± 0.516
0.864ProPhe: 0.864 ± 0.257
2.505ProGly: 2.505 ± 0.501
0.259ProHis: 0.259 ± 0.143
1.296ProIle: 1.296 ± 0.263
1.814ProLys: 1.814 ± 0.503
2.246ProLeu: 2.246 ± 0.401
0.864ProMet: 0.864 ± 0.276
1.728ProAsn: 1.728 ± 0.378
0.259ProPro: 0.259 ± 0.152
1.123ProGln: 1.123 ± 0.327
1.469ProArg: 1.469 ± 0.315
2.16ProSer: 2.16 ± 0.457
1.728ProThr: 1.728 ± 0.414
3.283ProVal: 3.283 ± 0.484
0.518ProTrp: 0.518 ± 0.218
1.21ProTyr: 1.21 ± 0.266
0.0ProXaa: 0.0 ± 0.0
Gln
4.233GlnAla: 4.233 ± 0.534
0.173GlnCys: 0.173 ± 0.117
0.95GlnAsp: 0.95 ± 0.331
1.382GlnGlu: 1.382 ± 0.346
1.296GlnPhe: 1.296 ± 0.332
2.246GlnGly: 2.246 ± 0.383
0.173GlnHis: 0.173 ± 0.124
1.296GlnIle: 1.296 ± 0.309
1.296GlnLys: 1.296 ± 0.31
2.419GlnLeu: 2.419 ± 0.436
0.778GlnMet: 0.778 ± 0.254
1.296GlnAsn: 1.296 ± 0.3
1.123GlnPro: 1.123 ± 0.252
1.469GlnGln: 1.469 ± 0.327
1.814GlnArg: 1.814 ± 0.335
2.246GlnSer: 2.246 ± 0.346
2.678GlnThr: 2.678 ± 0.434
2.419GlnVal: 2.419 ± 0.535
0.605GlnTrp: 0.605 ± 0.225
1.814GlnTyr: 1.814 ± 0.434
0.0GlnXaa: 0.0 ± 0.0
Arg
4.579ArgAla: 4.579 ± 0.626
0.346ArgCys: 0.346 ± 0.165
3.369ArgAsp: 3.369 ± 0.652
2.505ArgGlu: 2.505 ± 0.487
2.678ArgPhe: 2.678 ± 0.538
3.801ArgGly: 3.801 ± 0.477
1.123ArgHis: 1.123 ± 0.296
3.197ArgIle: 3.197 ± 0.495
2.419ArgLys: 2.419 ± 0.562
4.233ArgLeu: 4.233 ± 0.771
1.037ArgMet: 1.037 ± 0.354
2.246ArgAsn: 2.246 ± 0.436
1.296ArgPro: 1.296 ± 0.321
1.382ArgGln: 1.382 ± 0.321
2.937ArgArg: 2.937 ± 0.593
1.555ArgSer: 1.555 ± 0.333
2.073ArgThr: 2.073 ± 0.457
4.406ArgVal: 4.406 ± 0.718
0.605ArgTrp: 0.605 ± 0.23
2.505ArgTyr: 2.505 ± 0.508
0.0ArgXaa: 0.0 ± 0.0
Ser
4.32SerAla: 4.32 ± 0.626
0.259SerCys: 0.259 ± 0.14
5.27SerAsp: 5.27 ± 0.746
3.024SerGlu: 3.024 ± 0.393
2.678SerPhe: 2.678 ± 0.749
6.479SerGly: 6.479 ± 0.842
1.382SerHis: 1.382 ± 0.321
4.579SerIle: 4.579 ± 0.655
2.765SerLys: 2.765 ± 0.545
3.801SerLeu: 3.801 ± 0.586
2.16SerMet: 2.16 ± 0.405
3.024SerAsn: 3.024 ± 0.55
1.901SerPro: 1.901 ± 0.404
1.987SerGln: 1.987 ± 0.44
2.073SerArg: 2.073 ± 0.412
4.492SerSer: 4.492 ± 0.69
2.937SerThr: 2.937 ± 0.481
4.32SerVal: 4.32 ± 0.485
1.037SerTrp: 1.037 ± 0.352
2.073SerTyr: 2.073 ± 0.405
0.0SerXaa: 0.0 ± 0.0
Thr
7.084ThrAla: 7.084 ± 1.025
0.432ThrCys: 0.432 ± 0.207
3.542ThrAsp: 3.542 ± 0.642
3.542ThrGlu: 3.542 ± 0.494
2.765ThrPhe: 2.765 ± 0.516
5.097ThrGly: 5.097 ± 0.769
0.864ThrHis: 0.864 ± 0.286
4.147ThrIle: 4.147 ± 0.515
3.11ThrLys: 3.11 ± 0.509
5.011ThrLeu: 5.011 ± 0.642
1.123ThrMet: 1.123 ± 0.29
3.542ThrAsn: 3.542 ± 0.683
3.024ThrPro: 3.024 ± 0.749
1.296ThrGln: 1.296 ± 0.286
2.851ThrArg: 2.851 ± 0.508
4.233ThrSer: 4.233 ± 0.521
3.197ThrThr: 3.197 ± 0.506
5.097ThrVal: 5.097 ± 0.685
0.518ThrTrp: 0.518 ± 0.173
2.765ThrTyr: 2.765 ± 0.468
0.0ThrXaa: 0.0 ± 0.0
Val
7.775ValAla: 7.775 ± 0.762
0.518ValCys: 0.518 ± 0.246
4.752ValAsp: 4.752 ± 0.565
3.456ValGlu: 3.456 ± 0.566
2.851ValPhe: 2.851 ± 0.474
4.147ValGly: 4.147 ± 0.632
1.296ValHis: 1.296 ± 0.246
4.492ValIle: 4.492 ± 0.626
5.184ValLys: 5.184 ± 0.467
4.492ValLeu: 4.492 ± 0.655
1.987ValMet: 1.987 ± 0.421
4.06ValAsn: 4.06 ± 0.507
2.851ValPro: 2.851 ± 0.495
1.728ValGln: 1.728 ± 0.323
3.542ValArg: 3.542 ± 0.626
5.184ValSer: 5.184 ± 0.75
5.616ValThr: 5.616 ± 0.837
6.479ValVal: 6.479 ± 0.833
1.21ValTrp: 1.21 ± 0.29
3.197ValTyr: 3.197 ± 0.77
0.0ValXaa: 0.0 ± 0.0
Trp
1.037TrpAla: 1.037 ± 0.3
0.0TrpCys: 0.0 ± 0.0
0.691TrpAsp: 0.691 ± 0.199
0.691TrpGlu: 0.691 ± 0.192
0.259TrpPhe: 0.259 ± 0.135
0.864TrpGly: 0.864 ± 0.258
0.259TrpHis: 0.259 ± 0.155
0.605TrpIle: 0.605 ± 0.217
0.518TrpLys: 0.518 ± 0.217
1.21TrpLeu: 1.21 ± 0.378
0.0TrpMet: 0.0 ± 0.0
0.605TrpAsn: 0.605 ± 0.238
0.173TrpPro: 0.173 ± 0.104
0.173TrpGln: 0.173 ± 0.125
0.518TrpArg: 0.518 ± 0.174
0.95TrpSer: 0.95 ± 0.243
0.778TrpThr: 0.778 ± 0.186
1.555TrpVal: 1.555 ± 0.426
0.086TrpTrp: 0.086 ± 0.089
0.346TrpTyr: 0.346 ± 0.156
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.851TyrAla: 2.851 ± 0.627
0.346TyrCys: 0.346 ± 0.184
3.715TyrAsp: 3.715 ± 0.582
2.937TyrGlu: 2.937 ± 0.474
1.555TyrPhe: 1.555 ± 0.408
3.11TyrGly: 3.11 ± 0.397
0.778TyrHis: 0.778 ± 0.27
2.419TyrIle: 2.419 ± 0.498
4.06TyrLys: 4.06 ± 0.74
2.246TyrLeu: 2.246 ± 0.422
1.555TyrMet: 1.555 ± 0.4
2.851TyrAsn: 2.851 ± 0.498
1.641TyrPro: 1.641 ± 0.394
1.469TyrGln: 1.469 ± 0.423
2.851TyrArg: 2.851 ± 0.614
2.16TyrSer: 2.16 ± 0.44
2.851TyrThr: 2.851 ± 0.548
3.456TyrVal: 3.456 ± 0.575
0.518TyrTrp: 0.518 ± 0.19
1.814TyrTyr: 1.814 ± 0.39
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (11576 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski