Amino acid dipepetide frequency for Corynebacterium phage LGCM-VI

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.143AlaAla: 14.143 ± 2.309
0.651AlaCys: 0.651 ± 0.335
6.234AlaAsp: 6.234 ± 0.559
7.723AlaGlu: 7.723 ± 0.822
3.257AlaPhe: 3.257 ± 0.6
7.351AlaGly: 7.351 ± 1.227
2.14AlaHis: 2.14 ± 0.463
5.955AlaIle: 5.955 ± 0.68
5.769AlaLys: 5.769 ± 0.87
8.188AlaLeu: 8.188 ± 1.454
3.443AlaMet: 3.443 ± 0.524
2.978AlaAsn: 2.978 ± 0.453
3.536AlaPro: 3.536 ± 0.566
6.327AlaGln: 6.327 ± 1.331
5.304AlaArg: 5.304 ± 0.604
6.327AlaSer: 6.327 ± 0.808
6.606AlaThr: 6.606 ± 0.844
7.909AlaVal: 7.909 ± 1.001
1.861AlaTrp: 1.861 ± 0.483
2.047AlaTyr: 2.047 ± 0.416
0.0AlaXaa: 0.0 ± 0.0
Cys
0.558CysAla: 0.558 ± 0.236
0.186CysCys: 0.186 ± 0.138
0.465CysAsp: 0.465 ± 0.186
0.372CysGlu: 0.372 ± 0.203
0.093CysPhe: 0.093 ± 0.103
0.651CysGly: 0.651 ± 0.291
0.0CysHis: 0.0 ± 0.0
0.372CysIle: 0.372 ± 0.189
0.186CysLys: 0.186 ± 0.134
0.651CysLeu: 0.651 ± 0.236
0.279CysMet: 0.279 ± 0.175
0.372CysAsn: 0.372 ± 0.174
0.279CysPro: 0.279 ± 0.146
0.186CysGln: 0.186 ± 0.176
1.024CysArg: 1.024 ± 0.393
0.837CysSer: 0.837 ± 0.276
0.279CysThr: 0.279 ± 0.149
0.186CysVal: 0.186 ± 0.199
0.186CysTrp: 0.186 ± 0.128
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
7.351AspAla: 7.351 ± 0.624
0.186AspCys: 0.186 ± 0.113
3.35AspAsp: 3.35 ± 0.811
4.652AspGlu: 4.652 ± 0.777
3.071AspPhe: 3.071 ± 0.718
4.839AspGly: 4.839 ± 0.62
1.303AspHis: 1.303 ± 0.348
3.35AspIle: 3.35 ± 0.6
2.512AspLys: 2.512 ± 0.486
5.025AspLeu: 5.025 ± 0.885
1.396AspMet: 1.396 ± 0.321
2.698AspAsn: 2.698 ± 0.531
3.164AspPro: 3.164 ± 0.629
2.326AspGln: 2.326 ± 0.445
2.326AspArg: 2.326 ± 0.344
4.28AspSer: 4.28 ± 0.555
2.885AspThr: 2.885 ± 0.575
4.466AspVal: 4.466 ± 0.693
1.675AspTrp: 1.675 ± 0.349
1.396AspTyr: 1.396 ± 0.459
0.0AspXaa: 0.0 ± 0.0
Glu
5.862GluAla: 5.862 ± 1.068
0.558GluCys: 0.558 ± 0.234
3.35GluAsp: 3.35 ± 0.622
4.746GluGlu: 4.746 ± 0.603
1.861GluPhe: 1.861 ± 0.483
3.164GluGly: 3.164 ± 0.547
1.21GluHis: 1.21 ± 0.27
4.932GluIle: 4.932 ± 0.785
4.187GluLys: 4.187 ± 0.566
6.7GluLeu: 6.7 ± 0.714
1.396GluMet: 1.396 ± 0.292
2.978GluAsn: 2.978 ± 0.557
2.233GluPro: 2.233 ± 0.42
2.512GluGln: 2.512 ± 0.449
3.536GluArg: 3.536 ± 0.695
4.652GluSer: 4.652 ± 0.687
4.094GluThr: 4.094 ± 0.573
4.559GluVal: 4.559 ± 0.556
1.117GluTrp: 1.117 ± 0.286
0.93GluTyr: 0.93 ± 0.351
0.0GluXaa: 0.0 ± 0.0
Phe
2.512PheAla: 2.512 ± 0.673
0.279PheCys: 0.279 ± 0.17
2.791PheAsp: 2.791 ± 0.645
1.675PheGlu: 1.675 ± 0.346
0.837PhePhe: 0.837 ± 0.433
1.954PheGly: 1.954 ± 0.42
1.024PheHis: 1.024 ± 0.256
1.21PheIle: 1.21 ± 0.269
1.582PheLys: 1.582 ± 0.438
1.675PheLeu: 1.675 ± 0.482
0.558PheMet: 0.558 ± 0.283
1.21PheAsn: 1.21 ± 0.372
1.21PhePro: 1.21 ± 0.402
1.21PheGln: 1.21 ± 0.3
1.675PheArg: 1.675 ± 0.589
2.326PheSer: 2.326 ± 0.484
2.14PheThr: 2.14 ± 0.418
2.14PheVal: 2.14 ± 0.454
0.372PheTrp: 0.372 ± 0.2
0.744PheTyr: 0.744 ± 0.236
0.0PheXaa: 0.0 ± 0.0
Gly
7.816GlyAla: 7.816 ± 1.554
0.279GlyCys: 0.279 ± 0.188
4.094GlyAsp: 4.094 ± 0.693
2.885GlyGlu: 2.885 ± 0.464
2.698GlyPhe: 2.698 ± 0.654
5.955GlyGly: 5.955 ± 1.426
1.582GlyHis: 1.582 ± 0.324
5.862GlyIle: 5.862 ± 1.182
5.118GlyLys: 5.118 ± 0.733
7.258GlyLeu: 7.258 ± 1.273
0.93GlyMet: 0.93 ± 0.315
2.978GlyAsn: 2.978 ± 0.502
2.326GlyPro: 2.326 ± 0.561
2.791GlyGln: 2.791 ± 0.397
4.001GlyArg: 4.001 ± 0.778
4.094GlySer: 4.094 ± 0.506
4.466GlyThr: 4.466 ± 0.59
6.606GlyVal: 6.606 ± 0.887
1.489GlyTrp: 1.489 ± 0.595
1.861GlyTyr: 1.861 ± 0.372
0.0GlyXaa: 0.0 ± 0.0
His
1.303HisAla: 1.303 ± 0.405
0.372HisCys: 0.372 ± 0.213
1.303HisAsp: 1.303 ± 0.271
1.024HisGlu: 1.024 ± 0.273
0.465HisPhe: 0.465 ± 0.212
1.582HisGly: 1.582 ± 0.304
0.465HisHis: 0.465 ± 0.183
1.582HisIle: 1.582 ± 0.296
0.558HisLys: 0.558 ± 0.194
1.861HisLeu: 1.861 ± 0.413
0.465HisMet: 0.465 ± 0.188
0.651HisAsn: 0.651 ± 0.238
1.024HisPro: 1.024 ± 0.253
0.744HisGln: 0.744 ± 0.235
1.489HisArg: 1.489 ± 0.287
0.93HisSer: 0.93 ± 0.238
1.117HisThr: 1.117 ± 0.27
1.489HisVal: 1.489 ± 0.445
0.558HisTrp: 0.558 ± 0.21
0.372HisTyr: 0.372 ± 0.178
0.0HisXaa: 0.0 ± 0.0
Ile
7.816IleAla: 7.816 ± 0.804
0.093IleCys: 0.093 ± 0.097
3.722IleAsp: 3.722 ± 0.737
3.908IleGlu: 3.908 ± 0.628
1.117IlePhe: 1.117 ± 0.395
5.397IleGly: 5.397 ± 0.947
1.117IleHis: 1.117 ± 0.314
2.978IleIle: 2.978 ± 0.638
4.466IleLys: 4.466 ± 0.795
3.35IleLeu: 3.35 ± 0.503
1.024IleMet: 1.024 ± 0.288
3.257IleAsn: 3.257 ± 0.387
3.071IlePro: 3.071 ± 0.542
2.047IleGln: 2.047 ± 0.398
4.001IleArg: 4.001 ± 0.654
3.443IleSer: 3.443 ± 0.483
4.839IleThr: 4.839 ± 0.75
2.791IleVal: 2.791 ± 0.498
1.024IleTrp: 1.024 ± 0.327
0.93IleTyr: 0.93 ± 0.229
0.0IleXaa: 0.0 ± 0.0
Lys
7.444LysAla: 7.444 ± 1.282
0.093LysCys: 0.093 ± 0.083
3.164LysAsp: 3.164 ± 0.51
3.071LysGlu: 3.071 ± 0.395
0.93LysPhe: 0.93 ± 0.252
4.001LysGly: 4.001 ± 0.519
0.651LysHis: 0.651 ± 0.284
4.187LysIle: 4.187 ± 0.683
2.605LysLys: 2.605 ± 0.603
5.211LysLeu: 5.211 ± 0.716
0.837LysMet: 0.837 ± 0.306
2.326LysAsn: 2.326 ± 0.596
2.14LysPro: 2.14 ± 0.471
3.071LysGln: 3.071 ± 0.412
3.536LysArg: 3.536 ± 0.558
2.791LysSer: 2.791 ± 0.648
4.187LysThr: 4.187 ± 0.72
3.35LysVal: 3.35 ± 0.616
0.651LysTrp: 0.651 ± 0.183
1.117LysTyr: 1.117 ± 0.39
0.0LysXaa: 0.0 ± 0.0
Leu
8.654LeuAla: 8.654 ± 1.087
0.93LeuCys: 0.93 ± 0.278
5.862LeuAsp: 5.862 ± 0.92
6.7LeuGlu: 6.7 ± 0.79
2.047LeuPhe: 2.047 ± 0.407
7.165LeuGly: 7.165 ± 1.114
1.21LeuHis: 1.21 ± 0.447
3.722LeuIle: 3.722 ± 0.53
4.746LeuLys: 4.746 ± 0.565
7.351LeuLeu: 7.351 ± 0.809
2.326LeuMet: 2.326 ± 0.585
2.605LeuAsn: 2.605 ± 0.534
3.722LeuPro: 3.722 ± 0.529
2.233LeuGln: 2.233 ± 0.523
4.466LeuArg: 4.466 ± 0.962
6.327LeuSer: 6.327 ± 0.793
4.559LeuThr: 4.559 ± 0.634
4.839LeuVal: 4.839 ± 0.826
1.396LeuTrp: 1.396 ± 0.377
1.768LeuTyr: 1.768 ± 0.432
0.0LeuXaa: 0.0 ± 0.0
Met
1.954MetAla: 1.954 ± 0.328
0.0MetCys: 0.0 ± 0.0
1.303MetAsp: 1.303 ± 0.424
1.954MetGlu: 1.954 ± 0.424
0.558MetPhe: 0.558 ± 0.269
1.768MetGly: 1.768 ± 0.372
0.186MetHis: 0.186 ± 0.197
1.21MetIle: 1.21 ± 0.318
1.21MetLys: 1.21 ± 0.378
1.489MetLeu: 1.489 ± 0.654
0.0MetMet: 0.0 ± 0.0
1.489MetAsn: 1.489 ± 0.377
1.303MetPro: 1.303 ± 0.323
0.93MetGln: 0.93 ± 0.287
1.582MetArg: 1.582 ± 0.339
1.582MetSer: 1.582 ± 0.364
1.768MetThr: 1.768 ± 0.349
1.024MetVal: 1.024 ± 0.342
0.651MetTrp: 0.651 ± 0.247
0.558MetTyr: 0.558 ± 0.203
0.0MetXaa: 0.0 ± 0.0
Asn
4.187AsnAla: 4.187 ± 0.655
0.093AsnCys: 0.093 ± 0.103
2.14AsnAsp: 2.14 ± 0.451
2.512AsnGlu: 2.512 ± 0.414
0.93AsnPhe: 0.93 ± 0.329
2.978AsnGly: 2.978 ± 0.559
0.651AsnHis: 0.651 ± 0.249
1.954AsnIle: 1.954 ± 0.524
1.861AsnLys: 1.861 ± 0.471
3.815AsnLeu: 3.815 ± 0.593
0.651AsnMet: 0.651 ± 0.245
1.768AsnAsn: 1.768 ± 0.489
2.698AsnPro: 2.698 ± 0.434
1.117AsnGln: 1.117 ± 0.317
2.233AsnArg: 2.233 ± 0.591
2.233AsnSer: 2.233 ± 0.504
1.675AsnThr: 1.675 ± 0.355
2.512AsnVal: 2.512 ± 0.493
0.279AsnTrp: 0.279 ± 0.147
1.21AsnTyr: 1.21 ± 0.39
0.0AsnXaa: 0.0 ± 0.0
Pro
3.536ProAla: 3.536 ± 0.689
0.279ProCys: 0.279 ± 0.13
2.698ProAsp: 2.698 ± 0.672
3.443ProGlu: 3.443 ± 0.601
1.582ProPhe: 1.582 ± 0.33
3.536ProGly: 3.536 ± 0.597
1.117ProHis: 1.117 ± 0.353
2.885ProIle: 2.885 ± 0.526
1.954ProLys: 1.954 ± 0.349
4.28ProLeu: 4.28 ± 0.632
0.837ProMet: 0.837 ± 0.244
1.21ProAsn: 1.21 ± 0.25
2.512ProPro: 2.512 ± 0.589
2.512ProGln: 2.512 ± 0.44
1.768ProArg: 1.768 ± 0.514
2.512ProSer: 2.512 ± 0.674
2.791ProThr: 2.791 ± 0.612
3.35ProVal: 3.35 ± 0.672
0.744ProTrp: 0.744 ± 0.267
1.024ProTyr: 1.024 ± 0.305
0.0ProXaa: 0.0 ± 0.0
Gln
5.769GlnAla: 5.769 ± 0.743
0.558GlnCys: 0.558 ± 0.218
2.14GlnAsp: 2.14 ± 0.316
2.14GlnGlu: 2.14 ± 0.568
1.396GlnPhe: 1.396 ± 0.474
2.233GlnGly: 2.233 ± 0.676
0.744GlnHis: 0.744 ± 0.297
1.303GlnIle: 1.303 ± 0.302
2.326GlnLys: 2.326 ± 0.361
4.28GlnLeu: 4.28 ± 0.542
0.837GlnMet: 0.837 ± 0.231
0.744GlnAsn: 0.744 ± 0.265
1.675GlnPro: 1.675 ± 0.358
2.326GlnGln: 2.326 ± 0.568
2.791GlnArg: 2.791 ± 0.479
3.164GlnSer: 3.164 ± 0.503
1.954GlnThr: 1.954 ± 0.327
2.791GlnVal: 2.791 ± 0.49
1.675GlnTrp: 1.675 ± 0.355
0.837GlnTyr: 0.837 ± 0.321
0.0GlnXaa: 0.0 ± 0.0
Arg
5.118ArgAla: 5.118 ± 0.695
0.465ArgCys: 0.465 ± 0.238
3.536ArgAsp: 3.536 ± 0.65
3.071ArgGlu: 3.071 ± 0.63
2.14ArgPhe: 2.14 ± 0.505
4.559ArgGly: 4.559 ± 0.519
1.21ArgHis: 1.21 ± 0.378
4.094ArgIle: 4.094 ± 0.637
3.908ArgLys: 3.908 ± 0.649
3.629ArgLeu: 3.629 ± 0.441
1.489ArgMet: 1.489 ± 0.337
2.14ArgAsn: 2.14 ± 0.402
3.257ArgPro: 3.257 ± 0.579
1.489ArgGln: 1.489 ± 0.264
5.49ArgArg: 5.49 ± 0.967
3.443ArgSer: 3.443 ± 0.624
1.861ArgThr: 1.861 ± 0.497
3.908ArgVal: 3.908 ± 0.817
1.117ArgTrp: 1.117 ± 0.402
1.024ArgTyr: 1.024 ± 0.244
0.0ArgXaa: 0.0 ± 0.0
Ser
6.979SerAla: 6.979 ± 1.015
0.651SerCys: 0.651 ± 0.24
3.629SerAsp: 3.629 ± 0.707
3.629SerGlu: 3.629 ± 0.793
2.047SerPhe: 2.047 ± 0.48
5.211SerGly: 5.211 ± 0.593
0.93SerHis: 0.93 ± 0.286
4.559SerIle: 4.559 ± 0.617
3.164SerLys: 3.164 ± 0.437
5.211SerLeu: 5.211 ± 0.872
2.326SerMet: 2.326 ± 0.532
2.326SerAsn: 2.326 ± 0.434
2.698SerPro: 2.698 ± 0.519
3.257SerGln: 3.257 ± 0.524
3.722SerArg: 3.722 ± 0.828
3.908SerSer: 3.908 ± 0.548
3.257SerThr: 3.257 ± 0.569
4.28SerVal: 4.28 ± 0.771
0.837SerTrp: 0.837 ± 0.267
1.303SerTyr: 1.303 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
6.606ThrAla: 6.606 ± 0.794
0.465ThrCys: 0.465 ± 0.226
2.791ThrAsp: 2.791 ± 0.557
3.071ThrGlu: 3.071 ± 0.533
1.768ThrPhe: 1.768 ± 0.369
4.559ThrGly: 4.559 ± 1.132
1.21ThrHis: 1.21 ± 0.31
4.373ThrIle: 4.373 ± 0.824
2.791ThrLys: 2.791 ± 0.607
4.652ThrLeu: 4.652 ± 0.7
1.582ThrMet: 1.582 ± 0.424
1.396ThrAsn: 1.396 ± 0.327
2.885ThrPro: 2.885 ± 0.643
2.978ThrGln: 2.978 ± 0.495
2.978ThrArg: 2.978 ± 0.552
4.373ThrSer: 4.373 ± 0.612
3.815ThrThr: 3.815 ± 0.54
4.559ThrVal: 4.559 ± 0.674
1.396ThrTrp: 1.396 ± 0.481
1.21ThrTyr: 1.21 ± 0.388
0.0ThrXaa: 0.0 ± 0.0
Val
6.327ValAla: 6.327 ± 0.683
0.651ValCys: 0.651 ± 0.379
5.397ValAsp: 5.397 ± 0.781
5.025ValGlu: 5.025 ± 0.602
1.303ValPhe: 1.303 ± 0.368
5.769ValGly: 5.769 ± 0.756
1.582ValHis: 1.582 ± 0.338
3.722ValIle: 3.722 ± 0.437
4.559ValLys: 4.559 ± 0.742
5.025ValLeu: 5.025 ± 0.718
1.117ValMet: 1.117 ± 0.345
3.164ValAsn: 3.164 ± 0.462
3.164ValPro: 3.164 ± 0.556
1.675ValGln: 1.675 ± 0.382
2.978ValArg: 2.978 ± 0.5
3.908ValSer: 3.908 ± 0.655
4.746ValThr: 4.746 ± 1.054
5.304ValVal: 5.304 ± 0.638
1.117ValTrp: 1.117 ± 0.336
2.512ValTyr: 2.512 ± 0.435
0.0ValXaa: 0.0 ± 0.0
Trp
1.768TrpAla: 1.768 ± 0.4
0.093TrpCys: 0.093 ± 0.097
1.675TrpAsp: 1.675 ± 0.4
1.303TrpGlu: 1.303 ± 0.329
0.279TrpPhe: 0.279 ± 0.13
1.024TrpGly: 1.024 ± 0.367
0.372TrpHis: 0.372 ± 0.163
1.303TrpIle: 1.303 ± 0.323
1.21TrpLys: 1.21 ± 0.311
1.675TrpLeu: 1.675 ± 0.344
0.558TrpMet: 0.558 ± 0.285
0.651TrpAsn: 0.651 ± 0.264
0.558TrpPro: 0.558 ± 0.243
1.117TrpGln: 1.117 ± 0.225
1.117TrpArg: 1.117 ± 0.336
0.651TrpSer: 0.651 ± 0.226
1.675TrpThr: 1.675 ± 0.359
1.024TrpVal: 1.024 ± 0.338
0.465TrpTrp: 0.465 ± 0.213
0.372TrpTyr: 0.372 ± 0.163
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.047TyrAla: 2.047 ± 0.426
0.279TyrCys: 0.279 ± 0.169
2.605TyrAsp: 2.605 ± 0.537
1.675TyrGlu: 1.675 ± 0.364
0.744TyrPhe: 0.744 ± 0.343
1.489TyrGly: 1.489 ± 0.328
0.651TyrHis: 0.651 ± 0.286
0.837TyrIle: 0.837 ± 0.317
0.651TyrLys: 0.651 ± 0.264
1.303TyrLeu: 1.303 ± 0.453
0.372TyrMet: 0.372 ± 0.148
0.558TyrAsn: 0.558 ± 0.244
1.117TyrPro: 1.117 ± 0.369
0.837TyrGln: 0.837 ± 0.326
0.93TyrArg: 0.93 ± 0.283
2.14TyrSer: 2.14 ± 0.484
0.744TyrThr: 0.744 ± 0.226
1.954TyrVal: 1.954 ± 0.422
0.279TyrTrp: 0.279 ± 0.146
0.558TyrTyr: 0.558 ± 0.241
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 48 proteins (10748 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski