Amino acid dipepetide frequency for Lactobacillus virus LLKu

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.369AlaAla: 1.369 ± 0.811
0.526AlaCys: 0.526 ± 0.218
4.528AlaAsp: 4.528 ± 0.663
4.107AlaGlu: 4.107 ± 0.649
3.791AlaPhe: 3.791 ± 0.576
6.002AlaGly: 6.002 ± 1.639
0.526AlaHis: 0.526 ± 0.233
5.37AlaIle: 5.37 ± 0.576
6.002AlaLys: 6.002 ± 0.911
3.896AlaLeu: 3.896 ± 0.808
1.685AlaMet: 1.685 ± 0.496
4.001AlaAsn: 4.001 ± 0.906
1.158AlaPro: 1.158 ± 0.305
2.632AlaGln: 2.632 ± 0.77
3.054AlaArg: 3.054 ± 0.743
4.949AlaSer: 4.949 ± 1.113
3.369AlaThr: 3.369 ± 0.576
4.212AlaVal: 4.212 ± 0.806
0.842AlaTrp: 0.842 ± 0.262
3.369AlaTyr: 3.369 ± 0.752
0.0AlaXaa: 0.0 ± 0.0
Cys
0.526CysAla: 0.526 ± 0.286
0.211CysCys: 0.211 ± 0.137
0.211CysAsp: 0.211 ± 0.142
0.632CysGlu: 0.632 ± 0.245
0.105CysPhe: 0.105 ± 0.102
1.158CysGly: 1.158 ± 0.376
0.316CysHis: 0.316 ± 0.184
0.211CysIle: 0.211 ± 0.126
1.264CysLys: 1.264 ± 0.384
0.737CysLeu: 0.737 ± 0.323
0.421CysMet: 0.421 ± 0.223
0.737CysAsn: 0.737 ± 0.269
0.316CysPro: 0.316 ± 0.194
0.632CysGln: 0.632 ± 0.225
0.421CysArg: 0.421 ± 0.194
0.211CysSer: 0.211 ± 0.164
0.526CysThr: 0.526 ± 0.24
0.632CysVal: 0.632 ± 0.246
0.211CysTrp: 0.211 ± 0.155
0.316CysTyr: 0.316 ± 0.195
0.0CysXaa: 0.0 ± 0.0
Asp
3.159AspAla: 3.159 ± 0.407
0.526AspCys: 0.526 ± 0.223
4.528AspAsp: 4.528 ± 0.784
5.054AspGlu: 5.054 ± 0.916
2.738AspPhe: 2.738 ± 0.513
5.686AspGly: 5.686 ± 0.978
0.737AspHis: 0.737 ± 0.267
4.949AspIle: 4.949 ± 0.771
4.107AspLys: 4.107 ± 0.584
5.581AspLeu: 5.581 ± 0.704
2.001AspMet: 2.001 ± 0.574
4.001AspAsn: 4.001 ± 0.758
0.948AspPro: 0.948 ± 0.386
2.211AspGln: 2.211 ± 0.353
1.895AspArg: 1.895 ± 0.418
4.633AspSer: 4.633 ± 0.606
3.475AspThr: 3.475 ± 0.714
4.317AspVal: 4.317 ± 0.625
0.632AspTrp: 0.632 ± 0.31
3.054AspTyr: 3.054 ± 0.665
0.0AspXaa: 0.0 ± 0.0
Glu
5.265GluAla: 5.265 ± 0.634
0.632GluCys: 0.632 ± 0.275
5.475GluAsp: 5.475 ± 0.889
6.002GluGlu: 6.002 ± 1.149
2.422GluPhe: 2.422 ± 0.43
2.948GluGly: 2.948 ± 0.469
0.948GluHis: 0.948 ± 0.343
5.686GluIle: 5.686 ± 1.193
6.002GluLys: 6.002 ± 0.865
6.107GluLeu: 6.107 ± 1.15
2.422GluMet: 2.422 ± 0.53
4.422GluAsn: 4.422 ± 0.593
1.369GluPro: 1.369 ± 0.484
2.106GluGln: 2.106 ± 0.395
2.001GluArg: 2.001 ± 0.427
2.422GluSer: 2.422 ± 0.425
4.317GluThr: 4.317 ± 0.698
5.16GluVal: 5.16 ± 0.601
0.526GluTrp: 0.526 ± 0.248
2.527GluTyr: 2.527 ± 0.506
0.0GluXaa: 0.0 ± 0.0
Phe
2.001PheAla: 2.001 ± 0.537
0.421PheCys: 0.421 ± 0.189
2.211PheAsp: 2.211 ± 0.528
2.843PheGlu: 2.843 ± 0.53
1.579PhePhe: 1.579 ± 0.448
3.159PheGly: 3.159 ± 0.602
0.421PheHis: 0.421 ± 0.174
2.948PheIle: 2.948 ± 0.513
3.369PheLys: 3.369 ± 0.563
2.948PheLeu: 2.948 ± 0.676
0.948PheMet: 0.948 ± 0.35
2.317PheAsn: 2.317 ± 0.485
0.632PhePro: 0.632 ± 0.273
1.369PheGln: 1.369 ± 0.351
1.79PheArg: 1.79 ± 0.495
2.632PheSer: 2.632 ± 0.618
3.475PheThr: 3.475 ± 0.697
2.211PheVal: 2.211 ± 0.467
0.105PheTrp: 0.105 ± 0.115
1.264PheTyr: 1.264 ± 0.35
0.0PheXaa: 0.0 ± 0.0
Gly
4.317GlyAla: 4.317 ± 0.987
0.316GlyCys: 0.316 ± 0.187
4.528GlyAsp: 4.528 ± 0.892
3.264GlyGlu: 3.264 ± 0.629
3.475GlyPhe: 3.475 ± 0.6
3.896GlyGly: 3.896 ± 0.788
0.842GlyHis: 0.842 ± 0.262
5.791GlyIle: 5.791 ± 0.99
6.107GlyLys: 6.107 ± 0.923
5.37GlyLeu: 5.37 ± 0.734
2.106GlyMet: 2.106 ± 0.597
3.791GlyAsn: 3.791 ± 0.894
0.0GlyPro: 0.0 ± 0.0
1.79GlyGln: 1.79 ± 0.445
2.843GlyArg: 2.843 ± 0.587
4.317GlySer: 4.317 ± 1.04
4.001GlyThr: 4.001 ± 0.668
5.897GlyVal: 5.897 ± 0.727
0.842GlyTrp: 0.842 ± 0.247
3.369GlyTyr: 3.369 ± 0.691
0.0GlyXaa: 0.0 ± 0.0
His
0.948HisAla: 0.948 ± 0.287
0.105HisCys: 0.105 ± 0.095
0.526HisAsp: 0.526 ± 0.272
0.526HisGlu: 0.526 ± 0.343
0.316HisPhe: 0.316 ± 0.2
0.737HisGly: 0.737 ± 0.299
0.632HisHis: 0.632 ± 0.317
1.053HisIle: 1.053 ± 0.324
1.369HisLys: 1.369 ± 0.322
1.053HisLeu: 1.053 ± 0.288
0.316HisMet: 0.316 ± 0.184
0.526HisAsn: 0.526 ± 0.221
0.316HisPro: 0.316 ± 0.179
0.526HisGln: 0.526 ± 0.199
0.526HisArg: 0.526 ± 0.224
0.948HisSer: 0.948 ± 0.283
0.948HisThr: 0.948 ± 0.277
1.685HisVal: 1.685 ± 0.435
0.0HisTrp: 0.0 ± 0.0
0.842HisTyr: 0.842 ± 0.315
0.0HisXaa: 0.0 ± 0.0
Ile
4.107IleAla: 4.107 ± 0.698
0.737IleCys: 0.737 ± 0.272
4.633IleAsp: 4.633 ± 0.738
5.686IleGlu: 5.686 ± 0.884
1.895IlePhe: 1.895 ± 0.405
3.791IleGly: 3.791 ± 0.616
0.526IleHis: 0.526 ± 0.242
4.001IleIle: 4.001 ± 0.753
5.686IleLys: 5.686 ± 0.918
4.001IleLeu: 4.001 ± 0.943
2.001IleMet: 2.001 ± 0.481
5.16IleAsn: 5.16 ± 0.56
2.948IlePro: 2.948 ± 0.666
1.579IleGln: 1.579 ± 0.365
2.948IleArg: 2.948 ± 0.63
5.054IleSer: 5.054 ± 0.879
5.265IleThr: 5.265 ± 0.796
4.528IleVal: 4.528 ± 0.674
0.632IleTrp: 0.632 ± 0.25
3.369IleTyr: 3.369 ± 0.416
0.0IleXaa: 0.0 ± 0.0
Lys
6.002LysAla: 6.002 ± 1.042
0.316LysCys: 0.316 ± 0.184
4.001LysAsp: 4.001 ± 0.643
6.739LysGlu: 6.739 ± 0.957
3.054LysPhe: 3.054 ± 0.662
5.686LysGly: 5.686 ± 0.577
1.158LysHis: 1.158 ± 0.47
5.897LysIle: 5.897 ± 0.769
7.16LysLys: 7.16 ± 0.941
7.687LysLeu: 7.687 ± 0.895
1.685LysMet: 1.685 ± 0.398
7.16LysAsn: 7.16 ± 1.024
1.685LysPro: 1.685 ± 0.379
3.369LysGln: 3.369 ± 0.572
4.528LysArg: 4.528 ± 0.705
4.738LysSer: 4.738 ± 0.594
4.317LysThr: 4.317 ± 0.701
5.791LysVal: 5.791 ± 1.034
0.421LysTrp: 0.421 ± 0.178
3.369LysTyr: 3.369 ± 0.628
0.0LysXaa: 0.0 ± 0.0
Leu
5.581LeuAla: 5.581 ± 0.892
1.264LeuCys: 1.264 ± 0.452
5.265LeuAsp: 5.265 ± 0.89
4.633LeuGlu: 4.633 ± 0.531
2.422LeuPhe: 2.422 ± 0.526
5.37LeuGly: 5.37 ± 0.957
1.369LeuHis: 1.369 ± 0.428
4.738LeuIle: 4.738 ± 0.741
7.265LeuLys: 7.265 ± 0.872
6.423LeuLeu: 6.423 ± 0.889
1.579LeuMet: 1.579 ± 0.337
5.054LeuAsn: 5.054 ± 0.634
2.843LeuPro: 2.843 ± 0.652
1.79LeuGln: 1.79 ± 0.327
3.264LeuArg: 3.264 ± 0.752
5.37LeuSer: 5.37 ± 0.936
4.844LeuThr: 4.844 ± 0.577
4.633LeuVal: 4.633 ± 0.726
0.421LeuTrp: 0.421 ± 0.212
2.422LeuTyr: 2.422 ± 0.6
0.0LeuXaa: 0.0 ± 0.0
Met
3.054MetAla: 3.054 ± 0.581
0.211MetCys: 0.211 ± 0.139
1.579MetAsp: 1.579 ± 0.38
1.579MetGlu: 1.579 ± 0.405
0.316MetPhe: 0.316 ± 0.217
0.948MetGly: 0.948 ± 0.29
0.211MetHis: 0.211 ± 0.147
1.474MetIle: 1.474 ± 0.422
3.369MetLys: 3.369 ± 0.585
1.264MetLeu: 1.264 ± 0.345
0.737MetMet: 0.737 ± 0.256
1.685MetAsn: 1.685 ± 0.425
0.737MetPro: 0.737 ± 0.266
0.632MetGln: 0.632 ± 0.283
1.895MetArg: 1.895 ± 0.386
1.369MetSer: 1.369 ± 0.538
1.79MetThr: 1.79 ± 0.512
2.001MetVal: 2.001 ± 0.519
0.316MetTrp: 0.316 ± 0.15
0.211MetTyr: 0.211 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
4.738AsnAla: 4.738 ± 1.12
0.737AsnCys: 0.737 ± 0.307
3.685AsnAsp: 3.685 ± 0.66
5.686AsnGlu: 5.686 ± 0.96
2.843AsnPhe: 2.843 ± 0.505
6.95AsnGly: 6.95 ± 0.972
1.158AsnHis: 1.158 ± 0.376
3.264AsnIle: 3.264 ± 0.627
5.054AsnLys: 5.054 ± 0.714
5.265AsnLeu: 5.265 ± 0.886
1.474AsnMet: 1.474 ± 0.309
4.212AsnAsn: 4.212 ± 1.041
1.79AsnPro: 1.79 ± 0.473
2.422AsnGln: 2.422 ± 0.526
2.527AsnArg: 2.527 ± 0.472
4.528AsnSer: 4.528 ± 0.781
3.475AsnThr: 3.475 ± 0.786
3.369AsnVal: 3.369 ± 0.468
1.158AsnTrp: 1.158 ± 0.369
2.948AsnTyr: 2.948 ± 0.629
0.0AsnXaa: 0.0 ± 0.0
Pro
2.001ProAla: 2.001 ± 0.437
0.105ProCys: 0.105 ± 0.123
1.158ProAsp: 1.158 ± 0.361
1.264ProGlu: 1.264 ± 0.32
0.842ProPhe: 0.842 ± 0.261
1.369ProGly: 1.369 ± 0.537
0.421ProHis: 0.421 ± 0.234
1.895ProIle: 1.895 ± 0.376
2.317ProLys: 2.317 ± 0.579
1.579ProLeu: 1.579 ± 0.436
0.105ProMet: 0.105 ± 0.099
2.001ProAsn: 2.001 ± 0.431
0.526ProPro: 0.526 ± 0.232
0.737ProGln: 0.737 ± 0.262
0.737ProArg: 0.737 ± 0.221
2.001ProSer: 2.001 ± 0.579
1.369ProThr: 1.369 ± 0.346
2.527ProVal: 2.527 ± 0.446
0.421ProTrp: 0.421 ± 0.261
1.158ProTyr: 1.158 ± 0.326
0.0ProXaa: 0.0 ± 0.0
Gln
2.948GlnAla: 2.948 ± 0.5
0.421GlnCys: 0.421 ± 0.27
2.422GlnAsp: 2.422 ± 0.429
2.001GlnGlu: 2.001 ± 0.498
0.948GlnPhe: 0.948 ± 0.285
1.685GlnGly: 1.685 ± 0.491
0.421GlnHis: 0.421 ± 0.198
3.054GlnIle: 3.054 ± 0.536
2.738GlnLys: 2.738 ± 0.717
3.264GlnLeu: 3.264 ± 0.92
0.632GlnMet: 0.632 ± 0.292
1.79GlnAsn: 1.79 ± 0.486
1.158GlnPro: 1.158 ± 0.346
1.369GlnGln: 1.369 ± 0.375
1.158GlnArg: 1.158 ± 0.314
2.211GlnSer: 2.211 ± 0.496
1.474GlnThr: 1.474 ± 0.412
2.422GlnVal: 2.422 ± 0.423
0.211GlnTrp: 0.211 ± 0.133
1.579GlnTyr: 1.579 ± 0.361
0.0GlnXaa: 0.0 ± 0.0
Arg
2.422ArgAla: 2.422 ± 0.592
0.632ArgCys: 0.632 ± 0.3
2.632ArgAsp: 2.632 ± 0.625
2.948ArgGlu: 2.948 ± 0.642
2.106ArgPhe: 2.106 ± 0.33
2.106ArgGly: 2.106 ± 0.459
0.526ArgHis: 0.526 ± 0.236
2.948ArgIle: 2.948 ± 0.447
3.791ArgLys: 3.791 ± 0.579
3.475ArgLeu: 3.475 ± 0.537
1.053ArgMet: 1.053 ± 0.334
3.054ArgAsn: 3.054 ± 0.778
0.842ArgPro: 0.842 ± 0.366
0.948ArgGln: 0.948 ± 0.32
2.001ArgArg: 2.001 ± 0.408
2.211ArgSer: 2.211 ± 0.461
2.106ArgThr: 2.106 ± 0.476
2.948ArgVal: 2.948 ± 0.455
0.737ArgTrp: 0.737 ± 0.279
2.317ArgTyr: 2.317 ± 0.509
0.0ArgXaa: 0.0 ± 0.0
Ser
4.212SerAla: 4.212 ± 0.62
0.842SerCys: 0.842 ± 0.337
3.685SerAsp: 3.685 ± 0.611
3.791SerGlu: 3.791 ± 0.496
3.264SerPhe: 3.264 ± 0.561
4.001SerGly: 4.001 ± 0.747
0.842SerHis: 0.842 ± 0.276
3.475SerIle: 3.475 ± 0.615
5.265SerLys: 5.265 ± 0.608
3.58SerLeu: 3.58 ± 0.529
2.317SerMet: 2.317 ± 0.448
5.37SerAsn: 5.37 ± 0.877
2.001SerPro: 2.001 ± 0.449
2.106SerGln: 2.106 ± 0.513
2.632SerArg: 2.632 ± 0.803
5.16SerSer: 5.16 ± 0.731
4.107SerThr: 4.107 ± 0.69
4.844SerVal: 4.844 ± 0.688
1.264SerTrp: 1.264 ± 0.391
3.054SerTyr: 3.054 ± 0.615
0.0SerXaa: 0.0 ± 0.0
Thr
4.738ThrAla: 4.738 ± 0.726
0.316ThrCys: 0.316 ± 0.192
4.107ThrAsp: 4.107 ± 0.731
4.212ThrGlu: 4.212 ± 0.695
2.422ThrPhe: 2.422 ± 0.53
3.685ThrGly: 3.685 ± 0.613
0.526ThrHis: 0.526 ± 0.224
3.475ThrIle: 3.475 ± 0.731
4.738ThrLys: 4.738 ± 0.829
5.16ThrLeu: 5.16 ± 0.749
0.948ThrMet: 0.948 ± 0.32
3.58ThrAsn: 3.58 ± 0.896
1.895ThrPro: 1.895 ± 0.537
2.317ThrGln: 2.317 ± 0.512
2.211ThrArg: 2.211 ± 0.446
3.685ThrSer: 3.685 ± 0.692
2.843ThrThr: 2.843 ± 0.578
5.37ThrVal: 5.37 ± 0.971
0.737ThrTrp: 0.737 ± 0.233
2.738ThrTyr: 2.738 ± 0.546
0.0ThrXaa: 0.0 ± 0.0
Val
5.791ValAla: 5.791 ± 0.96
0.737ValCys: 0.737 ± 0.328
4.528ValAsp: 4.528 ± 0.718
4.844ValGlu: 4.844 ± 0.758
2.106ValPhe: 2.106 ± 0.457
4.528ValGly: 4.528 ± 0.764
1.579ValHis: 1.579 ± 0.354
4.528ValIle: 4.528 ± 0.627
5.37ValLys: 5.37 ± 0.86
4.844ValLeu: 4.844 ± 0.52
1.369ValMet: 1.369 ± 0.341
5.37ValAsn: 5.37 ± 0.724
1.579ValPro: 1.579 ± 0.396
2.317ValGln: 2.317 ± 0.472
2.422ValArg: 2.422 ± 0.446
5.581ValSer: 5.581 ± 0.728
4.528ValThr: 4.528 ± 0.603
4.107ValVal: 4.107 ± 0.649
0.632ValTrp: 0.632 ± 0.27
3.475ValTyr: 3.475 ± 0.511
0.0ValXaa: 0.0 ± 0.0
Trp
0.632TrpAla: 0.632 ± 0.332
0.105TrpCys: 0.105 ± 0.1
0.316TrpAsp: 0.316 ± 0.185
0.632TrpGlu: 0.632 ± 0.272
0.737TrpPhe: 0.737 ± 0.295
0.842TrpGly: 0.842 ± 0.287
0.421TrpHis: 0.421 ± 0.204
0.421TrpIle: 0.421 ± 0.187
1.053TrpLys: 1.053 ± 0.272
1.158TrpLeu: 1.158 ± 0.351
0.211TrpMet: 0.211 ± 0.152
0.948TrpAsn: 0.948 ± 0.397
0.0TrpPro: 0.0 ± 0.0
0.526TrpGln: 0.526 ± 0.195
0.421TrpArg: 0.421 ± 0.181
0.526TrpSer: 0.526 ± 0.336
0.526TrpThr: 0.526 ± 0.226
0.421TrpVal: 0.421 ± 0.202
0.0TrpTrp: 0.0 ± 0.0
0.316TrpTyr: 0.316 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.579TyrAla: 1.579 ± 0.463
0.632TyrCys: 0.632 ± 0.312
4.212TyrAsp: 4.212 ± 0.579
2.317TyrGlu: 2.317 ± 0.502
1.264TyrPhe: 1.264 ± 0.28
2.317TyrGly: 2.317 ± 0.478
0.316TyrHis: 0.316 ± 0.208
3.475TyrIle: 3.475 ± 0.624
2.738TyrLys: 2.738 ± 0.542
3.159TyrLeu: 3.159 ± 0.659
1.264TyrMet: 1.264 ± 0.281
2.106TyrAsn: 2.106 ± 0.458
1.79TyrPro: 1.79 ± 0.428
2.527TyrGln: 2.527 ± 0.502
2.527TyrArg: 2.527 ± 0.594
3.264TyrSer: 3.264 ± 0.664
2.948TyrThr: 2.948 ± 0.556
3.159TyrVal: 3.159 ± 0.616
0.105TyrTrp: 0.105 ± 0.118
2.001TyrTyr: 2.001 ± 0.462
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (9498 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski