Amino acid dipepetide frequency for Lactobacillus prophage Lj965

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.029AlaAla: 4.029 ± 0.831
0.252AlaCys: 0.252 ± 0.159
3.189AlaAsp: 3.189 ± 0.373
5.12AlaGlu: 5.12 ± 0.673
3.105AlaPhe: 3.105 ± 0.543
4.364AlaGly: 4.364 ± 0.833
0.671AlaHis: 0.671 ± 0.274
5.707AlaIle: 5.707 ± 0.731
8.645AlaLys: 8.645 ± 1.079
6.043AlaLeu: 6.043 ± 0.827
1.762AlaMet: 1.762 ± 0.376
4.7AlaAsn: 4.7 ± 0.539
1.091AlaPro: 1.091 ± 0.368
3.693AlaGln: 3.693 ± 0.485
2.266AlaArg: 2.266 ± 0.424
4.029AlaSer: 4.029 ± 0.639
4.868AlaThr: 4.868 ± 0.607
5.12AlaVal: 5.12 ± 0.626
1.259AlaTrp: 1.259 ± 0.53
2.854AlaTyr: 2.854 ± 0.444
0.0AlaXaa: 0.0 ± 0.0
Cys
0.587CysAla: 0.587 ± 0.231
0.0CysCys: 0.0 ± 0.0
0.671CysAsp: 0.671 ± 0.355
0.168CysGlu: 0.168 ± 0.101
0.0CysPhe: 0.0 ± 0.0
0.252CysGly: 0.252 ± 0.131
0.084CysHis: 0.084 ± 0.083
0.252CysIle: 0.252 ± 0.129
0.42CysLys: 0.42 ± 0.237
0.504CysLeu: 0.504 ± 0.242
0.252CysMet: 0.252 ± 0.172
0.42CysAsn: 0.42 ± 0.209
0.336CysPro: 0.336 ± 0.17
0.084CysGln: 0.084 ± 0.07
0.336CysArg: 0.336 ± 0.208
0.168CysSer: 0.168 ± 0.117
0.336CysThr: 0.336 ± 0.193
0.084CysVal: 0.084 ± 0.081
0.084CysTrp: 0.084 ± 0.102
0.252CysTyr: 0.252 ± 0.178
0.0CysXaa: 0.0 ± 0.0
Asp
4.532AspAla: 4.532 ± 0.514
0.252AspCys: 0.252 ± 0.132
4.029AspAsp: 4.029 ± 0.668
4.364AspGlu: 4.364 ± 0.756
3.273AspPhe: 3.273 ± 0.688
4.196AspGly: 4.196 ± 0.659
1.007AspHis: 1.007 ± 0.337
4.952AspIle: 4.952 ± 0.665
5.959AspLys: 5.959 ± 0.705
6.798AspLeu: 6.798 ± 0.588
1.511AspMet: 1.511 ± 0.381
3.945AspAsn: 3.945 ± 0.394
2.35AspPro: 2.35 ± 0.586
2.098AspGln: 2.098 ± 0.398
2.518AspArg: 2.518 ± 0.564
3.441AspSer: 3.441 ± 0.44
3.945AspThr: 3.945 ± 0.47
3.273AspVal: 3.273 ± 0.424
1.175AspTrp: 1.175 ± 0.39
2.35AspTyr: 2.35 ± 0.607
0.0AspXaa: 0.0 ± 0.0
Glu
5.623GluAla: 5.623 ± 0.704
0.168GluCys: 0.168 ± 0.127
4.364GluAsp: 4.364 ± 0.787
3.693GluGlu: 3.693 ± 0.589
2.014GluPhe: 2.014 ± 0.451
2.937GluGly: 2.937 ± 0.565
0.839GluHis: 0.839 ± 0.164
4.029GluIle: 4.029 ± 0.683
5.539GluLys: 5.539 ± 0.886
7.47GluLeu: 7.47 ± 1.033
2.014GluMet: 2.014 ± 0.352
4.196GluAsn: 4.196 ± 0.465
1.595GluPro: 1.595 ± 0.35
2.937GluGln: 2.937 ± 0.44
3.189GluArg: 3.189 ± 0.648
2.518GluSer: 2.518 ± 0.46
2.686GluThr: 2.686 ± 0.496
3.693GluVal: 3.693 ± 0.687
0.671GluTrp: 0.671 ± 0.187
1.679GluTyr: 1.679 ± 0.454
0.0GluXaa: 0.0 ± 0.0
Phe
3.021PheAla: 3.021 ± 0.486
0.336PheCys: 0.336 ± 0.169
2.686PheAsp: 2.686 ± 0.618
2.266PheGlu: 2.266 ± 0.481
1.511PhePhe: 1.511 ± 0.333
3.609PheGly: 3.609 ± 0.544
0.252PheHis: 0.252 ± 0.106
2.182PheIle: 2.182 ± 0.381
3.525PheLys: 3.525 ± 0.765
2.518PheLeu: 2.518 ± 0.522
1.175PheMet: 1.175 ± 0.293
3.273PheAsn: 3.273 ± 0.371
1.511PhePro: 1.511 ± 0.368
0.839PheGln: 0.839 ± 0.238
1.343PheArg: 1.343 ± 0.348
3.273PheSer: 3.273 ± 0.602
2.77PheThr: 2.77 ± 0.493
2.266PheVal: 2.266 ± 0.458
0.336PheTrp: 0.336 ± 0.176
2.182PheTyr: 2.182 ± 0.418
0.0PheXaa: 0.0 ± 0.0
Gly
3.945GlyAla: 3.945 ± 0.867
0.336GlyCys: 0.336 ± 0.183
4.112GlyAsp: 4.112 ± 0.758
3.525GlyGlu: 3.525 ± 0.422
2.686GlyPhe: 2.686 ± 0.467
4.029GlyGly: 4.029 ± 1.009
1.091GlyHis: 1.091 ± 0.348
3.945GlyIle: 3.945 ± 0.676
4.616GlyLys: 4.616 ± 0.555
5.539GlyLeu: 5.539 ± 0.779
2.098GlyMet: 2.098 ± 0.421
4.616GlyAsn: 4.616 ± 0.968
0.504GlyPro: 0.504 ± 0.194
2.266GlyGln: 2.266 ± 0.314
1.846GlyArg: 1.846 ± 0.412
4.784GlySer: 4.784 ± 1.164
4.448GlyThr: 4.448 ± 0.933
3.273GlyVal: 3.273 ± 0.406
0.923GlyTrp: 0.923 ± 0.305
3.021GlyTyr: 3.021 ± 0.53
0.0GlyXaa: 0.0 ± 0.0
His
0.587HisAla: 0.587 ± 0.197
0.0HisCys: 0.0 ± 0.0
1.259HisAsp: 1.259 ± 0.308
0.755HisGlu: 0.755 ± 0.349
0.42HisPhe: 0.42 ± 0.226
1.846HisGly: 1.846 ± 0.39
0.336HisHis: 0.336 ± 0.177
1.343HisIle: 1.343 ± 0.486
1.091HisLys: 1.091 ± 0.277
1.007HisLeu: 1.007 ± 0.352
0.252HisMet: 0.252 ± 0.152
0.923HisAsn: 0.923 ± 0.277
0.42HisPro: 0.42 ± 0.2
0.671HisGln: 0.671 ± 0.247
0.587HisArg: 0.587 ± 0.227
0.755HisSer: 0.755 ± 0.271
0.671HisThr: 0.671 ± 0.297
0.755HisVal: 0.755 ± 0.221
0.336HisTrp: 0.336 ± 0.197
0.336HisTyr: 0.336 ± 0.178
0.0HisXaa: 0.0 ± 0.0
Ile
4.112IleAla: 4.112 ± 0.685
0.587IleCys: 0.587 ± 0.201
4.7IleAsp: 4.7 ± 0.516
3.945IleGlu: 3.945 ± 0.844
2.854IlePhe: 2.854 ± 0.433
3.189IleGly: 3.189 ± 0.832
0.923IleHis: 0.923 ± 0.294
3.609IleIle: 3.609 ± 0.745
7.134IleLys: 7.134 ± 0.78
3.021IleLeu: 3.021 ± 0.607
1.007IleMet: 1.007 ± 0.288
5.371IleAsn: 5.371 ± 0.752
1.93IlePro: 1.93 ± 0.383
2.77IleGln: 2.77 ± 0.586
3.357IleArg: 3.357 ± 0.499
5.287IleSer: 5.287 ± 0.642
4.448IleThr: 4.448 ± 0.652
4.112IleVal: 4.112 ± 0.523
1.007IleTrp: 1.007 ± 0.331
2.854IleTyr: 2.854 ± 0.585
0.0IleXaa: 0.0 ± 0.0
Lys
8.225LysAla: 8.225 ± 0.988
0.336LysCys: 0.336 ± 0.18
7.218LysAsp: 7.218 ± 1.015
5.12LysGlu: 5.12 ± 0.851
3.441LysPhe: 3.441 ± 0.521
4.532LysGly: 4.532 ± 0.737
1.595LysHis: 1.595 ± 0.37
6.043LysIle: 6.043 ± 0.737
7.721LysLys: 7.721 ± 1.008
7.134LysLeu: 7.134 ± 0.754
2.266LysMet: 2.266 ± 0.35
4.784LysAsn: 4.784 ± 0.531
2.266LysPro: 2.266 ± 0.493
4.952LysGln: 4.952 ± 0.874
4.112LysArg: 4.112 ± 0.84
4.7LysSer: 4.7 ± 0.888
4.364LysThr: 4.364 ± 0.746
5.623LysVal: 5.623 ± 0.507
0.839LysTrp: 0.839 ± 0.238
3.441LysTyr: 3.441 ± 0.548
0.0LysXaa: 0.0 ± 0.0
Leu
6.714LeuAla: 6.714 ± 1.038
0.336LeuCys: 0.336 ± 0.155
5.707LeuAsp: 5.707 ± 0.675
5.036LeuGlu: 5.036 ± 0.545
2.77LeuPhe: 2.77 ± 0.717
6.714LeuGly: 6.714 ± 1.133
1.259LeuHis: 1.259 ± 0.335
3.945LeuIle: 3.945 ± 0.556
7.47LeuLys: 7.47 ± 1.026
6.295LeuLeu: 6.295 ± 1.0
1.511LeuMet: 1.511 ± 0.437
5.371LeuAsn: 5.371 ± 0.782
3.609LeuPro: 3.609 ± 0.552
3.777LeuGln: 3.777 ± 0.557
3.693LeuArg: 3.693 ± 0.66
5.623LeuSer: 5.623 ± 0.516
5.204LeuThr: 5.204 ± 0.747
3.105LeuVal: 3.105 ± 0.473
1.175LeuTrp: 1.175 ± 0.326
1.93LeuTyr: 1.93 ± 0.524
0.0LeuXaa: 0.0 ± 0.0
Met
1.259MetAla: 1.259 ± 0.325
0.168MetCys: 0.168 ± 0.145
1.427MetAsp: 1.427 ± 0.361
1.595MetGlu: 1.595 ± 0.351
0.587MetPhe: 0.587 ± 0.219
0.755MetGly: 0.755 ± 0.206
0.336MetHis: 0.336 ± 0.177
0.923MetIle: 0.923 ± 0.37
2.182MetLys: 2.182 ± 0.496
2.014MetLeu: 2.014 ± 0.458
0.504MetMet: 0.504 ± 0.201
1.93MetAsn: 1.93 ± 0.448
0.923MetPro: 0.923 ± 0.245
1.259MetGln: 1.259 ± 0.316
0.923MetArg: 0.923 ± 0.202
1.93MetSer: 1.93 ± 0.474
1.762MetThr: 1.762 ± 0.357
1.259MetVal: 1.259 ± 0.317
0.168MetTrp: 0.168 ± 0.111
0.755MetTyr: 0.755 ± 0.3
0.0MetXaa: 0.0 ± 0.0
Asn
5.287AsnAla: 5.287 ± 0.829
0.504AsnCys: 0.504 ± 0.27
4.532AsnAsp: 4.532 ± 0.546
5.12AsnGlu: 5.12 ± 0.549
2.854AsnPhe: 2.854 ± 0.45
3.945AsnGly: 3.945 ± 0.718
1.343AsnHis: 1.343 ± 0.44
4.616AsnIle: 4.616 ± 0.618
4.448AsnLys: 4.448 ± 0.991
4.532AsnLeu: 4.532 ± 0.827
1.846AsnMet: 1.846 ± 0.314
5.204AsnAsn: 5.204 ± 0.724
1.679AsnPro: 1.679 ± 0.48
3.273AsnGln: 3.273 ± 0.431
2.098AsnArg: 2.098 ± 0.37
4.616AsnSer: 4.616 ± 1.166
3.021AsnThr: 3.021 ± 0.528
4.7AsnVal: 4.7 ± 0.545
1.511AsnTrp: 1.511 ± 0.229
2.686AsnTyr: 2.686 ± 0.473
0.0AsnXaa: 0.0 ± 0.0
Pro
2.098ProAla: 2.098 ± 0.406
0.084ProCys: 0.084 ± 0.097
2.182ProAsp: 2.182 ± 0.549
2.434ProGlu: 2.434 ± 0.603
1.007ProPhe: 1.007 ± 0.309
1.007ProGly: 1.007 ± 0.292
0.252ProHis: 0.252 ± 0.122
1.93ProIle: 1.93 ± 0.458
3.105ProLys: 3.105 ± 0.528
2.434ProLeu: 2.434 ± 0.505
0.42ProMet: 0.42 ± 0.188
1.93ProAsn: 1.93 ± 0.477
0.923ProPro: 0.923 ± 0.257
1.511ProGln: 1.511 ± 0.391
1.091ProArg: 1.091 ± 0.369
2.014ProSer: 2.014 ± 0.453
1.595ProThr: 1.595 ± 0.46
1.427ProVal: 1.427 ± 0.402
0.336ProTrp: 0.336 ± 0.198
0.923ProTyr: 0.923 ± 0.272
0.0ProXaa: 0.0 ± 0.0
Gln
4.029GlnAla: 4.029 ± 0.788
0.168GlnCys: 0.168 ± 0.098
2.098GlnAsp: 2.098 ± 0.536
2.602GlnGlu: 2.602 ± 0.348
1.679GlnPhe: 1.679 ± 0.28
2.686GlnGly: 2.686 ± 0.484
0.42GlnHis: 0.42 ± 0.155
3.105GlnIle: 3.105 ± 0.506
4.28GlnLys: 4.28 ± 0.697
2.518GlnLeu: 2.518 ± 0.634
1.007GlnMet: 1.007 ± 0.34
3.021GlnAsn: 3.021 ± 0.53
1.007GlnPro: 1.007 ± 0.372
2.014GlnGln: 2.014 ± 0.459
2.098GlnArg: 2.098 ± 0.347
2.937GlnSer: 2.937 ± 0.633
3.357GlnThr: 3.357 ± 0.762
2.518GlnVal: 2.518 ± 0.408
0.504GlnTrp: 0.504 ± 0.158
1.93GlnTyr: 1.93 ± 0.55
0.0GlnXaa: 0.0 ± 0.0
Arg
2.686ArgAla: 2.686 ± 0.385
0.084ArgCys: 0.084 ± 0.083
2.854ArgAsp: 2.854 ± 0.522
2.602ArgGlu: 2.602 ± 0.529
1.343ArgPhe: 1.343 ± 0.254
1.595ArgGly: 1.595 ± 0.466
0.587ArgHis: 0.587 ± 0.22
3.861ArgIle: 3.861 ± 0.486
3.609ArgLys: 3.609 ± 0.466
4.7ArgLeu: 4.7 ± 0.658
0.504ArgMet: 0.504 ± 0.201
2.686ArgAsn: 2.686 ± 0.4
0.755ArgPro: 0.755 ± 0.219
1.762ArgGln: 1.762 ± 0.384
2.014ArgArg: 2.014 ± 0.427
2.77ArgSer: 2.77 ± 0.645
2.014ArgThr: 2.014 ± 0.468
1.93ArgVal: 1.93 ± 0.555
0.504ArgTrp: 0.504 ± 0.241
1.595ArgTyr: 1.595 ± 0.488
0.0ArgXaa: 0.0 ± 0.0
Ser
4.364SerAla: 4.364 ± 0.51
0.168SerCys: 0.168 ± 0.117
4.112SerAsp: 4.112 ± 0.572
4.196SerGlu: 4.196 ± 0.56
3.189SerPhe: 3.189 ± 0.601
5.875SerGly: 5.875 ± 1.088
0.839SerHis: 0.839 ± 0.233
4.28SerIle: 4.28 ± 0.496
4.952SerLys: 4.952 ± 0.944
5.539SerLeu: 5.539 ± 0.654
1.259SerMet: 1.259 ± 0.391
4.112SerAsn: 4.112 ± 0.499
1.511SerPro: 1.511 ± 0.407
2.854SerGln: 2.854 ± 0.535
2.518SerArg: 2.518 ± 0.625
5.875SerSer: 5.875 ± 1.001
3.945SerThr: 3.945 ± 0.759
3.525SerVal: 3.525 ± 0.432
1.427SerTrp: 1.427 ± 0.427
2.77SerTyr: 2.77 ± 0.526
0.0SerXaa: 0.0 ± 0.0
Thr
4.784ThrAla: 4.784 ± 0.583
0.168ThrCys: 0.168 ± 0.115
4.784ThrAsp: 4.784 ± 0.635
3.105ThrGlu: 3.105 ± 0.468
2.518ThrPhe: 2.518 ± 0.534
3.105ThrGly: 3.105 ± 0.459
0.587ThrHis: 0.587 ± 0.228
4.532ThrIle: 4.532 ± 0.565
4.364ThrLys: 4.364 ± 0.556
4.952ThrLeu: 4.952 ± 0.519
1.259ThrMet: 1.259 ± 0.403
3.945ThrAsn: 3.945 ± 0.82
2.182ThrPro: 2.182 ± 0.376
2.182ThrGln: 2.182 ± 0.442
2.434ThrArg: 2.434 ± 0.439
4.532ThrSer: 4.532 ± 0.748
3.021ThrThr: 3.021 ± 0.648
3.777ThrVal: 3.777 ± 0.785
0.839ThrTrp: 0.839 ± 0.362
2.686ThrTyr: 2.686 ± 0.513
0.0ThrXaa: 0.0 ± 0.0
Val
2.854ValAla: 2.854 ± 0.468
0.671ValCys: 0.671 ± 0.232
3.609ValAsp: 3.609 ± 0.605
2.854ValGlu: 2.854 ± 0.51
2.434ValPhe: 2.434 ± 0.533
3.273ValGly: 3.273 ± 0.444
1.091ValHis: 1.091 ± 0.257
4.112ValIle: 4.112 ± 0.351
5.455ValLys: 5.455 ± 0.869
4.029ValLeu: 4.029 ± 0.56
1.175ValMet: 1.175 ± 0.355
4.532ValAsn: 4.532 ± 0.647
2.35ValPro: 2.35 ± 0.485
2.686ValGln: 2.686 ± 0.417
1.93ValArg: 1.93 ± 0.44
4.196ValSer: 4.196 ± 0.676
4.029ValThr: 4.029 ± 0.507
2.77ValVal: 2.77 ± 0.716
1.007ValTrp: 1.007 ± 0.27
2.098ValTyr: 2.098 ± 0.45
0.0ValXaa: 0.0 ± 0.0
Trp
1.091TrpAla: 1.091 ± 0.476
0.168TrpCys: 0.168 ± 0.119
0.671TrpAsp: 0.671 ± 0.206
0.671TrpGlu: 0.671 ± 0.281
1.175TrpPhe: 1.175 ± 0.259
1.007TrpGly: 1.007 ± 0.339
0.336TrpHis: 0.336 ± 0.178
1.259TrpIle: 1.259 ± 0.308
1.091TrpLys: 1.091 ± 0.291
1.007TrpLeu: 1.007 ± 0.239
0.084TrpMet: 0.084 ± 0.075
1.091TrpAsn: 1.091 ± 0.294
0.168TrpPro: 0.168 ± 0.101
0.923TrpGln: 0.923 ± 0.25
0.336TrpArg: 0.336 ± 0.128
1.175TrpSer: 1.175 ± 0.351
0.839TrpThr: 0.839 ± 0.269
1.091TrpVal: 1.091 ± 0.346
0.336TrpTrp: 0.336 ± 0.166
0.504TrpTyr: 0.504 ± 0.224
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.937TyrAla: 2.937 ± 0.503
0.504TyrCys: 0.504 ± 0.28
1.762TyrAsp: 1.762 ± 0.332
2.686TyrGlu: 2.686 ± 0.601
2.014TyrPhe: 2.014 ± 0.421
2.686TyrGly: 2.686 ± 0.454
0.504TyrHis: 0.504 ± 0.254
1.679TyrIle: 1.679 ± 0.363
3.105TyrLys: 3.105 ± 0.654
3.189TyrLeu: 3.189 ± 0.854
0.671TyrMet: 0.671 ± 0.229
1.762TyrAsn: 1.762 ± 0.4
1.595TyrPro: 1.595 ± 0.383
1.427TyrGln: 1.427 ± 0.346
1.762TyrArg: 1.762 ± 0.358
2.686TyrSer: 2.686 ± 0.579
2.434TyrThr: 2.434 ± 0.432
2.854TyrVal: 2.854 ± 0.503
0.587TyrTrp: 0.587 ± 0.27
1.427TyrTyr: 1.427 ± 0.528
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 46 proteins (11916 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski