Amino acid dipepetide frequency for Lactobacillus phage JNU_P4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.105AlaAla: 6.105 ± 0.736
0.076AlaCys: 0.076 ± 0.073
5.265AlaAsp: 5.265 ± 1.066
5.265AlaGlu: 5.265 ± 0.653
2.137AlaPhe: 2.137 ± 0.439
4.426AlaGly: 4.426 ± 0.559
1.145AlaHis: 1.145 ± 0.296
5.418AlaIle: 5.418 ± 0.578
6.868AlaLys: 6.868 ± 1.051
7.325AlaLeu: 7.325 ± 0.791
2.747AlaMet: 2.747 ± 0.567
4.121AlaAsn: 4.121 ± 0.588
1.755AlaPro: 1.755 ± 0.345
3.434AlaGln: 3.434 ± 0.52
2.976AlaArg: 2.976 ± 0.574
4.044AlaSer: 4.044 ± 0.613
5.57AlaThr: 5.57 ± 0.666
5.647AlaVal: 5.647 ± 0.631
0.916AlaTrp: 0.916 ± 0.275
3.052AlaTyr: 3.052 ± 0.485
0.0AlaXaa: 0.0 ± 0.0
Cys
0.229CysAla: 0.229 ± 0.18
0.0CysCys: 0.0 ± 0.0
0.229CysAsp: 0.229 ± 0.136
0.229CysGlu: 0.229 ± 0.133
0.305CysPhe: 0.305 ± 0.151
0.229CysGly: 0.229 ± 0.125
0.076CysHis: 0.076 ± 0.069
0.153CysIle: 0.153 ± 0.099
0.0CysLys: 0.0 ± 0.0
0.687CysLeu: 0.687 ± 0.225
0.153CysMet: 0.153 ± 0.091
0.305CysAsn: 0.305 ± 0.13
0.229CysPro: 0.229 ± 0.141
0.229CysGln: 0.229 ± 0.132
0.229CysArg: 0.229 ± 0.131
0.305CysSer: 0.305 ± 0.152
0.076CysThr: 0.076 ± 0.078
0.229CysVal: 0.229 ± 0.124
0.076CysTrp: 0.076 ± 0.09
0.229CysTyr: 0.229 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
4.502AspAla: 4.502 ± 0.638
0.305AspCys: 0.305 ± 0.193
6.791AspAsp: 6.791 ± 1.108
4.807AspGlu: 4.807 ± 0.736
2.518AspPhe: 2.518 ± 0.401
6.562AspGly: 6.562 ± 0.766
1.374AspHis: 1.374 ± 0.366
4.426AspIle: 4.426 ± 0.577
4.426AspLys: 4.426 ± 0.603
4.884AspLeu: 4.884 ± 0.679
2.518AspMet: 2.518 ± 0.431
3.129AspAsn: 3.129 ± 0.48
2.366AspPro: 2.366 ± 0.387
2.289AspGln: 2.289 ± 0.405
2.671AspArg: 2.671 ± 0.494
4.96AspSer: 4.96 ± 0.603
4.655AspThr: 4.655 ± 0.591
4.121AspVal: 4.121 ± 0.726
1.221AspTrp: 1.221 ± 0.317
2.747AspTyr: 2.747 ± 0.441
0.0AspXaa: 0.0 ± 0.0
Glu
4.96GluAla: 4.96 ± 0.48
0.229GluCys: 0.229 ± 0.138
3.129GluAsp: 3.129 ± 0.466
3.052GluGlu: 3.052 ± 0.565
2.366GluPhe: 2.366 ± 0.494
2.213GluGly: 2.213 ± 0.352
1.297GluHis: 1.297 ± 0.365
2.671GluIle: 2.671 ± 0.441
5.036GluLys: 5.036 ± 0.739
5.876GluLeu: 5.876 ± 0.747
1.602GluMet: 1.602 ± 0.402
3.739GluAsn: 3.739 ± 0.635
2.137GluPro: 2.137 ± 0.479
3.357GluGln: 3.357 ± 0.484
2.594GluArg: 2.594 ± 0.586
3.586GluSer: 3.586 ± 0.538
2.9GluThr: 2.9 ± 0.396
3.51GluVal: 3.51 ± 0.523
1.068GluTrp: 1.068 ± 0.273
2.289GluTyr: 2.289 ± 0.362
0.0GluXaa: 0.0 ± 0.0
Phe
2.518PheAla: 2.518 ± 0.393
0.076PheCys: 0.076 ± 0.073
3.586PheAsp: 3.586 ± 0.475
2.671PheGlu: 2.671 ± 0.464
1.297PhePhe: 1.297 ± 0.308
2.213PheGly: 2.213 ± 0.343
0.687PheHis: 0.687 ± 0.259
1.526PheIle: 1.526 ± 0.333
4.273PheLys: 4.273 ± 0.451
2.671PheLeu: 2.671 ± 0.428
0.763PheMet: 0.763 ± 0.209
1.908PheAsn: 1.908 ± 0.404
1.068PhePro: 1.068 ± 0.325
1.221PheGln: 1.221 ± 0.253
1.374PheArg: 1.374 ± 0.319
2.823PheSer: 2.823 ± 0.454
2.518PheThr: 2.518 ± 0.436
1.984PheVal: 1.984 ± 0.378
0.382PheTrp: 0.382 ± 0.156
0.992PheTyr: 0.992 ± 0.248
0.0PheXaa: 0.0 ± 0.0
Gly
4.121GlyAla: 4.121 ± 0.557
0.534GlyCys: 0.534 ± 0.201
3.968GlyAsp: 3.968 ± 0.661
3.663GlyGlu: 3.663 ± 0.502
2.137GlyPhe: 2.137 ± 0.342
4.121GlyGly: 4.121 ± 0.571
2.06GlyHis: 2.06 ± 0.371
3.815GlyIle: 3.815 ± 0.648
5.113GlyLys: 5.113 ± 0.716
5.647GlyLeu: 5.647 ± 0.832
1.068GlyMet: 1.068 ± 0.324
2.9GlyAsn: 2.9 ± 0.547
0.763GlyPro: 0.763 ± 0.24
1.45GlyGln: 1.45 ± 0.311
2.289GlyArg: 2.289 ± 0.437
4.197GlySer: 4.197 ± 0.608
5.952GlyThr: 5.952 ± 0.945
4.807GlyVal: 4.807 ± 0.567
1.145GlyTrp: 1.145 ± 0.309
2.9GlyTyr: 2.9 ± 0.488
0.0GlyXaa: 0.0 ± 0.0
His
1.221HisAla: 1.221 ± 0.3
0.153HisCys: 0.153 ± 0.132
1.679HisAsp: 1.679 ± 0.397
1.526HisGlu: 1.526 ± 0.316
0.916HisPhe: 0.916 ± 0.278
1.526HisGly: 1.526 ± 0.345
0.305HisHis: 0.305 ± 0.157
1.374HisIle: 1.374 ± 0.307
0.687HisLys: 0.687 ± 0.247
0.916HisLeu: 0.916 ± 0.226
0.61HisMet: 0.61 ± 0.242
0.839HisAsn: 0.839 ± 0.23
0.534HisPro: 0.534 ± 0.19
0.992HisGln: 0.992 ± 0.269
1.374HisArg: 1.374 ± 0.329
1.679HisSer: 1.679 ± 0.314
1.984HisThr: 1.984 ± 0.412
1.984HisVal: 1.984 ± 0.403
0.382HisTrp: 0.382 ± 0.213
1.068HisTyr: 1.068 ± 0.278
0.0HisXaa: 0.0 ± 0.0
Ile
4.426IleAla: 4.426 ± 0.557
0.534IleCys: 0.534 ± 0.192
4.502IleAsp: 4.502 ± 0.692
3.739IleGlu: 3.739 ± 0.673
1.984IlePhe: 1.984 ± 0.401
2.823IleGly: 2.823 ± 0.56
1.145IleHis: 1.145 ± 0.282
2.518IleIle: 2.518 ± 0.428
4.578IleLys: 4.578 ± 0.55
3.434IleLeu: 3.434 ± 0.522
1.221IleMet: 1.221 ± 0.249
3.205IleAsn: 3.205 ± 0.581
2.594IlePro: 2.594 ± 0.407
1.908IleGln: 1.908 ± 0.34
2.9IleArg: 2.9 ± 0.437
5.036IleSer: 5.036 ± 0.621
4.044IleThr: 4.044 ± 0.557
4.807IleVal: 4.807 ± 0.58
0.534IleTrp: 0.534 ± 0.226
3.052IleTyr: 3.052 ± 0.436
0.0IleXaa: 0.0 ± 0.0
Lys
6.105LysAla: 6.105 ± 0.633
0.153LysCys: 0.153 ± 0.111
3.892LysAsp: 3.892 ± 0.587
4.121LysGlu: 4.121 ± 0.634
2.442LysPhe: 2.442 ± 0.431
3.892LysGly: 3.892 ± 0.563
1.45LysHis: 1.45 ± 0.293
4.884LysIle: 4.884 ± 0.642
7.478LysLys: 7.478 ± 1.584
6.333LysLeu: 6.333 ± 0.7
2.747LysMet: 2.747 ± 0.481
4.426LysAsn: 4.426 ± 0.609
2.594LysPro: 2.594 ± 0.512
3.357LysGln: 3.357 ± 0.655
4.349LysArg: 4.349 ± 0.654
5.418LysSer: 5.418 ± 0.791
4.426LysThr: 4.426 ± 0.794
5.418LysVal: 5.418 ± 0.592
1.068LysTrp: 1.068 ± 0.285
2.594LysTyr: 2.594 ± 0.706
0.0LysXaa: 0.0 ± 0.0
Leu
6.562LeuAla: 6.562 ± 0.644
0.153LeuCys: 0.153 ± 0.099
6.181LeuAsp: 6.181 ± 0.63
4.197LeuGlu: 4.197 ± 0.559
3.052LeuPhe: 3.052 ± 0.465
4.044LeuGly: 4.044 ± 0.517
0.839LeuHis: 0.839 ± 0.244
4.349LeuIle: 4.349 ± 0.525
5.036LeuLys: 5.036 ± 0.683
6.486LeuLeu: 6.486 ± 0.731
2.06LeuMet: 2.06 ± 0.382
4.731LeuAsn: 4.731 ± 0.632
2.976LeuPro: 2.976 ± 0.51
3.663LeuGln: 3.663 ± 0.574
3.663LeuArg: 3.663 ± 0.649
5.036LeuSer: 5.036 ± 0.467
5.799LeuThr: 5.799 ± 0.741
4.655LeuVal: 4.655 ± 0.601
0.534LeuTrp: 0.534 ± 0.219
2.518LeuTyr: 2.518 ± 0.456
0.0LeuXaa: 0.0 ± 0.0
Met
3.586MetAla: 3.586 ± 0.504
0.153MetCys: 0.153 ± 0.095
1.526MetAsp: 1.526 ± 0.349
1.908MetGlu: 1.908 ± 0.3
0.458MetPhe: 0.458 ± 0.172
1.297MetGly: 1.297 ± 0.344
0.61MetHis: 0.61 ± 0.216
2.137MetIle: 2.137 ± 0.421
2.06MetLys: 2.06 ± 0.346
1.374MetLeu: 1.374 ± 0.32
0.763MetMet: 0.763 ± 0.202
1.221MetAsn: 1.221 ± 0.329
1.068MetPro: 1.068 ± 0.255
0.916MetGln: 0.916 ± 0.252
1.602MetArg: 1.602 ± 0.402
1.908MetSer: 1.908 ± 0.399
2.594MetThr: 2.594 ± 0.437
1.755MetVal: 1.755 ± 0.332
0.382MetTrp: 0.382 ± 0.165
1.068MetTyr: 1.068 ± 0.287
0.0MetXaa: 0.0 ± 0.0
Asn
4.273AsnAla: 4.273 ± 0.568
0.153AsnCys: 0.153 ± 0.098
4.349AsnAsp: 4.349 ± 0.731
2.366AsnGlu: 2.366 ± 0.392
2.594AsnPhe: 2.594 ± 0.46
5.113AsnGly: 5.113 ± 0.62
1.297AsnHis: 1.297 ± 0.315
3.052AsnIle: 3.052 ± 0.428
3.586AsnLys: 3.586 ± 0.599
3.663AsnLeu: 3.663 ± 0.553
1.526AsnMet: 1.526 ± 0.34
1.984AsnAsn: 1.984 ± 0.348
2.137AsnPro: 2.137 ± 0.372
2.518AsnGln: 2.518 ± 0.44
2.06AsnArg: 2.06 ± 0.408
3.51AsnSer: 3.51 ± 0.524
2.976AsnThr: 2.976 ± 0.483
2.9AsnVal: 2.9 ± 0.5
0.61AsnTrp: 0.61 ± 0.199
1.908AsnTyr: 1.908 ± 0.45
0.0AsnXaa: 0.0 ± 0.0
Pro
2.823ProAla: 2.823 ± 0.556
0.0ProCys: 0.0 ± 0.0
3.205ProAsp: 3.205 ± 0.571
2.366ProGlu: 2.366 ± 0.482
1.221ProPhe: 1.221 ± 0.282
1.45ProGly: 1.45 ± 0.326
1.602ProHis: 1.602 ± 0.349
1.679ProIle: 1.679 ± 0.317
2.976ProLys: 2.976 ± 0.528
2.594ProLeu: 2.594 ± 0.425
0.61ProMet: 0.61 ± 0.178
1.45ProAsn: 1.45 ± 0.249
1.297ProPro: 1.297 ± 0.421
1.145ProGln: 1.145 ± 0.31
1.145ProArg: 1.145 ± 0.298
2.518ProSer: 2.518 ± 0.461
3.129ProThr: 3.129 ± 0.535
1.984ProVal: 1.984 ± 0.324
0.458ProTrp: 0.458 ± 0.21
1.602ProTyr: 1.602 ± 0.304
0.0ProXaa: 0.0 ± 0.0
Gln
3.663GlnAla: 3.663 ± 0.614
0.229GlnCys: 0.229 ± 0.148
2.06GlnAsp: 2.06 ± 0.418
2.213GlnGlu: 2.213 ± 0.414
1.755GlnPhe: 1.755 ± 0.383
2.289GlnGly: 2.289 ± 0.441
0.763GlnHis: 0.763 ± 0.243
3.281GlnIle: 3.281 ± 0.396
2.213GlnLys: 2.213 ± 0.495
3.205GlnLeu: 3.205 ± 0.514
1.145GlnMet: 1.145 ± 0.328
2.06GlnAsn: 2.06 ± 0.377
1.984GlnPro: 1.984 ± 0.325
2.442GlnGln: 2.442 ± 0.506
1.908GlnArg: 1.908 ± 0.367
2.518GlnSer: 2.518 ± 0.386
1.679GlnThr: 1.679 ± 0.32
3.281GlnVal: 3.281 ± 0.496
0.305GlnTrp: 0.305 ± 0.141
1.221GlnTyr: 1.221 ± 0.376
0.0GlnXaa: 0.0 ± 0.0
Arg
3.357ArgAla: 3.357 ± 0.611
0.305ArgCys: 0.305 ± 0.138
2.976ArgAsp: 2.976 ± 0.502
2.823ArgGlu: 2.823 ± 0.503
1.526ArgPhe: 1.526 ± 0.361
2.289ArgGly: 2.289 ± 0.465
0.992ArgHis: 0.992 ± 0.291
2.747ArgIle: 2.747 ± 0.522
3.586ArgLys: 3.586 ± 0.613
4.044ArgLeu: 4.044 ± 0.565
1.221ArgMet: 1.221 ± 0.354
2.442ArgAsn: 2.442 ± 0.341
1.374ArgPro: 1.374 ± 0.341
1.602ArgGln: 1.602 ± 0.271
1.984ArgArg: 1.984 ± 0.393
3.281ArgSer: 3.281 ± 0.456
2.594ArgThr: 2.594 ± 0.377
2.747ArgVal: 2.747 ± 0.458
0.382ArgTrp: 0.382 ± 0.185
1.679ArgTyr: 1.679 ± 0.336
0.0ArgXaa: 0.0 ± 0.0
Ser
5.494SerAla: 5.494 ± 0.526
0.229SerCys: 0.229 ± 0.139
5.036SerAsp: 5.036 ± 0.682
3.281SerGlu: 3.281 ± 0.554
2.137SerPhe: 2.137 ± 0.36
5.494SerGly: 5.494 ± 0.635
1.221SerHis: 1.221 ± 0.299
3.586SerIle: 3.586 ± 0.448
6.41SerLys: 6.41 ± 0.885
4.044SerLeu: 4.044 ± 0.559
2.976SerMet: 2.976 ± 0.417
4.349SerAsn: 4.349 ± 0.627
2.137SerPro: 2.137 ± 0.371
1.831SerGln: 1.831 ± 0.385
2.823SerArg: 2.823 ± 0.47
4.502SerSer: 4.502 ± 0.56
3.663SerThr: 3.663 ± 0.59
4.807SerVal: 4.807 ± 0.573
1.145SerTrp: 1.145 ± 0.233
2.823SerTyr: 2.823 ± 0.457
0.0SerXaa: 0.0 ± 0.0
Thr
5.341ThrAla: 5.341 ± 0.799
0.076ThrCys: 0.076 ± 0.075
5.265ThrAsp: 5.265 ± 0.77
3.586ThrGlu: 3.586 ± 0.505
3.205ThrPhe: 3.205 ± 0.407
4.273ThrGly: 4.273 ± 0.444
1.602ThrHis: 1.602 ± 0.392
4.502ThrIle: 4.502 ± 0.496
4.807ThrLys: 4.807 ± 1.045
4.349ThrLeu: 4.349 ± 0.49
1.45ThrMet: 1.45 ± 0.331
3.281ThrAsn: 3.281 ± 0.449
3.968ThrPro: 3.968 ± 0.601
2.289ThrGln: 2.289 ± 0.375
3.129ThrArg: 3.129 ± 0.546
3.968ThrSer: 3.968 ± 0.493
5.265ThrThr: 5.265 ± 0.718
4.426ThrVal: 4.426 ± 0.627
0.534ThrTrp: 0.534 ± 0.167
2.442ThrTyr: 2.442 ± 0.4
0.0ThrXaa: 0.0 ± 0.0
Val
5.189ValAla: 5.189 ± 0.634
0.382ValCys: 0.382 ± 0.168
4.273ValAsp: 4.273 ± 0.532
3.357ValGlu: 3.357 ± 0.563
2.137ValPhe: 2.137 ± 0.323
4.578ValGly: 4.578 ± 0.634
1.908ValHis: 1.908 ± 0.395
4.121ValIle: 4.121 ± 0.465
5.113ValLys: 5.113 ± 0.695
3.892ValLeu: 3.892 ± 0.509
1.831ValMet: 1.831 ± 0.358
3.51ValAsn: 3.51 ± 0.607
2.594ValPro: 2.594 ± 0.371
2.518ValGln: 2.518 ± 0.383
2.976ValArg: 2.976 ± 0.428
5.647ValSer: 5.647 ± 0.663
4.807ValThr: 4.807 ± 0.618
4.349ValVal: 4.349 ± 0.584
1.068ValTrp: 1.068 ± 0.266
2.137ValTyr: 2.137 ± 0.328
0.0ValXaa: 0.0 ± 0.0
Trp
1.145TrpAla: 1.145 ± 0.376
0.305TrpCys: 0.305 ± 0.125
0.534TrpAsp: 0.534 ± 0.185
0.687TrpGlu: 0.687 ± 0.234
0.61TrpPhe: 0.61 ± 0.227
0.61TrpGly: 0.61 ± 0.202
0.458TrpHis: 0.458 ± 0.175
0.839TrpIle: 0.839 ± 0.263
0.763TrpLys: 0.763 ± 0.221
1.45TrpLeu: 1.45 ± 0.323
0.153TrpMet: 0.153 ± 0.101
0.839TrpAsn: 0.839 ± 0.257
0.229TrpPro: 0.229 ± 0.115
0.916TrpGln: 0.916 ± 0.264
0.153TrpArg: 0.153 ± 0.104
0.763TrpSer: 0.763 ± 0.205
0.916TrpThr: 0.916 ± 0.28
0.687TrpVal: 0.687 ± 0.235
0.305TrpTrp: 0.305 ± 0.126
0.839TrpTyr: 0.839 ± 0.264
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.671TyrAla: 2.671 ± 0.462
0.153TyrCys: 0.153 ± 0.099
2.9TyrAsp: 2.9 ± 0.434
1.755TyrGlu: 1.755 ± 0.365
1.755TyrPhe: 1.755 ± 0.335
3.205TyrGly: 3.205 ± 0.566
0.916TyrHis: 0.916 ± 0.261
1.908TyrIle: 1.908 ± 0.307
2.137TyrLys: 2.137 ± 0.436
3.663TyrLeu: 3.663 ± 0.575
1.145TyrMet: 1.145 ± 0.258
2.137TyrAsn: 2.137 ± 0.473
1.45TyrPro: 1.45 ± 0.4
2.137TyrGln: 2.137 ± 0.45
1.755TyrArg: 1.755 ± 0.49
2.289TyrSer: 2.289 ± 0.429
2.213TyrThr: 2.213 ± 0.415
2.289TyrVal: 2.289 ± 0.385
0.687TyrTrp: 0.687 ± 0.253
2.137TyrTyr: 2.137 ± 0.466
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 72 proteins (13106 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski