Amino acid dipepetide frequency for Lactococcus phage CHPC965

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.925AlaAla: 0.925 ± 0.535
0.132AlaCys: 0.132 ± 0.152
3.437AlaAsp: 3.437 ± 0.592
4.363AlaGlu: 4.363 ± 0.795
2.909AlaPhe: 2.909 ± 0.666
4.495AlaGly: 4.495 ± 1.152
0.793AlaHis: 0.793 ± 0.337
3.966AlaIle: 3.966 ± 0.945
6.61AlaLys: 6.61 ± 1.163
5.949AlaLeu: 5.949 ± 1.156
1.983AlaMet: 1.983 ± 0.535
5.156AlaAsn: 5.156 ± 0.904
0.397AlaPro: 0.397 ± 0.276
2.512AlaGln: 2.512 ± 0.634
1.983AlaArg: 1.983 ± 0.546
3.173AlaSer: 3.173 ± 0.637
4.495AlaThr: 4.495 ± 1.095
5.288AlaVal: 5.288 ± 1.268
2.247AlaTrp: 2.247 ± 0.73
2.247AlaTyr: 2.247 ± 0.507
0.0AlaXaa: 0.0 ± 0.0
Cys
0.397CysAla: 0.397 ± 0.19
0.0CysCys: 0.0 ± 0.0
0.264CysAsp: 0.264 ± 0.222
0.397CysGlu: 0.397 ± 0.213
0.264CysPhe: 0.264 ± 0.193
0.925CysGly: 0.925 ± 0.45
0.264CysHis: 0.264 ± 0.18
0.264CysIle: 0.264 ± 0.196
0.793CysLys: 0.793 ± 0.338
0.397CysLeu: 0.397 ± 0.25
0.132CysMet: 0.132 ± 0.144
0.661CysAsn: 0.661 ± 0.234
0.132CysPro: 0.132 ± 0.112
0.132CysGln: 0.132 ± 0.112
0.397CysArg: 0.397 ± 0.241
0.132CysSer: 0.132 ± 0.133
0.132CysThr: 0.132 ± 0.126
0.397CysVal: 0.397 ± 0.225
0.132CysTrp: 0.132 ± 0.137
0.264CysTyr: 0.264 ± 0.192
0.0CysXaa: 0.0 ± 0.0
Asp
1.851AspAla: 1.851 ± 0.495
0.397AspCys: 0.397 ± 0.219
2.776AspAsp: 2.776 ± 0.653
3.834AspGlu: 3.834 ± 0.837
3.305AspPhe: 3.305 ± 0.636
5.024AspGly: 5.024 ± 0.748
0.925AspHis: 0.925 ± 0.341
3.437AspIle: 3.437 ± 0.608
4.627AspLys: 4.627 ± 0.63
5.949AspLeu: 5.949 ± 1.079
0.925AspMet: 0.925 ± 0.289
3.966AspAsn: 3.966 ± 0.849
1.322AspPro: 1.322 ± 0.4
0.793AspGln: 0.793 ± 0.309
2.115AspArg: 2.115 ± 0.655
2.644AspSer: 2.644 ± 0.552
3.966AspThr: 3.966 ± 0.721
2.909AspVal: 2.909 ± 0.704
0.661AspTrp: 0.661 ± 0.292
3.173AspTyr: 3.173 ± 0.678
0.0AspXaa: 0.0 ± 0.0
Glu
3.437GluAla: 3.437 ± 0.515
0.264GluCys: 0.264 ± 0.201
2.38GluAsp: 2.38 ± 0.578
4.231GluGlu: 4.231 ± 0.876
4.098GluPhe: 4.098 ± 0.696
1.983GluGly: 1.983 ± 0.486
1.19GluHis: 1.19 ± 0.39
5.817GluIle: 5.817 ± 0.865
5.685GluLys: 5.685 ± 1.278
10.576GluLeu: 10.576 ± 1.97
2.776GluMet: 2.776 ± 0.558
5.553GluAsn: 5.553 ± 0.746
1.058GluPro: 1.058 ± 0.358
3.57GluGln: 3.57 ± 0.631
2.644GluArg: 2.644 ± 0.535
3.966GluSer: 3.966 ± 0.639
4.231GluThr: 4.231 ± 0.862
3.834GluVal: 3.834 ± 0.588
0.793GluTrp: 0.793 ± 0.322
3.041GluTyr: 3.041 ± 0.753
0.0GluXaa: 0.0 ± 0.0
Phe
2.776PheAla: 2.776 ± 0.518
0.264PheCys: 0.264 ± 0.201
2.776PheAsp: 2.776 ± 0.645
2.115PheGlu: 2.115 ± 0.608
1.851PhePhe: 1.851 ± 0.694
1.983PheGly: 1.983 ± 0.634
0.529PheHis: 0.529 ± 0.316
3.702PheIle: 3.702 ± 0.75
4.098PheLys: 4.098 ± 0.772
2.38PheLeu: 2.38 ± 0.541
0.661PheMet: 0.661 ± 0.338
2.644PheAsn: 2.644 ± 0.699
0.529PhePro: 0.529 ± 0.248
1.19PheGln: 1.19 ± 0.379
1.586PheArg: 1.586 ± 0.415
3.437PheSer: 3.437 ± 0.778
3.437PheThr: 3.437 ± 0.653
2.512PheVal: 2.512 ± 0.525
0.264PheTrp: 0.264 ± 0.185
1.719PheTyr: 1.719 ± 0.524
0.0PheXaa: 0.0 ± 0.0
Gly
3.437GlyAla: 3.437 ± 1.202
0.132GlyCys: 0.132 ± 0.133
3.305GlyAsp: 3.305 ± 0.575
3.437GlyGlu: 3.437 ± 0.675
2.247GlyPhe: 2.247 ± 0.638
3.57GlyGly: 3.57 ± 0.779
1.058GlyHis: 1.058 ± 0.387
4.363GlyIle: 4.363 ± 1.195
6.081GlyLys: 6.081 ± 0.843
6.081GlyLeu: 6.081 ± 1.306
1.586GlyMet: 1.586 ± 0.467
3.966GlyAsn: 3.966 ± 0.586
0.529GlyPro: 0.529 ± 0.326
1.983GlyGln: 1.983 ± 0.551
1.719GlyArg: 1.719 ± 0.393
4.627GlySer: 4.627 ± 0.936
3.173GlyThr: 3.173 ± 0.578
5.553GlyVal: 5.553 ± 0.731
0.925GlyTrp: 0.925 ± 0.336
3.041GlyTyr: 3.041 ± 0.821
0.0GlyXaa: 0.0 ± 0.0
His
0.793HisAla: 0.793 ± 0.353
0.529HisCys: 0.529 ± 0.354
0.925HisAsp: 0.925 ± 0.314
0.397HisGlu: 0.397 ± 0.234
0.397HisPhe: 0.397 ± 0.254
1.322HisGly: 1.322 ± 0.499
0.0HisHis: 0.0 ± 0.0
1.058HisIle: 1.058 ± 0.39
0.661HisLys: 0.661 ± 0.242
0.925HisLeu: 0.925 ± 0.357
0.0HisMet: 0.0 ± 0.0
1.454HisAsn: 1.454 ± 0.481
0.397HisPro: 0.397 ± 0.212
0.264HisGln: 0.264 ± 0.188
0.661HisArg: 0.661 ± 0.284
0.132HisSer: 0.132 ± 0.112
0.925HisThr: 0.925 ± 0.344
0.661HisVal: 0.661 ± 0.291
0.264HisTrp: 0.264 ± 0.28
0.529HisTyr: 0.529 ± 0.282
0.0HisXaa: 0.0 ± 0.0
Ile
4.759IleAla: 4.759 ± 0.574
0.132IleCys: 0.132 ± 0.131
5.42IleAsp: 5.42 ± 0.605
7.007IleGlu: 7.007 ± 0.976
2.644IlePhe: 2.644 ± 0.66
4.495IleGly: 4.495 ± 1.125
1.058IleHis: 1.058 ± 0.355
4.231IleIle: 4.231 ± 0.864
6.61IleLys: 6.61 ± 0.929
4.495IleLeu: 4.495 ± 0.998
1.322IleMet: 1.322 ± 0.456
4.627IleAsn: 4.627 ± 0.739
1.322IlePro: 1.322 ± 0.491
2.115IleGln: 2.115 ± 0.484
1.719IleArg: 1.719 ± 0.461
4.627IleSer: 4.627 ± 0.835
4.627IleThr: 4.627 ± 0.759
4.363IleVal: 4.363 ± 0.883
1.322IleTrp: 1.322 ± 0.46
2.38IleTyr: 2.38 ± 0.562
0.0IleXaa: 0.0 ± 0.0
Lys
5.949LysAla: 5.949 ± 0.894
0.264LysCys: 0.264 ± 0.202
4.627LysAsp: 4.627 ± 0.794
8.461LysGlu: 8.461 ± 1.58
2.115LysPhe: 2.115 ± 0.533
5.817LysGly: 5.817 ± 1.142
0.793LysHis: 0.793 ± 0.411
5.553LysIle: 5.553 ± 0.627
9.254LysLys: 9.254 ± 1.264
8.197LysLeu: 8.197 ± 0.986
3.437LysMet: 3.437 ± 0.488
5.024LysAsn: 5.024 ± 0.774
1.983LysPro: 1.983 ± 0.524
3.57LysGln: 3.57 ± 0.678
2.644LysArg: 2.644 ± 0.619
6.346LysSer: 6.346 ± 0.919
4.892LysThr: 4.892 ± 0.841
5.553LysVal: 5.553 ± 0.883
1.19LysTrp: 1.19 ± 0.443
4.363LysTyr: 4.363 ± 0.837
0.0LysXaa: 0.0 ± 0.0
Leu
5.949LeuAla: 5.949 ± 0.716
0.397LeuCys: 0.397 ± 0.242
4.363LeuAsp: 4.363 ± 0.753
5.42LeuGlu: 5.42 ± 0.995
3.57LeuPhe: 3.57 ± 0.794
4.098LeuGly: 4.098 ± 0.607
1.19LeuHis: 1.19 ± 0.376
7.403LeuIle: 7.403 ± 1.212
8.726LeuLys: 8.726 ± 1.106
7.271LeuLeu: 7.271 ± 1.394
1.586LeuMet: 1.586 ± 0.476
5.817LeuAsn: 5.817 ± 0.816
3.305LeuPro: 3.305 ± 0.596
3.57LeuGln: 3.57 ± 0.548
2.38LeuArg: 2.38 ± 0.526
4.627LeuSer: 4.627 ± 0.82
6.742LeuThr: 6.742 ± 0.747
6.478LeuVal: 6.478 ± 0.876
1.19LeuTrp: 1.19 ± 0.433
3.834LeuTyr: 3.834 ± 0.964
0.0LeuXaa: 0.0 ± 0.0
Met
2.512MetAla: 2.512 ± 0.549
0.132MetCys: 0.132 ± 0.118
1.19MetAsp: 1.19 ± 0.437
2.38MetGlu: 2.38 ± 0.72
0.529MetPhe: 0.529 ± 0.237
1.058MetGly: 1.058 ± 0.317
0.264MetHis: 0.264 ± 0.205
2.247MetIle: 2.247 ± 0.538
1.851MetLys: 1.851 ± 0.521
0.925MetLeu: 0.925 ± 0.461
0.397MetMet: 0.397 ± 0.24
2.247MetAsn: 2.247 ± 0.577
0.793MetPro: 0.793 ± 0.315
1.851MetGln: 1.851 ± 0.442
0.529MetArg: 0.529 ± 0.285
1.851MetSer: 1.851 ± 0.428
1.851MetThr: 1.851 ± 0.456
0.925MetVal: 0.925 ± 0.281
0.397MetTrp: 0.397 ± 0.233
1.19MetTyr: 1.19 ± 0.401
0.0MetXaa: 0.0 ± 0.0
Asn
5.553AsnAla: 5.553 ± 1.014
0.132AsnCys: 0.132 ± 0.132
3.173AsnAsp: 3.173 ± 0.602
4.627AsnGlu: 4.627 ± 0.786
2.115AsnPhe: 2.115 ± 0.628
6.742AsnGly: 6.742 ± 0.902
1.058AsnHis: 1.058 ± 0.333
3.966AsnIle: 3.966 ± 0.793
7.8AsnLys: 7.8 ± 1.519
6.61AsnLeu: 6.61 ± 0.82
1.322AsnMet: 1.322 ± 0.436
3.834AsnAsn: 3.834 ± 0.826
2.115AsnPro: 2.115 ± 0.506
2.909AsnGln: 2.909 ± 0.552
1.719AsnArg: 1.719 ± 0.455
6.081AsnSer: 6.081 ± 0.826
4.231AsnThr: 4.231 ± 0.792
3.305AsnVal: 3.305 ± 0.709
1.19AsnTrp: 1.19 ± 0.401
2.115AsnTyr: 2.115 ± 0.698
0.0AsnXaa: 0.0 ± 0.0
Pro
1.719ProAla: 1.719 ± 0.48
0.132ProCys: 0.132 ± 0.133
1.454ProAsp: 1.454 ± 0.43
1.586ProGlu: 1.586 ± 0.508
0.661ProPhe: 0.661 ± 0.343
0.264ProGly: 0.264 ± 0.175
0.0ProHis: 0.0 ± 0.0
1.586ProIle: 1.586 ± 0.479
2.115ProLys: 2.115 ± 0.658
1.851ProLeu: 1.851 ± 0.424
0.529ProMet: 0.529 ± 0.214
2.38ProAsn: 2.38 ± 0.737
0.529ProPro: 0.529 ± 0.312
0.529ProGln: 0.529 ± 0.245
0.529ProArg: 0.529 ± 0.21
1.454ProSer: 1.454 ± 0.57
2.644ProThr: 2.644 ± 0.479
1.454ProVal: 1.454 ± 0.484
0.397ProTrp: 0.397 ± 0.237
0.661ProTyr: 0.661 ± 0.309
0.0ProXaa: 0.0 ± 0.0
Gln
3.437GlnAla: 3.437 ± 0.804
0.397GlnCys: 0.397 ± 0.22
1.719GlnAsp: 1.719 ± 0.476
2.512GlnGlu: 2.512 ± 0.477
1.454GlnPhe: 1.454 ± 0.449
2.115GlnGly: 2.115 ± 0.527
0.397GlnHis: 0.397 ± 0.201
1.322GlnIle: 1.322 ± 0.341
3.173GlnLys: 3.173 ± 0.647
2.776GlnLeu: 2.776 ± 0.87
0.793GlnMet: 0.793 ± 0.257
2.115GlnAsn: 2.115 ± 0.468
0.925GlnPro: 0.925 ± 0.381
1.454GlnGln: 1.454 ± 0.39
1.586GlnArg: 1.586 ± 0.466
2.247GlnSer: 2.247 ± 0.538
2.776GlnThr: 2.776 ± 0.505
2.38GlnVal: 2.38 ± 0.626
0.793GlnTrp: 0.793 ± 0.279
1.454GlnTyr: 1.454 ± 0.347
0.0GlnXaa: 0.0 ± 0.0
Arg
1.586ArgAla: 1.586 ± 0.498
0.397ArgCys: 0.397 ± 0.245
1.454ArgAsp: 1.454 ± 0.404
1.851ArgGlu: 1.851 ± 0.388
0.793ArgPhe: 0.793 ± 0.295
2.115ArgGly: 2.115 ± 0.568
0.793ArgHis: 0.793 ± 0.28
1.983ArgIle: 1.983 ± 0.437
2.909ArgLys: 2.909 ± 0.882
4.098ArgLeu: 4.098 ± 0.874
0.661ArgMet: 0.661 ± 0.371
2.644ArgAsn: 2.644 ± 0.697
0.529ArgPro: 0.529 ± 0.222
1.19ArgGln: 1.19 ± 0.379
1.454ArgArg: 1.454 ± 0.501
1.719ArgSer: 1.719 ± 0.433
1.454ArgThr: 1.454 ± 0.387
2.247ArgVal: 2.247 ± 0.532
0.397ArgTrp: 0.397 ± 0.226
1.983ArgTyr: 1.983 ± 0.57
0.0ArgXaa: 0.0 ± 0.0
Ser
5.949SerAla: 5.949 ± 1.536
0.793SerCys: 0.793 ± 0.359
3.966SerAsp: 3.966 ± 0.871
4.231SerGlu: 4.231 ± 0.659
3.702SerPhe: 3.702 ± 0.856
4.495SerGly: 4.495 ± 1.011
0.397SerHis: 0.397 ± 0.253
3.966SerIle: 3.966 ± 0.619
4.231SerLys: 4.231 ± 0.867
5.42SerLeu: 5.42 ± 0.963
1.983SerMet: 1.983 ± 0.435
5.024SerAsn: 5.024 ± 1.053
1.586SerPro: 1.586 ± 0.651
1.586SerGln: 1.586 ± 0.51
2.247SerArg: 2.247 ± 0.457
6.081SerSer: 6.081 ± 1.149
3.57SerThr: 3.57 ± 0.75
4.627SerVal: 4.627 ± 0.702
0.925SerTrp: 0.925 ± 0.312
1.586SerTyr: 1.586 ± 0.435
0.0SerXaa: 0.0 ± 0.0
Thr
5.817ThrAla: 5.817 ± 0.85
0.397ThrCys: 0.397 ± 0.224
3.57ThrAsp: 3.57 ± 0.97
6.081ThrGlu: 6.081 ± 0.792
2.38ThrPhe: 2.38 ± 0.439
4.495ThrGly: 4.495 ± 0.739
0.0ThrHis: 0.0 ± 0.0
4.892ThrIle: 4.892 ± 0.849
3.834ThrLys: 3.834 ± 0.548
5.817ThrLeu: 5.817 ± 0.911
1.058ThrMet: 1.058 ± 0.425
4.495ThrAsn: 4.495 ± 0.634
2.115ThrPro: 2.115 ± 0.394
2.909ThrGln: 2.909 ± 0.678
1.586ThrArg: 1.586 ± 0.588
4.892ThrSer: 4.892 ± 0.856
4.495ThrThr: 4.495 ± 0.842
5.685ThrVal: 5.685 ± 0.782
1.19ThrTrp: 1.19 ± 0.421
2.247ThrTyr: 2.247 ± 0.784
0.0ThrXaa: 0.0 ± 0.0
Val
3.702ValAla: 3.702 ± 0.766
0.529ValCys: 0.529 ± 0.338
5.024ValAsp: 5.024 ± 0.691
4.231ValGlu: 4.231 ± 0.695
2.644ValPhe: 2.644 ± 0.472
3.173ValGly: 3.173 ± 0.434
0.529ValHis: 0.529 ± 0.215
5.288ValIle: 5.288 ± 0.985
7.139ValLys: 7.139 ± 0.948
3.702ValLeu: 3.702 ± 0.553
2.115ValMet: 2.115 ± 0.518
3.305ValAsn: 3.305 ± 0.618
1.19ValPro: 1.19 ± 0.367
1.719ValGln: 1.719 ± 0.542
3.173ValArg: 3.173 ± 0.71
5.288ValSer: 5.288 ± 0.977
6.214ValThr: 6.214 ± 0.872
3.702ValVal: 3.702 ± 0.752
0.264ValTrp: 0.264 ± 0.167
3.437ValTyr: 3.437 ± 0.747
0.0ValXaa: 0.0 ± 0.0
Trp
0.793TrpAla: 0.793 ± 0.418
0.264TrpCys: 0.264 ± 0.185
0.661TrpAsp: 0.661 ± 0.32
0.793TrpGlu: 0.793 ± 0.28
1.058TrpPhe: 1.058 ± 0.48
0.529TrpGly: 0.529 ± 0.243
0.132TrpHis: 0.132 ± 0.112
0.793TrpIle: 0.793 ± 0.285
1.058TrpLys: 1.058 ± 0.357
0.925TrpLeu: 0.925 ± 0.35
0.529TrpMet: 0.529 ± 0.243
2.115TrpAsn: 2.115 ± 0.919
0.0TrpPro: 0.0 ± 0.0
1.058TrpGln: 1.058 ± 0.355
0.661TrpArg: 0.661 ± 0.349
0.925TrpSer: 0.925 ± 0.268
0.661TrpThr: 0.661 ± 0.341
1.19TrpVal: 1.19 ± 0.383
0.0TrpTrp: 0.0 ± 0.0
0.793TrpTyr: 0.793 ± 0.34
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.719TyrAla: 1.719 ± 0.53
0.793TyrCys: 0.793 ± 0.454
2.115TyrAsp: 2.115 ± 0.742
3.173TyrGlu: 3.173 ± 0.763
1.719TyrPhe: 1.719 ± 0.516
2.115TyrGly: 2.115 ± 0.605
0.925TyrHis: 0.925 ± 0.341
3.437TyrIle: 3.437 ± 0.742
2.909TyrLys: 2.909 ± 0.778
3.57TyrLeu: 3.57 ± 0.9
1.19TyrMet: 1.19 ± 0.589
3.57TyrAsn: 3.57 ± 0.665
1.719TyrPro: 1.719 ± 0.531
0.925TyrGln: 0.925 ± 0.303
0.925TyrArg: 0.925 ± 0.402
2.115TyrSer: 2.115 ± 0.405
3.437TyrThr: 3.437 ± 0.902
3.305TyrVal: 3.305 ± 0.64
0.397TyrTrp: 0.397 ± 0.199
1.983TyrTyr: 1.983 ± 0.625
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 47 proteins (7565 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski