Amino acid dipepetide frequency for Lactococcus phage proPhi6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.067AlaAla: 3.067 ± 0.877
0.438AlaCys: 0.438 ± 0.176
4.031AlaAsp: 4.031 ± 0.721
4.469AlaGlu: 4.469 ± 0.846
3.33AlaPhe: 3.33 ± 0.523
3.768AlaGly: 3.768 ± 0.593
0.613AlaHis: 0.613 ± 0.237
4.82AlaIle: 4.82 ± 0.931
5.258AlaLys: 5.258 ± 0.908
5.872AlaLeu: 5.872 ± 0.845
1.402AlaMet: 1.402 ± 0.292
3.418AlaAsn: 3.418 ± 0.546
1.227AlaPro: 1.227 ± 0.369
2.366AlaGln: 2.366 ± 0.399
1.753AlaArg: 1.753 ± 0.392
4.294AlaSer: 4.294 ± 0.794
3.33AlaThr: 3.33 ± 0.56
3.418AlaVal: 3.418 ± 0.713
0.964AlaTrp: 0.964 ± 0.227
2.016AlaTyr: 2.016 ± 0.531
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.175CysCys: 0.175 ± 0.113
0.526CysAsp: 0.526 ± 0.176
0.438CysGlu: 0.438 ± 0.149
0.351CysPhe: 0.351 ± 0.15
0.351CysGly: 0.351 ± 0.174
0.175CysHis: 0.175 ± 0.142
0.526CysIle: 0.526 ± 0.241
0.526CysLys: 0.526 ± 0.217
0.438CysLeu: 0.438 ± 0.165
0.0CysMet: 0.0 ± 0.0
0.263CysAsn: 0.263 ± 0.157
0.175CysPro: 0.175 ± 0.123
0.263CysGln: 0.263 ± 0.161
0.175CysArg: 0.175 ± 0.098
0.263CysSer: 0.263 ± 0.157
0.175CysThr: 0.175 ± 0.109
0.175CysVal: 0.175 ± 0.101
0.175CysTrp: 0.175 ± 0.12
0.351CysTyr: 0.351 ± 0.19
0.0CysXaa: 0.0 ± 0.0
Asp
2.98AspAla: 2.98 ± 0.517
0.526AspCys: 0.526 ± 0.234
5.083AspAsp: 5.083 ± 0.645
5.959AspGlu: 5.959 ± 0.87
2.717AspPhe: 2.717 ± 0.585
4.382AspGly: 4.382 ± 0.579
0.438AspHis: 0.438 ± 0.148
4.732AspIle: 4.732 ± 0.543
6.66AspLys: 6.66 ± 0.8
4.206AspLeu: 4.206 ± 0.518
1.052AspMet: 1.052 ± 0.271
4.206AspAsn: 4.206 ± 0.64
1.402AspPro: 1.402 ± 0.428
1.315AspGln: 1.315 ± 0.349
2.191AspArg: 2.191 ± 0.325
3.33AspSer: 3.33 ± 0.528
3.505AspThr: 3.505 ± 0.541
3.067AspVal: 3.067 ± 0.492
1.577AspTrp: 1.577 ± 0.32
3.593AspTyr: 3.593 ± 0.471
0.0AspXaa: 0.0 ± 0.0
Glu
3.33GluAla: 3.33 ± 0.484
0.263GluCys: 0.263 ± 0.126
2.541GluAsp: 2.541 ± 0.539
5.521GluGlu: 5.521 ± 0.967
3.944GluPhe: 3.944 ± 0.566
2.454GluGly: 2.454 ± 0.57
1.753GluHis: 1.753 ± 0.61
5.609GluIle: 5.609 ± 0.722
7.186GluLys: 7.186 ± 1.132
8.062GluLeu: 8.062 ± 0.955
2.629GluMet: 2.629 ± 0.467
4.206GluAsn: 4.206 ± 0.598
2.191GluPro: 2.191 ± 0.457
2.98GluGln: 2.98 ± 0.548
2.98GluArg: 2.98 ± 0.577
3.505GluSer: 3.505 ± 0.605
3.768GluThr: 3.768 ± 0.747
4.908GluVal: 4.908 ± 0.694
0.876GluTrp: 0.876 ± 0.253
3.155GluTyr: 3.155 ± 0.509
0.0GluXaa: 0.0 ± 0.0
Phe
2.016PheAla: 2.016 ± 0.368
0.438PheCys: 0.438 ± 0.199
4.206PheAsp: 4.206 ± 0.446
3.418PheGlu: 3.418 ± 0.525
1.577PhePhe: 1.577 ± 0.401
2.98PheGly: 2.98 ± 0.458
0.701PheHis: 0.701 ± 0.239
3.067PheIle: 3.067 ± 0.503
4.557PheLys: 4.557 ± 0.595
2.454PheLeu: 2.454 ± 0.422
1.753PheMet: 1.753 ± 0.479
2.804PheAsn: 2.804 ± 0.831
1.139PhePro: 1.139 ± 0.34
1.753PheGln: 1.753 ± 0.408
1.402PheArg: 1.402 ± 0.332
3.505PheSer: 3.505 ± 0.567
2.629PheThr: 2.629 ± 0.477
2.279PheVal: 2.279 ± 0.463
0.263PheTrp: 0.263 ± 0.141
1.753PheTyr: 1.753 ± 0.45
0.0PheXaa: 0.0 ± 0.0
Gly
4.382GlyAla: 4.382 ± 0.794
0.351GlyCys: 0.351 ± 0.175
3.505GlyAsp: 3.505 ± 0.578
2.892GlyGlu: 2.892 ± 0.484
2.98GlyPhe: 2.98 ± 0.579
4.82GlyGly: 4.82 ± 0.873
0.701GlyHis: 0.701 ± 0.221
6.134GlyIle: 6.134 ± 0.962
5.609GlyLys: 5.609 ± 0.766
4.732GlyLeu: 4.732 ± 0.673
1.665GlyMet: 1.665 ± 0.443
4.119GlyAsn: 4.119 ± 0.572
0.438GlyPro: 0.438 ± 0.241
2.629GlyGln: 2.629 ± 0.547
2.804GlyArg: 2.804 ± 0.466
3.155GlySer: 3.155 ± 0.486
3.944GlyThr: 3.944 ± 0.561
4.645GlyVal: 4.645 ± 0.844
1.052GlyTrp: 1.052 ± 0.358
3.944GlyTyr: 3.944 ± 0.554
0.0GlyXaa: 0.0 ± 0.0
His
1.139HisAla: 1.139 ± 0.451
0.088HisCys: 0.088 ± 0.091
0.701HisAsp: 0.701 ± 0.219
1.227HisGlu: 1.227 ± 0.406
0.789HisPhe: 0.789 ± 0.236
0.701HisGly: 0.701 ± 0.275
0.263HisHis: 0.263 ± 0.168
0.613HisIle: 0.613 ± 0.289
1.052HisLys: 1.052 ± 0.29
1.139HisLeu: 1.139 ± 0.358
0.175HisMet: 0.175 ± 0.118
0.701HisAsn: 0.701 ± 0.255
0.438HisPro: 0.438 ± 0.174
0.526HisGln: 0.526 ± 0.209
0.263HisArg: 0.263 ± 0.142
1.052HisSer: 1.052 ± 0.334
0.526HisThr: 0.526 ± 0.24
0.613HisVal: 0.613 ± 0.209
0.351HisTrp: 0.351 ± 0.222
0.964HisTyr: 0.964 ± 0.259
0.0HisXaa: 0.0 ± 0.0
Ile
3.944IleAla: 3.944 ± 0.536
0.351IleCys: 0.351 ± 0.177
4.557IleAsp: 4.557 ± 0.639
5.258IleGlu: 5.258 ± 0.736
2.454IlePhe: 2.454 ± 0.594
5.17IleGly: 5.17 ± 0.787
1.052IleHis: 1.052 ± 0.344
4.908IleIle: 4.908 ± 0.679
7.624IleLys: 7.624 ± 0.777
4.732IleLeu: 4.732 ± 0.57
1.139IleMet: 1.139 ± 0.256
4.82IleAsn: 4.82 ± 0.662
2.279IlePro: 2.279 ± 0.457
3.33IleGln: 3.33 ± 0.487
1.928IleArg: 1.928 ± 0.347
6.485IleSer: 6.485 ± 0.734
4.645IleThr: 4.645 ± 0.698
3.681IleVal: 3.681 ± 0.638
0.526IleTrp: 0.526 ± 0.195
2.454IleTyr: 2.454 ± 0.548
0.0IleXaa: 0.0 ± 0.0
Lys
6.573LysAla: 6.573 ± 0.968
0.526LysCys: 0.526 ± 0.191
5.433LysAsp: 5.433 ± 0.736
7.011LysGlu: 7.011 ± 0.882
3.856LysPhe: 3.856 ± 0.641
7.361LysGly: 7.361 ± 0.743
1.753LysHis: 1.753 ± 0.416
5.258LysIle: 5.258 ± 0.731
10.604LysLys: 10.604 ± 1.191
7.975LysLeu: 7.975 ± 0.834
2.103LysMet: 2.103 ± 0.347
6.485LysAsn: 6.485 ± 0.742
2.541LysPro: 2.541 ± 0.396
4.82LysGln: 4.82 ± 0.701
4.294LysArg: 4.294 ± 0.707
5.17LysSer: 5.17 ± 0.791
5.872LysThr: 5.872 ± 0.718
4.645LysVal: 4.645 ± 0.609
0.964LysTrp: 0.964 ± 0.3
3.155LysTyr: 3.155 ± 0.607
0.0LysXaa: 0.0 ± 0.0
Leu
5.433LeuAla: 5.433 ± 0.846
0.175LeuCys: 0.175 ± 0.117
5.959LeuAsp: 5.959 ± 0.888
4.645LeuGlu: 4.645 ± 0.721
2.804LeuPhe: 2.804 ± 0.448
5.346LeuGly: 5.346 ± 0.622
0.964LeuHis: 0.964 ± 0.325
5.083LeuIle: 5.083 ± 0.552
9.727LeuLys: 9.727 ± 0.98
6.222LeuLeu: 6.222 ± 0.681
2.016LeuMet: 2.016 ± 0.424
4.382LeuAsn: 4.382 ± 0.51
2.717LeuPro: 2.717 ± 0.422
3.505LeuGln: 3.505 ± 0.478
1.928LeuArg: 1.928 ± 0.355
6.397LeuSer: 6.397 ± 0.604
4.908LeuThr: 4.908 ± 0.656
4.557LeuVal: 4.557 ± 0.523
0.701LeuTrp: 0.701 ± 0.237
2.717LeuTyr: 2.717 ± 0.392
0.0LeuXaa: 0.0 ± 0.0
Met
1.753MetAla: 1.753 ± 0.425
0.088MetCys: 0.088 ± 0.085
1.227MetAsp: 1.227 ± 0.311
1.84MetGlu: 1.84 ± 0.406
0.789MetPhe: 0.789 ± 0.27
1.315MetGly: 1.315 ± 0.426
0.175MetHis: 0.175 ± 0.122
0.964MetIle: 0.964 ± 0.244
2.804MetLys: 2.804 ± 0.684
0.789MetLeu: 0.789 ± 0.214
0.526MetMet: 0.526 ± 0.209
1.402MetAsn: 1.402 ± 0.407
0.964MetPro: 0.964 ± 0.315
1.402MetGln: 1.402 ± 0.306
1.139MetArg: 1.139 ± 0.314
1.577MetSer: 1.577 ± 0.4
2.98MetThr: 2.98 ± 0.493
0.876MetVal: 0.876 ± 0.266
0.263MetTrp: 0.263 ± 0.14
0.964MetTyr: 0.964 ± 0.357
0.0MetXaa: 0.0 ± 0.0
Asn
3.593AsnAla: 3.593 ± 0.587
0.438AsnCys: 0.438 ± 0.203
2.717AsnAsp: 2.717 ± 0.454
2.717AsnGlu: 2.717 ± 0.447
2.191AsnPhe: 2.191 ± 0.417
5.784AsnGly: 5.784 ± 1.095
0.789AsnHis: 0.789 ± 0.369
4.732AsnIle: 4.732 ± 0.64
5.959AsnLys: 5.959 ± 0.609
4.031AsnLeu: 4.031 ± 0.557
1.402AsnMet: 1.402 ± 0.299
3.593AsnAsn: 3.593 ± 0.645
2.892AsnPro: 2.892 ± 0.4
2.366AsnGln: 2.366 ± 0.531
2.366AsnArg: 2.366 ± 0.369
3.593AsnSer: 3.593 ± 0.423
2.892AsnThr: 2.892 ± 0.554
3.944AsnVal: 3.944 ± 0.583
0.876AsnTrp: 0.876 ± 0.23
3.768AsnTyr: 3.768 ± 0.688
0.0AsnXaa: 0.0 ± 0.0
Pro
1.665ProAla: 1.665 ± 0.41
0.0ProCys: 0.0 ± 0.0
2.454ProAsp: 2.454 ± 0.531
2.279ProGlu: 2.279 ± 0.404
1.84ProPhe: 1.84 ± 0.438
1.052ProGly: 1.052 ± 0.339
0.613ProHis: 0.613 ± 0.203
2.629ProIle: 2.629 ± 0.399
3.067ProLys: 3.067 ± 0.452
2.191ProLeu: 2.191 ± 0.366
0.263ProMet: 0.263 ± 0.134
1.665ProAsn: 1.665 ± 0.394
0.876ProPro: 0.876 ± 0.208
1.402ProGln: 1.402 ± 0.415
0.789ProArg: 0.789 ± 0.278
1.227ProSer: 1.227 ± 0.312
1.49ProThr: 1.49 ± 0.317
1.753ProVal: 1.753 ± 0.281
0.088ProTrp: 0.088 ± 0.068
1.577ProTyr: 1.577 ± 0.331
0.0ProXaa: 0.0 ± 0.0
Gln
4.732GlnAla: 4.732 ± 0.772
0.175GlnCys: 0.175 ± 0.117
2.454GlnAsp: 2.454 ± 0.424
3.505GlnGlu: 3.505 ± 0.541
1.928GlnPhe: 1.928 ± 0.368
2.016GlnGly: 2.016 ± 0.403
0.351GlnHis: 0.351 ± 0.194
2.191GlnIle: 2.191 ± 0.312
3.242GlnLys: 3.242 ± 0.674
3.067GlnLeu: 3.067 ± 0.37
1.139GlnMet: 1.139 ± 0.354
2.279GlnAsn: 2.279 ± 0.341
1.139GlnPro: 1.139 ± 0.354
2.103GlnGln: 2.103 ± 0.442
1.315GlnArg: 1.315 ± 0.295
2.98GlnSer: 2.98 ± 0.46
2.541GlnThr: 2.541 ± 0.376
2.454GlnVal: 2.454 ± 0.497
0.876GlnTrp: 0.876 ± 0.263
1.49GlnTyr: 1.49 ± 0.31
0.0GlnXaa: 0.0 ± 0.0
Arg
1.665ArgAla: 1.665 ± 0.492
0.613ArgCys: 0.613 ± 0.247
1.753ArgAsp: 1.753 ± 0.301
2.366ArgGlu: 2.366 ± 0.461
1.84ArgPhe: 1.84 ± 0.435
1.49ArgGly: 1.49 ± 0.268
0.175ArgHis: 0.175 ± 0.129
2.454ArgIle: 2.454 ± 0.445
3.242ArgLys: 3.242 ± 0.496
3.944ArgLeu: 3.944 ± 0.71
1.139ArgMet: 1.139 ± 0.312
1.665ArgAsn: 1.665 ± 0.397
0.789ArgPro: 0.789 ± 0.341
1.139ArgGln: 1.139 ± 0.296
1.227ArgArg: 1.227 ± 0.339
1.753ArgSer: 1.753 ± 0.315
2.016ArgThr: 2.016 ± 0.432
2.629ArgVal: 2.629 ± 0.398
0.263ArgTrp: 0.263 ± 0.173
1.665ArgTyr: 1.665 ± 0.415
0.0ArgXaa: 0.0 ± 0.0
Ser
3.505SerAla: 3.505 ± 0.551
0.175SerCys: 0.175 ± 0.118
4.732SerAsp: 4.732 ± 0.627
5.083SerGlu: 5.083 ± 0.777
3.33SerPhe: 3.33 ± 0.55
4.469SerGly: 4.469 ± 0.719
0.438SerHis: 0.438 ± 0.204
4.119SerIle: 4.119 ± 0.743
6.047SerLys: 6.047 ± 0.693
5.17SerLeu: 5.17 ± 0.611
1.753SerMet: 1.753 ± 0.359
4.645SerAsn: 4.645 ± 0.658
1.665SerPro: 1.665 ± 0.315
2.98SerGln: 2.98 ± 0.41
1.577SerArg: 1.577 ± 0.365
4.557SerSer: 4.557 ± 0.702
4.206SerThr: 4.206 ± 0.615
3.944SerVal: 3.944 ± 0.527
1.139SerTrp: 1.139 ± 0.371
2.804SerTyr: 2.804 ± 0.488
0.0SerXaa: 0.0 ± 0.0
Thr
4.206ThrAla: 4.206 ± 0.567
0.088ThrCys: 0.088 ± 0.098
3.242ThrAsp: 3.242 ± 0.597
5.17ThrGlu: 5.17 ± 0.805
3.067ThrPhe: 3.067 ± 0.504
4.557ThrGly: 4.557 ± 0.544
0.876ThrHis: 0.876 ± 0.304
3.768ThrIle: 3.768 ± 0.521
4.82ThrLys: 4.82 ± 0.609
4.294ThrLeu: 4.294 ± 0.625
1.052ThrMet: 1.052 ± 0.306
3.593ThrAsn: 3.593 ± 0.72
2.279ThrPro: 2.279 ± 0.353
1.84ThrGln: 1.84 ± 0.286
1.928ThrArg: 1.928 ± 0.408
3.242ThrSer: 3.242 ± 0.496
2.804ThrThr: 2.804 ± 0.634
4.469ThrVal: 4.469 ± 0.595
0.701ThrTrp: 0.701 ± 0.261
2.016ThrTyr: 2.016 ± 0.505
0.0ThrXaa: 0.0 ± 0.0
Val
2.804ValAla: 2.804 ± 0.555
0.263ValCys: 0.263 ± 0.143
4.557ValAsp: 4.557 ± 0.561
4.382ValGlu: 4.382 ± 0.647
2.804ValPhe: 2.804 ± 0.467
2.98ValGly: 2.98 ± 0.435
0.613ValHis: 0.613 ± 0.224
5.346ValIle: 5.346 ± 0.848
4.119ValLys: 4.119 ± 0.599
5.696ValLeu: 5.696 ± 0.87
1.227ValMet: 1.227 ± 0.36
3.593ValAsn: 3.593 ± 0.572
1.84ValPro: 1.84 ± 0.4
2.191ValGln: 2.191 ± 0.468
1.402ValArg: 1.402 ± 0.335
5.521ValSer: 5.521 ± 0.518
2.804ValThr: 2.804 ± 0.468
4.031ValVal: 4.031 ± 0.56
0.876ValTrp: 0.876 ± 0.237
2.191ValTyr: 2.191 ± 0.412
0.0ValXaa: 0.0 ± 0.0
Trp
0.876TrpAla: 0.876 ± 0.284
0.088TrpCys: 0.088 ± 0.081
0.613TrpAsp: 0.613 ± 0.213
0.526TrpGlu: 0.526 ± 0.22
0.263TrpPhe: 0.263 ± 0.135
0.789TrpGly: 0.789 ± 0.293
0.175TrpHis: 0.175 ± 0.102
1.227TrpIle: 1.227 ± 0.272
1.139TrpLys: 1.139 ± 0.261
1.139TrpLeu: 1.139 ± 0.338
0.263TrpMet: 0.263 ± 0.156
0.789TrpAsn: 0.789 ± 0.251
0.175TrpPro: 0.175 ± 0.092
1.227TrpGln: 1.227 ± 0.337
0.351TrpArg: 0.351 ± 0.165
0.876TrpSer: 0.876 ± 0.312
1.052TrpThr: 1.052 ± 0.344
1.227TrpVal: 1.227 ± 0.302
0.263TrpTrp: 0.263 ± 0.147
0.701TrpTyr: 0.701 ± 0.246
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.191TyrAla: 2.191 ± 0.372
0.263TyrCys: 0.263 ± 0.192
2.717TyrAsp: 2.717 ± 0.531
3.155TyrGlu: 3.155 ± 0.54
1.84TyrPhe: 1.84 ± 0.462
2.717TyrGly: 2.717 ± 0.476
0.701TyrHis: 0.701 ± 0.266
3.155TyrIle: 3.155 ± 0.62
3.155TyrLys: 3.155 ± 0.588
4.031TyrLeu: 4.031 ± 0.824
1.052TyrMet: 1.052 ± 0.296
2.016TyrAsn: 2.016 ± 0.443
1.753TyrPro: 1.753 ± 0.445
1.84TyrGln: 1.84 ± 0.36
2.016TyrArg: 2.016 ± 0.457
3.856TyrSer: 3.856 ± 0.574
1.928TyrThr: 1.928 ± 0.361
1.928TyrVal: 1.928 ± 0.366
0.964TyrTrp: 0.964 ± 0.245
1.577TyrTyr: 1.577 ± 0.412
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (11412 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski