Amino acid dipeptide frequency for Lactococcus phage CHPC959

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
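As an illustration of how such frequencies can be computed, here is a minimal Python sketch. It assumes sequences are given in one-letter code and that the frequency is normalized by the total number of adjacent pairs; the exact normalization used by Proteome-pI is not stated on this page, so treat that as an assumption. The function name and the toy sequences are hypothetical.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs.

    Pairs are counted within each protein only (never across protein
    boundaries), and overlapping pairs are all counted: "MKKL" yields
    MK, KK and KL.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of counted pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the CHPC959 proteome)
freqs = dipeptide_per_mille(["MKKL", "MKE"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

In the toy input, MK occurs in both sequences, so it accounts for two of the five counted pairs.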

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and underrepresented ones in blue, in the table.
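The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the function names are hypothetical, and the exact rule the page uses to colour cells (for example, whether the error estimate is taken into account) is not stated, so a plain comparison with the expected value is assumed.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(observed_per_mille, alphabet_size=20):
    """Label a value 'over', 'under' or 'equal' relative to the uniform expectation.

    Assumption: a simple threshold comparison, ignoring the error estimate.
    """
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "equal"

print(expected_per_mille(20))  # 2.5 (i.e. 0.25%)
print(classify(11.129))        # GluLeu value from the table: over
print(classify(0.116))         # GlyPro value from the table: under
```

With Xaa included as a 21st symbol, `expected_per_mille(21)` gives roughly 2.27‰ instead of 2.5‰.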

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
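For readers who prefer fractions or percentages, the unit conversion is a one-liner. The helper names below are hypothetical; the example value is the LysLys entry from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value * 0.1

lyslys = 9.274  # LysLys frequency in per mille, as listed in the table
print(f"{per_mille_to_fraction(lyslys):.6f}")  # 0.009274
print(f"{per_mille_to_percent(lyslys):.4f}")   # 0.9274
```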

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.348 ± 0.165
AlaCys: 0.116 ± 0.098
AlaAsp: 2.898 ± 0.555
AlaGlu: 4.753 ± 0.755
AlaPhe: 2.666 ± 0.699
AlaGly: 3.826 ± 0.82
AlaHis: 0.812 ± 0.347
AlaIle: 4.985 ± 1.176
AlaLys: 5.565 ± 0.97
AlaLeu: 7.072 ± 1.176
AlaMet: 1.623 ± 0.557
AlaAsn: 4.753 ± 0.847
AlaPro: 1.391 ± 0.408
AlaGln: 1.971 ± 0.505
AlaArg: 1.739 ± 0.421
AlaSer: 3.014 ± 0.704
AlaThr: 3.014 ± 0.858
AlaVal: 3.942 ± 0.902
AlaTrp: 1.275 ± 0.402
AlaTyr: 2.203 ± 0.46
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.348 ± 0.2
CysCys: 0.232 ± 0.226
CysAsp: 0.464 ± 0.227
CysGlu: 0.348 ± 0.148
CysPhe: 0.348 ± 0.205
CysGly: 1.159 ± 0.432
CysHis: 0.232 ± 0.141
CysIle: 0.464 ± 0.231
CysLys: 0.927 ± 0.406
CysLeu: 0.232 ± 0.174
CysMet: 0.116 ± 0.113
CysAsn: 0.464 ± 0.235
CysPro: 0.348 ± 0.192
CysGln: 0.232 ± 0.147
CysArg: 0.58 ± 0.276
CysSer: 0.232 ± 0.169
CysThr: 0.232 ± 0.165
CysVal: 0.58 ± 0.231
CysTrp: 0.232 ± 0.151
CysTyr: 0.116 ± 0.11
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.391 ± 0.429
AspCys: 0.232 ± 0.151
AspAsp: 4.173 ± 0.823
AspGlu: 4.289 ± 0.791
AspPhe: 3.478 ± 0.529
AspGly: 3.826 ± 0.871
AspHis: 0.927 ± 0.312
AspIle: 3.826 ± 0.718
AspLys: 4.637 ± 0.697
AspLeu: 5.912 ± 0.799
AspMet: 0.927 ± 0.289
AspAsn: 4.173 ± 0.653
AspPro: 1.623 ± 0.434
AspGln: 0.812 ± 0.322
AspArg: 2.087 ± 0.54
AspSer: 3.13 ± 0.612
AspThr: 4.637 ± 0.778
AspVal: 3.246 ± 0.678
AspTrp: 0.58 ± 0.237
AspTyr: 2.55 ± 0.548
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.478 ± 0.644
GluCys: 0.58 ± 0.271
GluAsp: 3.942 ± 0.623
GluGlu: 5.796 ± 1.02
GluPhe: 4.173 ± 0.689
GluGly: 2.666 ± 0.432
GluHis: 0.696 ± 0.317
GluIle: 5.217 ± 0.765
GluLys: 6.26 ± 0.952
GluLeu: 11.129 ± 1.213
GluMet: 2.782 ± 0.548
GluAsn: 5.217 ± 0.777
GluPro: 1.391 ± 0.399
GluGln: 3.478 ± 0.634
GluArg: 2.782 ± 0.604
GluSer: 3.594 ± 0.581
GluThr: 4.405 ± 0.708
GluVal: 3.942 ± 0.621
GluTrp: 1.043 ± 0.313
GluTyr: 3.594 ± 0.719
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.898 ± 0.757
PheCys: 0.232 ± 0.18
PheAsp: 3.478 ± 0.79
PheGlu: 2.55 ± 0.511
PhePhe: 2.087 ± 0.671
PheGly: 2.782 ± 0.663
PheHis: 0.58 ± 0.235
PheIle: 3.362 ± 0.615
PheLys: 5.101 ± 0.648
PheLeu: 2.55 ± 0.56
PheMet: 1.159 ± 0.365
PheAsn: 3.362 ± 0.695
PhePro: 0.696 ± 0.249
PheGln: 1.043 ± 0.314
PheArg: 1.159 ± 0.266
PheSer: 4.405 ± 0.79
PheThr: 2.782 ± 0.461
PheVal: 2.666 ± 0.448
PheTrp: 0.232 ± 0.137
PheTyr: 1.971 ± 0.423
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.173 ± 1.464
GlyCys: 0.464 ± 0.22
GlyAsp: 2.782 ± 0.586
GlyGlu: 4.405 ± 0.587
GlyPhe: 2.319 ± 0.688
GlyGly: 4.637 ± 1.119
GlyHis: 0.927 ± 0.356
GlyIle: 4.058 ± 1.126
GlyLys: 6.724 ± 0.706
GlyLeu: 5.333 ± 1.089
GlyMet: 1.159 ± 0.384
GlyAsn: 3.246 ± 0.61
GlyPro: 0.116 ± 0.096
GlyGln: 2.435 ± 0.466
GlyArg: 2.203 ± 0.496
GlySer: 4.637 ± 0.994
GlyThr: 3.478 ± 0.89
GlyVal: 4.985 ± 0.984
GlyTrp: 1.391 ± 0.336
GlyTyr: 3.362 ± 0.698
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.043 ± 0.293
HisCys: 0.812 ± 0.41
HisAsp: 0.696 ± 0.275
HisGlu: 0.58 ± 0.226
HisPhe: 0.696 ± 0.305
HisGly: 0.696 ± 0.289
HisHis: 0.116 ± 0.105
HisIle: 0.696 ± 0.269
HisLys: 1.043 ± 0.381
HisLeu: 1.391 ± 0.445
HisMet: 0.232 ± 0.209
HisAsn: 1.855 ± 0.526
HisPro: 0.116 ± 0.126
HisGln: 0.348 ± 0.203
HisArg: 0.232 ± 0.244
HisSer: 0.116 ± 0.096
HisThr: 0.927 ± 0.312
HisVal: 1.043 ± 0.339
HisTrp: 0.232 ± 0.244
HisTyr: 0.58 ± 0.265
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.173 ± 0.708
IleCys: 0.348 ± 0.218
IleAsp: 4.173 ± 0.507
IleGlu: 6.376 ± 0.887
IlePhe: 3.594 ± 0.66
IleGly: 3.71 ± 0.918
IleHis: 1.043 ± 0.356
IleIle: 4.753 ± 0.634
IleLys: 7.188 ± 0.764
IleLeu: 4.637 ± 0.926
IleMet: 1.507 ± 0.527
IleAsn: 4.869 ± 0.54
IlePro: 1.043 ± 0.327
IleGln: 1.855 ± 0.369
IleArg: 1.971 ± 0.446
IleSer: 3.246 ± 0.753
IleThr: 5.217 ± 0.781
IleVal: 3.942 ± 0.616
IleTrp: 0.927 ± 0.324
IleTyr: 2.55 ± 0.539
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.912 ± 0.831
LysCys: 0.696 ± 0.329
LysAsp: 4.985 ± 0.728
LysGlu: 9.158 ± 1.374
LysPhe: 2.898 ± 0.539
LysGly: 6.492 ± 0.979
LysHis: 1.391 ± 0.427
LysIle: 5.681 ± 0.849
LysLys: 9.274 ± 1.168
LysLeu: 7.304 ± 0.861
LysMet: 3.478 ± 0.5
LysAsn: 4.869 ± 0.81
LysPro: 2.087 ± 0.551
LysGln: 3.246 ± 0.59
LysArg: 3.362 ± 0.674
LysSer: 5.449 ± 0.787
LysThr: 5.796 ± 0.92
LysVal: 5.912 ± 0.7
LysTrp: 0.927 ± 0.265
LysTyr: 3.942 ± 0.646
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.101 ± 0.776
LeuCys: 0.348 ± 0.231
LeuAsp: 4.637 ± 0.557
LeuGlu: 6.144 ± 0.857
LeuPhe: 3.014 ± 0.559
LeuGly: 5.217 ± 0.816
LeuHis: 1.391 ± 0.427
LeuIle: 6.608 ± 0.897
LeuLys: 8.463 ± 1.041
LeuLeu: 6.608 ± 1.007
LeuMet: 2.782 ± 0.746
LeuAsn: 5.681 ± 0.86
LeuPro: 2.666 ± 0.571
LeuGln: 3.014 ± 0.548
LeuArg: 3.13 ± 0.591
LeuSer: 6.144 ± 1.005
LeuThr: 6.028 ± 0.624
LeuVal: 5.796 ± 0.655
LeuTrp: 1.159 ± 0.382
LeuTyr: 4.289 ± 0.844
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.319 ± 0.524
MetCys: 0.116 ± 0.131
MetAsp: 1.623 ± 0.36
MetGlu: 2.435 ± 0.618
MetPhe: 0.696 ± 0.334
MetGly: 1.043 ± 0.303
MetHis: 0.348 ± 0.176
MetIle: 2.55 ± 0.624
MetLys: 2.435 ± 0.444
MetLeu: 1.623 ± 0.652
MetMet: 0.348 ± 0.178
MetAsn: 1.971 ± 0.468
MetPro: 0.696 ± 0.334
MetGln: 1.855 ± 0.35
MetArg: 0.348 ± 0.225
MetSer: 1.623 ± 0.395
MetThr: 1.855 ± 0.462
MetVal: 1.623 ± 0.371
MetTrp: 0.232 ± 0.178
MetTyr: 1.275 ± 0.41
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.869 ± 0.967
AsnCys: 0.116 ± 0.123
AsnAsp: 3.942 ± 0.73
AsnGlu: 4.753 ± 0.682
AsnPhe: 1.971 ± 0.551
AsnGly: 6.492 ± 0.912
AsnHis: 1.159 ± 0.398
AsnIle: 4.058 ± 0.619
AsnLys: 6.376 ± 1.19
AsnLeu: 5.565 ± 0.685
AsnMet: 1.739 ± 0.417
AsnAsn: 3.246 ± 0.602
AsnPro: 2.203 ± 0.42
AsnGln: 1.855 ± 0.436
AsnArg: 1.855 ± 0.368
AsnSer: 4.058 ± 0.472
AsnThr: 4.173 ± 0.856
AsnVal: 3.71 ± 0.632
AsnTrp: 1.275 ± 0.322
AsnTyr: 2.55 ± 0.527
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.623 ± 0.486
ProCys: 0.116 ± 0.119
ProAsp: 1.507 ± 0.4
ProGlu: 1.855 ± 0.52
ProPhe: 1.275 ± 0.3
ProGly: 0.348 ± 0.188
ProHis: 0.0 ± 0.0
ProIle: 1.623 ± 0.335
ProLys: 1.855 ± 0.485
ProLeu: 1.623 ± 0.407
ProMet: 0.58 ± 0.232
ProAsn: 2.319 ± 0.767
ProPro: 0.464 ± 0.278
ProGln: 0.348 ± 0.243
ProArg: 0.58 ± 0.226
ProSer: 1.391 ± 0.527
ProThr: 2.319 ± 0.49
ProVal: 1.275 ± 0.408
ProTrp: 0.232 ± 0.161
ProTyr: 0.927 ± 0.326
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.014 ± 0.774
GlnCys: 0.116 ± 0.106
GlnAsp: 1.275 ± 0.395
GlnGlu: 2.087 ± 0.554
GlnPhe: 1.623 ± 0.375
GlnGly: 1.855 ± 0.476
GlnHis: 0.232 ± 0.162
GlnIle: 1.507 ± 0.305
GlnLys: 2.782 ± 0.638
GlnLeu: 3.826 ± 0.712
GlnMet: 0.927 ± 0.333
GlnAsn: 1.971 ± 0.452
GlnPro: 1.275 ± 0.393
GlnGln: 1.739 ± 0.513
GlnArg: 1.275 ± 0.381
GlnSer: 2.203 ± 0.372
GlnThr: 2.898 ± 0.58
GlnVal: 2.203 ± 0.477
GlnTrp: 0.58 ± 0.313
GlnTyr: 1.275 ± 0.314
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.087 ± 0.513
ArgCys: 0.464 ± 0.249
ArgAsp: 1.507 ± 0.359
ArgGlu: 2.203 ± 0.435
ArgPhe: 0.812 ± 0.292
ArgGly: 1.971 ± 0.374
ArgHis: 0.812 ± 0.289
ArgIle: 1.855 ± 0.463
ArgLys: 4.173 ± 0.687
ArgLeu: 4.173 ± 0.663
ArgMet: 0.696 ± 0.329
ArgAsn: 2.319 ± 0.519
ArgPro: 0.812 ± 0.284
ArgGln: 1.391 ± 0.361
ArgArg: 1.971 ± 0.417
ArgSer: 2.087 ± 0.581
ArgThr: 1.391 ± 0.304
ArgVal: 1.739 ± 0.468
ArgTrp: 0.348 ± 0.176
ArgTyr: 1.739 ± 0.399
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.637 ± 1.374
SerCys: 0.696 ± 0.335
SerAsp: 3.71 ± 0.572
SerGlu: 3.13 ± 0.447
SerPhe: 3.478 ± 0.859
SerGly: 4.869 ± 1.6
SerHis: 0.927 ± 0.306
SerIle: 4.289 ± 0.715
SerLys: 5.449 ± 0.81
SerLeu: 4.985 ± 0.612
SerMet: 1.855 ± 0.332
SerAsn: 3.594 ± 0.64
SerPro: 0.812 ± 0.293
SerGln: 1.971 ± 0.382
SerArg: 2.55 ± 0.413
SerSer: 4.289 ± 0.867
SerThr: 3.362 ± 0.654
SerVal: 4.289 ± 0.621
SerTrp: 0.812 ± 0.369
SerTyr: 2.782 ± 0.59
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.405 ± 0.687
ThrCys: 0.58 ± 0.241
ThrAsp: 3.478 ± 0.777
ThrGlu: 5.796 ± 0.581
ThrPhe: 2.898 ± 0.751
ThrGly: 4.753 ± 0.706
ThrHis: 0.232 ± 0.168
ThrIle: 3.826 ± 0.715
ThrLys: 4.753 ± 0.699
ThrLeu: 6.028 ± 0.82
ThrMet: 1.275 ± 0.322
ThrAsn: 4.289 ± 0.623
ThrPro: 1.739 ± 0.398
ThrGln: 2.666 ± 0.482
ThrArg: 1.739 ± 0.484
ThrSer: 4.521 ± 0.666
ThrThr: 3.478 ± 0.55
ThrVal: 4.637 ± 0.756
ThrTrp: 1.159 ± 0.374
ThrTyr: 2.435 ± 0.633
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.826 ± 0.667
ValCys: 0.696 ± 0.283
ValAsp: 4.521 ± 0.777
ValGlu: 4.985 ± 0.497
ValPhe: 3.362 ± 0.497
ValGly: 2.898 ± 0.626
ValHis: 0.464 ± 0.239
ValIle: 3.362 ± 0.559
ValLys: 6.26 ± 0.742
ValLeu: 2.898 ± 0.495
ValMet: 2.319 ± 0.498
ValAsn: 3.014 ± 0.559
ValPro: 1.623 ± 0.467
ValGln: 2.087 ± 0.524
ValArg: 3.13 ± 0.535
ValSer: 5.333 ± 1.049
ValThr: 4.637 ± 0.65
ValVal: 3.71 ± 0.609
ValTrp: 0.696 ± 0.237
ValTyr: 3.014 ± 0.487
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.58 ± 0.313
TrpCys: 0.348 ± 0.231
TrpAsp: 0.464 ± 0.306
TrpGlu: 0.696 ± 0.217
TrpPhe: 0.927 ± 0.335
TrpGly: 0.696 ± 0.323
TrpHis: 0.348 ± 0.262
TrpIle: 0.464 ± 0.225
TrpLys: 0.812 ± 0.282
TrpLeu: 1.159 ± 0.367
TrpMet: 0.464 ± 0.217
TrpAsn: 1.507 ± 0.377
TrpPro: 0.0 ± 0.028
TrpGln: 0.696 ± 0.223
TrpArg: 0.58 ± 0.291
TrpSer: 1.043 ± 0.276
TrpThr: 0.812 ± 0.287
TrpVal: 0.927 ± 0.376
TrpTrp: 0.232 ± 0.136
TrpTyr: 0.927 ± 0.301
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.739 ± 0.509
TyrCys: 0.58 ± 0.304
TyrAsp: 2.203 ± 0.656
TyrGlu: 3.826 ± 0.64
TyrPhe: 2.782 ± 0.583
TyrGly: 2.898 ± 0.605
TyrHis: 0.812 ± 0.367
TyrIle: 3.942 ± 0.86
TyrLys: 2.782 ± 0.694
TyrLeu: 4.173 ± 0.873
TyrMet: 0.927 ± 0.363
TyrAsn: 3.246 ± 0.484
TyrPro: 1.159 ± 0.359
TyrGln: 1.739 ± 0.508
TyrArg: 1.275 ± 0.388
TyrSer: 1.971 ± 0.524
TyrThr: 3.246 ± 0.769
TyrVal: 2.666 ± 0.514
TyrTrp: 0.116 ± 0.119
TyrTyr: 1.739 ± 0.442
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 59 proteins (8627 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
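A protein-level bootstrap of this kind can be sketched in Python as below. This is a minimal illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the sample standard deviation of the replicates is taken as the error. Which spread measure the page actually reports is not stated; the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count all overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Spread of one dipeptide's per mille frequency across bootstrap replicates.

    Each replicate draws len(sequences) whole proteins with replacement.
    Assumption: the error is the sample standard deviation of the replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)

# Toy input (hypothetical sequences in one-letter code)
toy = ["MKKL", "MKE", "AAKK", "GLLM"]
print(bootstrap_error(toy, "KK"))
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, so the estimate reflects variation between proteins.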

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski