Amino acid dipeptide frequency for Lactococcus phage PLgT-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide should be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
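The counting described above can be sketched in Python. This is a minimal illustration, not the code used to build this table: the function names are hypothetical, and it assumes overlapping two-residue windows that never span two proteins, one-letter input sequences, and normalization by the total number of dipeptides counted.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa (non-standard or unknown)

# Expected frequency if all 400 standard dipeptides were equally likely.
UNIFORM_EXPECTATION = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping windows, counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # Map every character outside the 20 standard letters to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq, expected=UNIFORM_EXPECTATION):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"    # would be marked red
    if freq < expected:
        return "under"   # would be marked blue
    return "expected"
```

For example, `classify(4.313)` labels the AlaAla value from the table as overrepresented, while `classify(0.259)` labels AlaCys as underrepresented.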

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
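As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into a percentage. The helper names are hypothetical and exist only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0


# AlaAla is listed as 4.313 per mille in the table.
ala_ala_fraction = per_mille_to_fraction(4.313)
ala_ala_percent = per_mille_to_percent(4.313)
```

With these helpers, the 0.25% expectation for a uniformly distributed dipeptide corresponds to 2.5‰, which is the value to compare the table entries against.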

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.313 ± 0.759
AlaCys: 0.259 ± 0.143
AlaAsp: 3.019 ± 0.516
AlaGlu: 3.795 ± 0.757
AlaPhe: 3.105 ± 0.584
AlaGly: 4.054 ± 0.7
AlaHis: 0.173 ± 0.123
AlaIle: 5.866 ± 0.694
AlaLys: 7.936 ± 1.134
AlaLeu: 6.383 ± 0.796
AlaMet: 1.984 ± 0.406
AlaAsn: 3.278 ± 0.715
AlaPro: 1.208 ± 0.367
AlaGln: 2.07 ± 0.445
AlaArg: 2.674 ± 0.533
AlaSer: 4.917 ± 0.61
AlaThr: 4.917 ± 0.887
AlaVal: 5.607 ± 0.686
AlaTrp: 1.553 ± 0.622
AlaTyr: 2.933 ± 0.521
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.431 ± 0.204
CysCys: 0.0 ± 0.0
CysAsp: 0.518 ± 0.26
CysGlu: 0.173 ± 0.136
CysPhe: 0.0 ± 0.0
CysGly: 0.431 ± 0.186
CysHis: 0.086 ± 0.076
CysIle: 0.259 ± 0.189
CysLys: 0.431 ± 0.204
CysLeu: 0.173 ± 0.125
CysMet: 0.0 ± 0.0
CysAsn: 0.173 ± 0.137
CysPro: 0.259 ± 0.169
CysGln: 0.086 ± 0.09
CysArg: 0.345 ± 0.237
CysSer: 0.431 ± 0.219
CysThr: 0.173 ± 0.116
CysVal: 0.604 ± 0.255
CysTrp: 0.086 ± 0.088
CysTyr: 0.086 ± 0.086
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.537 ± 0.586
AspCys: 0.173 ± 0.133
AspAsp: 3.105 ± 0.554
AspGlu: 5.607 ± 0.78
AspPhe: 2.588 ± 0.441
AspGly: 4.572 ± 0.512
AspHis: 0.259 ± 0.134
AspIle: 5.262 ± 0.55
AspLys: 5.952 ± 0.79
AspLeu: 5.089 ± 0.671
AspMet: 1.121 ± 0.336
AspAsn: 4.831 ± 0.646
AspPro: 1.898 ± 0.435
AspGln: 1.38 ± 0.271
AspArg: 2.415 ± 0.38
AspSer: 3.709 ± 0.58
AspThr: 2.415 ± 0.368
AspVal: 3.623 ± 0.684
AspTrp: 1.466 ± 0.312
AspTyr: 3.709 ± 0.524
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.521 ± 0.883
GluCys: 0.431 ± 0.192
GluAsp: 3.019 ± 0.553
GluGlu: 6.642 ± 1.08
GluPhe: 2.415 ± 0.43
GluGly: 3.019 ± 0.438
GluHis: 1.639 ± 0.56
GluIle: 7.073 ± 0.87
GluLys: 6.814 ± 0.868
GluLeu: 6.556 ± 0.828
GluMet: 2.329 ± 0.409
GluAsn: 4.744 ± 0.752
GluPro: 1.035 ± 0.328
GluGln: 3.278 ± 0.681
GluArg: 2.933 ± 0.511
GluSer: 3.364 ± 0.642
GluThr: 2.674 ± 0.527
GluVal: 3.795 ± 0.483
GluTrp: 1.208 ± 0.334
GluTyr: 2.415 ± 0.638
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.847 ± 0.515
PheCys: 0.259 ± 0.163
PheAsp: 3.105 ± 0.644
PheGlu: 2.588 ± 0.598
PhePhe: 0.949 ± 0.304
PheGly: 2.933 ± 0.48
PheHis: 0.518 ± 0.206
PheIle: 2.243 ± 0.395
PheLys: 3.537 ± 0.44
PheLeu: 2.933 ± 0.515
PheMet: 1.38 ± 0.366
PheAsn: 3.364 ± 0.457
PhePro: 1.208 ± 0.308
PheGln: 0.518 ± 0.178
PheArg: 1.38 ± 0.461
PheSer: 2.588 ± 0.397
PheThr: 2.502 ± 0.394
PheVal: 1.466 ± 0.423
PheTrp: 0.518 ± 0.205
PheTyr: 2.329 ± 0.484
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.658 ± 0.811
GlyCys: 0.259 ± 0.168
GlyAsp: 4.227 ± 0.541
GlyGlu: 3.968 ± 0.577
GlyPhe: 2.674 ± 0.517
GlyGly: 4.313 ± 0.83
GlyHis: 1.035 ± 0.337
GlyIle: 5.003 ± 0.766
GlyLys: 5.348 ± 0.7
GlyLeu: 5.262 ± 0.79
GlyMet: 1.811 ± 0.419
GlyAsn: 3.537 ± 0.663
GlyPro: 0.863 ± 0.277
GlyGln: 2.588 ± 0.521
GlyArg: 1.984 ± 0.369
GlySer: 5.607 ± 0.863
GlyThr: 4.227 ± 0.77
GlyVal: 4.485 ± 0.997
GlyTrp: 0.259 ± 0.148
GlyTyr: 2.243 ± 0.346
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.604 ± 0.246
HisCys: 0.086 ± 0.078
HisAsp: 0.604 ± 0.268
HisGlu: 1.035 ± 0.3
HisPhe: 0.863 ± 0.307
HisGly: 0.949 ± 0.281
HisHis: 0.086 ± 0.082
HisIle: 1.035 ± 0.36
HisLys: 0.776 ± 0.292
HisLeu: 0.431 ± 0.199
HisMet: 0.604 ± 0.225
HisAsn: 0.431 ± 0.209
HisPro: 0.69 ± 0.287
HisGln: 0.345 ± 0.189
HisArg: 0.345 ± 0.154
HisSer: 1.38 ± 0.304
HisThr: 0.345 ± 0.199
HisVal: 1.035 ± 0.271
HisTrp: 0.345 ± 0.209
HisTyr: 0.69 ± 0.221
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.089 ± 0.726
IleCys: 0.518 ± 0.194
IleAsp: 5.262 ± 0.612
IleGlu: 5.521 ± 0.796
IlePhe: 1.811 ± 0.424
IleGly: 5.693 ± 1.213
IleHis: 1.121 ± 0.361
IleIle: 3.105 ± 0.658
IleLys: 6.814 ± 0.766
IleLeu: 5.262 ± 0.656
IleMet: 1.208 ± 0.365
IleAsn: 4.399 ± 0.653
IlePro: 1.984 ± 0.44
IleGln: 3.192 ± 0.507
IleArg: 1.984 ± 0.359
IleSer: 4.658 ± 0.573
IleThr: 3.795 ± 0.905
IleVal: 4.744 ± 0.7
IleTrp: 0.776 ± 0.267
IleTyr: 3.278 ± 0.572
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.901 ± 0.732
LysCys: 0.518 ± 0.214
LysAsp: 5.434 ± 0.741
LysGlu: 6.556 ± 0.874
LysPhe: 3.192 ± 0.479
LysGly: 4.399 ± 0.695
LysHis: 1.035 ± 0.305
LysIle: 5.434 ± 0.67
LysLys: 8.54 ± 0.919
LysLeu: 7.246 ± 0.814
LysMet: 3.45 ± 0.536
LysAsn: 5.607 ± 0.719
LysPro: 2.674 ± 0.404
LysGln: 4.227 ± 0.716
LysArg: 3.795 ± 0.657
LysSer: 6.124 ± 0.954
LysThr: 6.901 ± 1.139
LysVal: 5.003 ± 0.78
LysTrp: 1.811 ± 0.479
LysTyr: 3.795 ± 0.588
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.693 ± 0.791
LeuCys: 0.345 ± 0.187
LeuAsp: 6.211 ± 0.864
LeuGlu: 7.332 ± 0.969
LeuPhe: 3.537 ± 0.609
LeuGly: 4.399 ± 0.506
LeuHis: 0.863 ± 0.344
LeuIle: 5.262 ± 0.541
LeuLys: 8.54 ± 0.958
LeuLeu: 5.607 ± 0.882
LeuMet: 1.639 ± 0.361
LeuAsn: 4.227 ± 0.743
LeuPro: 2.674 ± 0.449
LeuGln: 4.572 ± 0.68
LeuArg: 3.105 ± 0.605
LeuSer: 4.917 ± 0.572
LeuThr: 4.485 ± 0.672
LeuVal: 3.968 ± 0.649
LeuTrp: 0.69 ± 0.208
LeuTyr: 2.156 ± 0.479
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.674 ± 0.45
MetCys: 0.086 ± 0.102
MetAsp: 1.208 ± 0.259
MetGlu: 1.984 ± 0.397
MetPhe: 1.208 ± 0.322
MetGly: 1.725 ± 0.448
MetHis: 0.345 ± 0.181
MetIle: 1.294 ± 0.291
MetLys: 2.933 ± 0.522
MetLeu: 1.725 ± 0.367
MetMet: 0.69 ± 0.244
MetAsn: 2.415 ± 0.494
MetPro: 1.121 ± 0.371
MetGln: 1.121 ± 0.274
MetArg: 0.345 ± 0.18
MetSer: 1.639 ± 0.405
MetThr: 2.243 ± 0.473
MetVal: 1.121 ± 0.305
MetTrp: 0.173 ± 0.116
MetTyr: 1.035 ± 0.298
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.399 ± 0.676
AsnCys: 0.086 ± 0.088
AsnAsp: 3.537 ± 0.487
AsnGlu: 4.14 ± 0.653
AsnPhe: 2.243 ± 0.38
AsnGly: 4.831 ± 0.785
AsnHis: 0.518 ± 0.24
AsnIle: 4.313 ± 0.694
AsnLys: 6.038 ± 0.851
AsnLeu: 4.572 ± 0.767
AsnMet: 1.725 ± 0.473
AsnAsn: 4.313 ± 0.755
AsnPro: 2.588 ± 0.531
AsnGln: 3.192 ± 0.568
AsnArg: 2.76 ± 0.449
AsnSer: 3.968 ± 0.888
AsnThr: 2.502 ± 0.496
AsnVal: 3.364 ± 0.574
AsnTrp: 0.431 ± 0.172
AsnTyr: 2.847 ± 0.463
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.898 ± 0.462
ProCys: 0.0 ± 0.0
ProAsp: 2.329 ± 0.423
ProGlu: 2.329 ± 0.332
ProPhe: 1.208 ± 0.343
ProGly: 0.863 ± 0.274
ProHis: 0.69 ± 0.278
ProIle: 2.415 ± 0.406
ProLys: 2.847 ± 0.478
ProLeu: 2.156 ± 0.437
ProMet: 0.863 ± 0.333
ProAsn: 1.811 ± 0.413
ProPro: 0.69 ± 0.271
ProGln: 1.466 ± 0.332
ProArg: 0.949 ± 0.298
ProSer: 1.984 ± 0.34
ProThr: 0.863 ± 0.239
ProVal: 1.466 ± 0.318
ProTrp: 0.173 ± 0.137
ProTyr: 0.69 ± 0.223
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.795 ± 0.646
GlnCys: 0.086 ± 0.097
GlnAsp: 1.984 ± 0.346
GlnGlu: 2.933 ± 0.572
GlnPhe: 1.898 ± 0.342
GlnGly: 2.674 ± 0.484
GlnHis: 0.345 ± 0.154
GlnIle: 2.933 ± 0.528
GlnLys: 3.105 ± 0.463
GlnLeu: 3.795 ± 0.572
GlnMet: 1.121 ± 0.314
GlnAsn: 1.639 ± 0.428
GlnPro: 1.035 ± 0.332
GlnGln: 2.329 ± 0.498
GlnArg: 1.725 ± 0.313
GlnSer: 2.847 ± 0.536
GlnThr: 3.192 ± 0.465
GlnVal: 2.674 ± 0.44
GlnTrp: 0.259 ± 0.128
GlnTyr: 1.121 ± 0.303
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.725 ± 0.359
ArgCys: 0.345 ± 0.218
ArgAsp: 2.156 ± 0.44
ArgGlu: 3.019 ± 0.498
ArgPhe: 1.811 ± 0.458
ArgGly: 1.466 ± 0.328
ArgHis: 0.431 ± 0.198
ArgIle: 3.019 ± 0.529
ArgLys: 2.502 ± 0.553
ArgLeu: 3.709 ± 0.659
ArgMet: 0.949 ± 0.231
ArgAsn: 2.415 ± 0.464
ArgPro: 1.121 ± 0.248
ArgGln: 1.811 ± 0.417
ArgArg: 1.121 ± 0.392
ArgSer: 1.639 ± 0.374
ArgThr: 1.898 ± 0.397
ArgVal: 1.811 ± 0.382
ArgTrp: 0.604 ± 0.233
ArgTyr: 2.243 ± 0.488
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.572 ± 0.719
SerCys: 0.345 ± 0.196
SerAsp: 4.744 ± 0.543
SerGlu: 3.882 ± 0.543
SerPhe: 3.019 ± 0.481
SerGly: 5.779 ± 0.715
SerHis: 1.208 ± 0.305
SerIle: 5.003 ± 0.861
SerLys: 5.952 ± 0.586
SerLeu: 5.434 ± 0.734
SerMet: 2.415 ± 0.554
SerAsn: 4.744 ± 0.767
SerPro: 1.035 ± 0.271
SerGln: 2.156 ± 0.498
SerArg: 1.984 ± 0.343
SerSer: 4.572 ± 0.725
SerThr: 3.364 ± 0.601
SerVal: 3.882 ± 0.646
SerTrp: 1.121 ± 0.321
SerTyr: 2.156 ± 0.466
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.744 ± 0.809
ThrCys: 0.086 ± 0.093
ThrAsp: 3.882 ± 0.569
ThrGlu: 3.537 ± 0.522
ThrPhe: 2.502 ± 0.54
ThrGly: 4.831 ± 0.556
ThrHis: 1.208 ± 0.406
ThrIle: 3.709 ± 0.65
ThrLys: 6.038 ± 1.042
ThrLeu: 3.795 ± 0.564
ThrMet: 0.949 ± 0.264
ThrAsn: 2.502 ± 0.401
ThrPro: 2.329 ± 0.476
ThrGln: 2.243 ± 0.41
ThrArg: 1.466 ± 0.278
ThrSer: 3.623 ± 0.645
ThrThr: 3.45 ± 0.635
ThrVal: 3.537 ± 0.67
ThrTrp: 0.863 ± 0.246
ThrTyr: 1.984 ± 0.401
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.14 ± 0.718
ValCys: 0.345 ± 0.174
ValAsp: 4.917 ± 0.798
ValGlu: 3.192 ± 0.519
ValPhe: 2.07 ± 0.387
ValGly: 3.795 ± 0.543
ValHis: 0.518 ± 0.201
ValIle: 4.399 ± 0.656
ValLys: 4.831 ± 0.612
ValLeu: 5.176 ± 0.813
ValMet: 0.863 ± 0.245
ValAsn: 4.399 ± 0.721
ValPro: 1.553 ± 0.33
ValGln: 2.329 ± 0.342
ValArg: 1.725 ± 0.336
ValSer: 4.917 ± 0.606
ValThr: 3.795 ± 0.852
ValVal: 3.968 ± 0.686
ValTrp: 0.604 ± 0.203
ValTyr: 2.329 ± 0.524
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.259 ± 0.147
TrpCys: 0.259 ± 0.158
TrpAsp: 0.69 ± 0.253
TrpGlu: 0.604 ± 0.279
TrpPhe: 0.69 ± 0.217
TrpGly: 0.863 ± 0.264
TrpHis: 0.0 ± 0.0
TrpIle: 0.345 ± 0.186
TrpLys: 0.69 ± 0.223
TrpLeu: 1.811 ± 0.316
TrpMet: 0.431 ± 0.183
TrpAsn: 1.294 ± 0.386
TrpPro: 0.0 ± 0.0
TrpGln: 0.776 ± 0.227
TrpArg: 0.863 ± 0.222
TrpSer: 1.466 ± 0.341
TrpThr: 0.863 ± 0.287
TrpVal: 1.121 ± 0.352
TrpTrp: 0.086 ± 0.086
TrpTyr: 0.431 ± 0.221
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.243 ± 0.481
TyrCys: 0.259 ± 0.209
TyrAsp: 2.76 ± 0.564
TyrGlu: 2.156 ± 0.451
TyrPhe: 1.38 ± 0.337
TyrGly: 2.76 ± 0.374
TyrHis: 0.604 ± 0.208
TyrIle: 2.243 ± 0.541
TyrLys: 3.019 ± 0.568
TyrLeu: 3.105 ± 0.509
TyrMet: 1.466 ± 0.369
TyrAsn: 2.243 ± 0.409
TyrPro: 1.725 ± 0.396
TyrGln: 1.811 ± 0.267
TyrArg: 1.898 ± 0.437
TyrSer: 3.019 ± 0.518
TyrThr: 2.674 ± 0.394
TyrVal: 2.588 ± 0.486
TyrTrp: 0.518 ± 0.214
TyrTyr: 1.38 ± 0.431
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (11594 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
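One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. This is an assumption about the procedure, not the database's actual code; the function names, the fixed seed, and the use of the sample standard deviation as the error measure are all illustrative choices.

```python
import random
import statistics


def dipeptide_freq(proteins, dipeptide):
    """Per mille frequency of one dipeptide over overlapping windows."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each round draws len(proteins) proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, dipeptide))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimate reflects how much the frequency depends on which proteins happen to be in the proteome.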

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski