Amino acid dipeptide frequency for Lactococcus phage 49801

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
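
The counting described above can be sketched in a few lines of Python. This is only an illustration, not the database's own code: the function names, the normalization by the total number of overlapping residue pairs within each protein, and the use of the uniform 2.5‰ baseline as the over/under threshold are all assumptions.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}

# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000 / 400


def to_three(letter):
    """Map a one-letter code to its three-letter code; anything else becomes Xaa."""
    return ONE_TO_THREE.get(letter.upper(), "Xaa")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 combinations.

    Assumption: overlapping pairs are counted within each protein and
    normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[to_three(first) + to_three(second)] += 1
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille baseline (assumed threshold)."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "as expected"
```

With this baseline, the AlaAla value of 2.851‰ would be classified as overrepresented and the AlaCys value of 0.46‰ as underrepresented.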

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
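
As a small worked example of the unit conversion, the following sketch turns a per mille value into a fraction or a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10
```

For instance, the AlaAla value of 2.851‰ corresponds to a fraction of about 0.002851, or about 0.2851%, which can be compared directly with the 0.25% expected for a uniform distribution.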

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error). Rows give the first residue of the dipeptide, columns the second.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.851 ± 0.657 | 0.46 ± 0.186 | 3.771 ± 0.662 | 4.507 ± 0.876 | 2.759 ± 0.453 | 3.587 ± 0.794 | 0.552 ± 0.212 | 4.415 ± 1.016 | 5.426 ± 0.577 | 5.61 ± 0.741 | 1.839 ± 0.345 | 4.599 ± 0.64 | 1.104 ± 0.353 | 2.759 ± 0.518 | 2.299 ± 0.506 | 3.495 ± 0.493 | 3.679 ± 0.57 | 3.403 ± 0.7 | 1.747 ± 0.535 | 1.931 ± 0.408 | 0.0 ± 0.0
Cys | 0.184 ± 0.138 | 0.0 ± 0.0 | 0.644 ± 0.303 | 0.276 ± 0.145 | 0.46 ± 0.195 | 0.368 ± 0.175 | 0.0 ± 0.0 | 0.184 ± 0.12 | 0.552 ± 0.218 | 0.46 ± 0.217 | 0.0 ± 0.0 | 0.276 ± 0.171 | 0.184 ± 0.122 | 0.0 ± 0.0 | 0.276 ± 0.136 | 0.736 ± 0.277 | 0.184 ± 0.12 | 0.276 ± 0.163 | 0.184 ± 0.135 | 0.184 ± 0.113 | 0.0 ± 0.0
Asp | 2.851 ± 0.457 | 0.276 ± 0.164 | 4.782 ± 0.941 | 5.702 ± 0.936 | 3.035 ± 0.546 | 4.966 ± 0.966 | 0.46 ± 0.196 | 4.874 ± 0.672 | 5.794 ± 0.641 | 4.415 ± 0.567 | 1.472 ± 0.436 | 2.943 ± 0.408 | 1.196 ± 0.405 | 1.104 ± 0.306 | 2.299 ± 0.354 | 4.691 ± 0.641 | 4.139 ± 0.564 | 3.863 ± 0.678 | 1.38 ± 0.282 | 1.931 ± 0.365 | 0.0 ± 0.0
Glu | 3.679 ± 0.714 | 0.276 ± 0.14 | 3.311 ± 0.43 | 5.978 ± 1.229 | 3.863 ± 0.581 | 3.219 ± 0.621 | 1.104 ± 0.345 | 4.507 ± 0.539 | 8.001 ± 1.385 | 8.001 ± 1.13 | 1.839 ± 0.441 | 5.058 ± 0.807 | 2.575 ± 0.578 | 3.771 ± 0.665 | 2.759 ± 0.499 | 3.035 ± 0.515 | 3.955 ± 0.7 | 5.886 ± 0.973 | 1.012 ± 0.298 | 4.415 ± 0.733 | 0.0 ± 0.0
Phe | 2.207 ± 0.503 | 0.552 ± 0.213 | 3.955 ± 0.505 | 2.943 ± 0.602 | 1.655 ± 0.404 | 2.667 ± 0.574 | 0.46 ± 0.174 | 3.403 ± 0.603 | 4.139 ± 0.601 | 2.759 ± 0.51 | 1.747 ± 0.502 | 3.127 ± 0.581 | 0.736 ± 0.299 | 1.472 ± 0.394 | 1.472 ± 0.407 | 3.219 ± 0.487 | 3.035 ± 0.603 | 2.483 ± 0.417 | 0.46 ± 0.21 | 2.115 ± 0.408 | 0.0 ± 0.0
Gly | 3.311 ± 0.689 | 0.368 ± 0.171 | 3.495 ± 0.509 | 3.679 ± 0.586 | 2.943 ± 0.53 | 4.139 ± 0.801 | 0.736 ± 0.243 | 4.782 ± 0.746 | 6.254 ± 0.68 | 5.242 ± 1.255 | 1.747 ± 0.525 | 2.943 ± 0.759 | 1.012 ± 0.341 | 2.483 ± 0.64 | 2.943 ± 0.557 | 3.863 ± 0.867 | 4.966 ± 0.88 | 3.219 ± 0.726 | 0.46 ± 0.235 | 3.587 ± 0.559 | 0.0 ± 0.0
His | 1.196 ± 0.362 | 0.184 ± 0.144 | 0.736 ± 0.214 | 1.472 ± 0.337 | 0.644 ± 0.221 | 0.736 ± 0.229 | 0.184 ± 0.129 | 0.184 ± 0.119 | 0.92 ± 0.307 | 1.104 ± 0.35 | 0.184 ± 0.13 | 0.92 ± 0.296 | 0.552 ± 0.2 | 0.552 ± 0.208 | 0.276 ± 0.159 | 0.736 ± 0.283 | 0.368 ± 0.18 | 0.828 ± 0.219 | 0.276 ± 0.17 | 0.552 ± 0.221 | 0.0 ± 0.0
Ile | 4.139 ± 0.762 | 0.46 ± 0.163 | 4.139 ± 0.621 | 6.346 ± 0.933 | 2.391 ± 0.405 | 3.495 ± 0.672 | 0.644 ± 0.251 | 3.311 ± 0.685 | 7.91 ± 0.759 | 4.231 ± 0.585 | 1.38 ± 0.313 | 4.323 ± 0.67 | 2.023 ± 0.418 | 2.851 ± 0.513 | 2.391 ± 0.367 | 5.15 ± 0.828 | 4.507 ± 0.606 | 3.495 ± 0.707 | 0.828 ± 0.307 | 2.023 ± 0.482 | 0.0 ± 0.0
Lys | 7.082 ± 1.06 | 0.184 ± 0.126 | 5.334 ± 0.587 | 7.082 ± 1.049 | 3.127 ± 0.499 | 6.162 ± 0.855 | 1.931 ± 0.409 | 6.07 ± 0.876 | 9.381 ± 1.286 | 8.645 ± 0.99 | 2.207 ± 0.456 | 5.426 ± 0.683 | 2.207 ± 0.456 | 4.966 ± 0.742 | 4.047 ± 0.754 | 5.15 ± 0.667 | 5.61 ± 0.898 | 3.863 ± 0.771 | 1.104 ± 0.332 | 3.863 ± 0.677 | 0.0 ± 0.0
Leu | 4.599 ± 0.749 | 0.828 ± 0.327 | 5.702 ± 0.632 | 5.334 ± 0.834 | 2.759 ± 0.533 | 4.966 ± 0.681 | 1.196 ± 0.41 | 5.242 ± 0.68 | 7.634 ± 1.071 | 6.99 ± 0.801 | 2.299 ± 0.447 | 5.61 ± 0.635 | 3.587 ± 0.524 | 4.139 ± 0.641 | 2.115 ± 0.541 | 5.702 ± 0.587 | 5.058 ± 0.83 | 3.035 ± 0.518 | 1.472 ± 0.666 | 2.667 ± 0.508 | 0.0 ± 0.0
Met | 2.483 ± 0.358 | 0.092 ± 0.093 | 1.288 ± 0.335 | 2.207 ± 0.572 | 0.736 ± 0.244 | 1.104 ± 0.392 | 0.184 ± 0.13 | 1.564 ± 0.395 | 2.115 ± 0.587 | 1.564 ± 0.377 | 0.368 ± 0.194 | 2.115 ± 0.438 | 0.736 ± 0.261 | 1.564 ± 0.369 | 1.012 ± 0.291 | 1.931 ± 0.401 | 3.035 ± 0.586 | 1.104 ± 0.321 | 0.276 ± 0.144 | 0.276 ± 0.205 | 0.0 ± 0.0
Asn | 4.047 ± 0.752 | 0.276 ± 0.136 | 3.035 ± 0.454 | 4.507 ± 0.785 | 2.759 ± 0.412 | 5.978 ± 0.958 | 0.552 ± 0.329 | 3.035 ± 0.474 | 5.518 ± 0.563 | 6.07 ± 0.708 | 1.472 ± 0.38 | 4.874 ± 0.738 | 1.839 ± 0.362 | 2.759 ± 0.439 | 1.931 ± 0.353 | 3.863 ± 0.626 | 2.943 ± 0.555 | 3.679 ± 0.523 | 0.736 ± 0.253 | 2.023 ± 0.539 | 0.0 ± 0.0
Pro | 1.196 ± 0.315 | 0.092 ± 0.083 | 1.931 ± 0.508 | 2.391 ± 0.436 | 1.655 ± 0.385 | 0.828 ± 0.233 | 0.644 ± 0.196 | 1.839 ± 0.443 | 2.851 ± 0.517 | 2.207 ± 0.412 | 0.46 ± 0.214 | 1.655 ± 0.393 | 0.644 ± 0.21 | 1.012 ± 0.315 | 0.736 ± 0.28 | 1.747 ± 0.502 | 1.472 ± 0.332 | 1.931 ± 0.46 | 0.276 ± 0.17 | 0.92 ± 0.245 | 0.0 ± 0.0
Gln | 3.863 ± 0.689 | 0.276 ± 0.146 | 1.196 ± 0.406 | 3.587 ± 0.567 | 1.747 ± 0.408 | 2.575 ± 0.584 | 0.368 ± 0.182 | 2.943 ± 0.518 | 3.127 ± 0.539 | 3.035 ± 0.448 | 1.38 ± 0.357 | 2.483 ± 0.461 | 1.564 ± 0.453 | 2.023 ± 0.432 | 1.472 ± 0.376 | 2.115 ± 0.547 | 3.311 ± 0.45 | 3.311 ± 0.598 | 0.92 ± 0.286 | 1.747 ± 0.295 | 0.0 ± 0.0
Arg | 2.023 ± 0.383 | 0.368 ± 0.205 | 2.023 ± 0.439 | 2.667 ± 0.5 | 2.115 ± 0.447 | 1.747 ± 0.414 | 0.184 ± 0.149 | 2.943 ± 0.517 | 4.231 ± 0.7 | 3.955 ± 0.661 | 0.92 ± 0.279 | 2.299 ± 0.436 | 1.012 ± 0.345 | 1.104 ± 0.316 | 1.38 ± 0.385 | 1.931 ± 0.357 | 1.931 ± 0.393 | 2.391 ± 0.383 | 0.552 ± 0.231 | 1.564 ± 0.35 | 0.0 ± 0.0
Ser | 4.323 ± 1.056 | 0.184 ± 0.127 | 5.61 ± 0.474 | 5.15 ± 0.683 | 3.403 ± 0.507 | 4.691 ± 0.726 | 1.104 ± 0.286 | 3.771 ± 0.561 | 3.955 ± 0.566 | 3.219 ± 0.484 | 1.931 ± 0.415 | 3.679 ± 0.579 | 1.472 ± 0.347 | 3.311 ± 0.464 | 2.115 ± 0.445 | 4.415 ± 0.698 | 3.219 ± 0.508 | 4.966 ± 0.612 | 1.104 ± 0.265 | 2.851 ± 0.52 | 0.0 ± 0.0
Thr | 4.874 ± 0.62 | 0.184 ± 0.128 | 3.863 ± 0.51 | 4.599 ± 0.815 | 3.127 ± 0.484 | 4.782 ± 0.678 | 0.552 ± 0.202 | 5.426 ± 0.793 | 5.518 ± 0.7 | 4.231 ± 0.628 | 1.38 ± 0.358 | 2.759 ± 0.541 | 1.288 ± 0.33 | 1.931 ± 0.373 | 2.943 ± 0.574 | 3.495 ± 0.587 | 4.966 ± 0.55 | 4.599 ± 0.834 | 0.552 ± 0.285 | 2.023 ± 0.423 | 0.0 ± 0.0
Val | 2.667 ± 0.535 | 0.092 ± 0.09 | 4.139 ± 0.82 | 4.874 ± 0.753 | 2.391 ± 0.364 | 3.311 ± 0.633 | 0.92 ± 0.284 | 3.955 ± 0.586 | 5.702 ± 0.82 | 4.139 ± 0.569 | 1.655 ± 0.414 | 3.863 ± 0.689 | 1.196 ± 0.372 | 2.483 ± 0.578 | 1.931 ± 0.421 | 4.874 ± 0.545 | 4.323 ± 0.793 | 4.507 ± 0.633 | 0.736 ± 0.196 | 1.839 ± 0.398 | 0.0 ± 0.0
Trp | 1.196 ± 0.309 | 0.0 ± 0.0 | 0.736 ± 0.421 | 1.012 ± 0.325 | 0.92 ± 0.244 | 0.644 ± 0.218 | 0.184 ± 0.118 | 1.196 ± 0.258 | 1.38 ± 0.367 | 1.288 ± 0.426 | 0.276 ± 0.17 | 1.104 ± 0.508 | 0.276 ± 0.145 | 0.92 ± 0.333 | 0.736 ± 0.249 | 0.828 ± 0.268 | 0.736 ± 0.285 | 0.828 ± 0.307 | 0.552 ± 0.233 | 0.46 ± 0.219 | 0.0 ± 0.0
Tyr | 2.023 ± 0.417 | 0.276 ± 0.158 | 2.483 ± 0.471 | 1.931 ± 0.43 | 2.115 ± 0.357 | 2.207 ± 0.541 | 0.644 ± 0.221 | 2.391 ± 0.558 | 3.127 ± 0.502 | 3.771 ± 0.777 | 1.104 ± 0.279 | 1.839 ± 0.358 | 1.288 ± 0.314 | 1.839 ± 0.454 | 2.207 ± 0.454 | 3.495 ± 0.558 | 1.655 ± 0.495 | 1.931 ± 0.36 | 0.644 ± 0.231 | 1.655 ± 0.379 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 60 proteins (10874 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
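
A protein-level bootstrap of this kind might look like the sketch below. It is a minimal illustration under stated assumptions, not the procedure actually used by the database: whole proteins are resampled with replacement 100 times, the frequency is recomputed on each resample, and the reported error is taken to be the standard deviation of those estimates. The function names are hypothetical, and pairs are written in one-letter code (for example "AK" for AlaLys) to keep the example short.

```python
import random
import statistics
from collections import Counter


def pair_frequency(proteins, pair):
    """Per mille frequency of one two-letter pair over overlapping positions."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Estimate the error of a pair frequency by resampling whole proteins.

    Assumption: the error is the standard deviation across bootstrap rounds.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)
```

Resampling at the protein level, rather than at the level of individual dipeptides, keeps each protein's composition intact, so the error reflects variation between proteins in the proteome.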

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski