Amino acid dipepetide frequency for Lactococcus phage 98101

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.667AlaAla: 2.667 ± 0.627
0.762AlaCys: 0.762 ± 0.278
3.811AlaAsp: 3.811 ± 0.73
4.477AlaGlu: 4.477 ± 0.973
2.763AlaPhe: 2.763 ± 0.458
3.334AlaGly: 3.334 ± 0.777
0.762AlaHis: 0.762 ± 0.258
5.049AlaIle: 5.049 ± 0.956
4.954AlaLys: 4.954 ± 0.601
5.906AlaLeu: 5.906 ± 0.671
1.62AlaMet: 1.62 ± 0.301
4.859AlaAsn: 4.859 ± 0.543
1.715AlaPro: 1.715 ± 0.424
3.048AlaGln: 3.048 ± 0.639
2.667AlaArg: 2.667 ± 0.521
3.144AlaSer: 3.144 ± 0.524
3.811AlaThr: 3.811 ± 0.686
3.62AlaVal: 3.62 ± 0.694
1.81AlaTrp: 1.81 ± 0.509
2.096AlaTyr: 2.096 ± 0.372
0.0AlaXaa: 0.0 ± 0.0
Cys
0.191CysAla: 0.191 ± 0.147
0.0CysCys: 0.0 ± 0.0
1.048CysAsp: 1.048 ± 0.335
0.572CysGlu: 0.572 ± 0.253
0.286CysPhe: 0.286 ± 0.164
0.476CysGly: 0.476 ± 0.256
0.381CysHis: 0.381 ± 0.296
0.095CysIle: 0.095 ± 0.097
0.572CysLys: 0.572 ± 0.241
0.476CysLeu: 0.476 ± 0.231
0.381CysMet: 0.381 ± 0.215
0.191CysAsn: 0.191 ± 0.141
0.286CysPro: 0.286 ± 0.177
0.0CysGln: 0.0 ± 0.0
0.191CysArg: 0.191 ± 0.111
0.857CysSer: 0.857 ± 0.293
0.476CysThr: 0.476 ± 0.178
0.476CysVal: 0.476 ± 0.212
0.095CysTrp: 0.095 ± 0.094
0.191CysTyr: 0.191 ± 0.116
0.0CysXaa: 0.0 ± 0.0
Asp
2.858AspAla: 2.858 ± 0.504
0.381AspCys: 0.381 ± 0.192
3.906AspAsp: 3.906 ± 0.76
5.811AspGlu: 5.811 ± 0.93
3.144AspPhe: 3.144 ± 0.576
5.525AspGly: 5.525 ± 0.835
0.476AspHis: 0.476 ± 0.177
4.859AspIle: 4.859 ± 0.635
5.525AspLys: 5.525 ± 0.71
4.001AspLeu: 4.001 ± 0.599
1.524AspMet: 1.524 ± 0.417
3.048AspAsn: 3.048 ± 0.482
1.048AspPro: 1.048 ± 0.343
1.334AspGln: 1.334 ± 0.299
2.572AspArg: 2.572 ± 0.365
4.573AspSer: 4.573 ± 0.654
4.192AspThr: 4.192 ± 0.597
3.715AspVal: 3.715 ± 0.527
1.143AspTrp: 1.143 ± 0.293
2.096AspTyr: 2.096 ± 0.364
0.0AspXaa: 0.0 ± 0.0
Glu
3.62GluAla: 3.62 ± 0.675
0.286GluCys: 0.286 ± 0.148
2.763GluAsp: 2.763 ± 0.542
6.383GluGlu: 6.383 ± 1.213
4.382GluPhe: 4.382 ± 0.579
3.239GluGly: 3.239 ± 0.669
0.857GluHis: 0.857 ± 0.272
5.24GluIle: 5.24 ± 0.797
7.812GluLys: 7.812 ± 1.532
7.621GluLeu: 7.621 ± 0.887
1.81GluMet: 1.81 ± 0.39
4.477GluAsn: 4.477 ± 0.766
2.477GluPro: 2.477 ± 0.571
3.334GluGln: 3.334 ± 0.547
3.334GluArg: 3.334 ± 0.534
3.43GluSer: 3.43 ± 0.534
4.859GluThr: 4.859 ± 0.709
6.383GluVal: 6.383 ± 1.05
1.143GluTrp: 1.143 ± 0.316
4.192GluTyr: 4.192 ± 0.737
0.0GluXaa: 0.0 ± 0.0
Phe
2.382PheAla: 2.382 ± 0.485
0.667PheCys: 0.667 ± 0.231
3.62PheAsp: 3.62 ± 0.485
3.239PheGlu: 3.239 ± 0.635
1.62PhePhe: 1.62 ± 0.397
2.763PheGly: 2.763 ± 0.572
0.572PheHis: 0.572 ± 0.238
2.953PheIle: 2.953 ± 0.538
4.573PheLys: 4.573 ± 0.687
2.191PheLeu: 2.191 ± 0.459
1.524PheMet: 1.524 ± 0.501
3.334PheAsn: 3.334 ± 0.638
0.667PhePro: 0.667 ± 0.342
1.715PheGln: 1.715 ± 0.383
1.048PheArg: 1.048 ± 0.329
3.43PheSer: 3.43 ± 0.52
3.144PheThr: 3.144 ± 0.591
2.477PheVal: 2.477 ± 0.599
0.381PheTrp: 0.381 ± 0.187
1.62PheTyr: 1.62 ± 0.396
0.0PheXaa: 0.0 ± 0.0
Gly
3.334GlyAla: 3.334 ± 0.718
0.476GlyCys: 0.476 ± 0.171
2.953GlyAsp: 2.953 ± 0.461
3.811GlyGlu: 3.811 ± 0.681
2.572GlyPhe: 2.572 ± 0.454
4.477GlyGly: 4.477 ± 0.905
0.572GlyHis: 0.572 ± 0.209
5.144GlyIle: 5.144 ± 0.593
5.716GlyLys: 5.716 ± 0.656
5.621GlyLeu: 5.621 ± 1.25
1.905GlyMet: 1.905 ± 0.523
3.239GlyAsn: 3.239 ± 0.758
0.762GlyPro: 0.762 ± 0.346
2.953GlyGln: 2.953 ± 0.749
2.667GlyArg: 2.667 ± 0.522
4.096GlySer: 4.096 ± 0.808
4.287GlyThr: 4.287 ± 0.742
3.715GlyVal: 3.715 ± 0.761
1.238GlyTrp: 1.238 ± 0.344
3.62GlyTyr: 3.62 ± 0.663
0.0GlyXaa: 0.0 ± 0.0
His
1.143HisAla: 1.143 ± 0.301
0.095HisCys: 0.095 ± 0.106
0.857HisAsp: 0.857 ± 0.243
1.62HisGlu: 1.62 ± 0.378
0.857HisPhe: 0.857 ± 0.278
0.857HisGly: 0.857 ± 0.288
0.191HisHis: 0.191 ± 0.142
0.476HisIle: 0.476 ± 0.181
0.762HisLys: 0.762 ± 0.239
0.857HisLeu: 0.857 ± 0.291
0.191HisMet: 0.191 ± 0.153
0.762HisAsn: 0.762 ± 0.295
0.381HisPro: 0.381 ± 0.176
0.572HisGln: 0.572 ± 0.253
0.476HisArg: 0.476 ± 0.191
0.857HisSer: 0.857 ± 0.273
0.381HisThr: 0.381 ± 0.23
0.572HisVal: 0.572 ± 0.178
0.286HisTrp: 0.286 ± 0.172
0.667HisTyr: 0.667 ± 0.244
0.0HisXaa: 0.0 ± 0.0
Ile
5.144IleAla: 5.144 ± 0.826
0.286IleCys: 0.286 ± 0.143
3.906IleAsp: 3.906 ± 0.708
5.811IleGlu: 5.811 ± 0.823
2.382IlePhe: 2.382 ± 0.482
3.811IleGly: 3.811 ± 0.603
0.667IleHis: 0.667 ± 0.329
3.525IleIle: 3.525 ± 0.652
7.145IleLys: 7.145 ± 0.752
4.192IleLeu: 4.192 ± 0.631
1.62IleMet: 1.62 ± 0.343
5.144IleAsn: 5.144 ± 0.903
1.62IlePro: 1.62 ± 0.37
2.763IleGln: 2.763 ± 0.633
2.191IleArg: 2.191 ± 0.37
4.477IleSer: 4.477 ± 0.695
4.573IleThr: 4.573 ± 0.55
3.62IleVal: 3.62 ± 0.674
0.572IleTrp: 0.572 ± 0.265
2.382IleTyr: 2.382 ± 0.586
0.0IleXaa: 0.0 ± 0.0
Lys
7.335LysAla: 7.335 ± 1.216
0.191LysCys: 0.191 ± 0.188
5.43LysAsp: 5.43 ± 0.608
7.05LysGlu: 7.05 ± 1.108
2.763LysPhe: 2.763 ± 0.476
5.621LysGly: 5.621 ± 0.808
1.905LysHis: 1.905 ± 0.509
6.478LysIle: 6.478 ± 0.7
8.764LysLys: 8.764 ± 1.159
7.526LysLeu: 7.526 ± 0.853
2.286LysMet: 2.286 ± 0.475
6.002LysAsn: 6.002 ± 0.857
2.286LysPro: 2.286 ± 0.476
4.954LysGln: 4.954 ± 0.848
3.62LysArg: 3.62 ± 0.706
5.43LysSer: 5.43 ± 0.733
4.763LysThr: 4.763 ± 0.78
4.001LysVal: 4.001 ± 0.734
0.857LysTrp: 0.857 ± 0.337
3.525LysTyr: 3.525 ± 0.649
0.0LysXaa: 0.0 ± 0.0
Leu
4.668LeuAla: 4.668 ± 0.682
0.953LeuCys: 0.953 ± 0.321
5.811LeuAsp: 5.811 ± 0.715
5.906LeuGlu: 5.906 ± 0.965
3.144LeuPhe: 3.144 ± 0.491
4.573LeuGly: 4.573 ± 0.514
0.857LeuHis: 0.857 ± 0.266
4.573LeuIle: 4.573 ± 0.625
7.335LeuLys: 7.335 ± 0.919
6.478LeuLeu: 6.478 ± 1.011
2.191LeuMet: 2.191 ± 0.47
5.525LeuAsn: 5.525 ± 0.614
3.048LeuPro: 3.048 ± 0.541
3.811LeuGln: 3.811 ± 0.683
2.191LeuArg: 2.191 ± 0.439
6.573LeuSer: 6.573 ± 0.723
5.43LeuThr: 5.43 ± 0.818
3.525LeuVal: 3.525 ± 0.546
1.429LeuTrp: 1.429 ± 0.681
2.191LeuTyr: 2.191 ± 0.411
0.0LeuXaa: 0.0 ± 0.0
Met
2.858MetAla: 2.858 ± 0.525
0.191MetCys: 0.191 ± 0.147
1.334MetAsp: 1.334 ± 0.352
2.001MetGlu: 2.001 ± 0.658
0.572MetPhe: 0.572 ± 0.179
1.238MetGly: 1.238 ± 0.36
0.381MetHis: 0.381 ± 0.21
1.429MetIle: 1.429 ± 0.417
2.001MetLys: 2.001 ± 0.437
2.096MetLeu: 2.096 ± 0.469
0.572MetMet: 0.572 ± 0.236
1.905MetAsn: 1.905 ± 0.388
0.476MetPro: 0.476 ± 0.298
1.238MetGln: 1.238 ± 0.319
1.048MetArg: 1.048 ± 0.353
1.715MetSer: 1.715 ± 0.431
3.144MetThr: 3.144 ± 0.661
0.762MetVal: 0.762 ± 0.278
0.286MetTrp: 0.286 ± 0.163
0.572MetTyr: 0.572 ± 0.253
0.0MetXaa: 0.0 ± 0.0
Asn
4.573AsnAla: 4.573 ± 0.731
0.381AsnCys: 0.381 ± 0.174
3.43AsnAsp: 3.43 ± 0.523
4.001AsnGlu: 4.001 ± 0.724
2.572AsnPhe: 2.572 ± 0.438
6.192AsnGly: 6.192 ± 1.044
0.476AsnHis: 0.476 ± 0.263
3.334AsnIle: 3.334 ± 0.576
5.24AsnLys: 5.24 ± 0.537
6.288AsnLeu: 6.288 ± 0.634
1.429AsnMet: 1.429 ± 0.412
4.192AsnAsn: 4.192 ± 0.65
1.905AsnPro: 1.905 ± 0.396
2.953AsnGln: 2.953 ± 0.395
1.905AsnArg: 1.905 ± 0.325
4.287AsnSer: 4.287 ± 0.733
3.048AsnThr: 3.048 ± 0.585
3.715AsnVal: 3.715 ± 0.557
0.857AsnTrp: 0.857 ± 0.201
2.382AsnTyr: 2.382 ± 0.463
0.0AsnXaa: 0.0 ± 0.0
Pro
1.048ProAla: 1.048 ± 0.307
0.095ProCys: 0.095 ± 0.109
1.81ProAsp: 1.81 ± 0.512
2.572ProGlu: 2.572 ± 0.404
1.238ProPhe: 1.238 ± 0.337
0.857ProGly: 0.857 ± 0.247
0.762ProHis: 0.762 ± 0.289
1.334ProIle: 1.334 ± 0.423
2.477ProLys: 2.477 ± 0.53
2.191ProLeu: 2.191 ± 0.495
0.572ProMet: 0.572 ± 0.19
1.715ProAsn: 1.715 ± 0.404
0.476ProPro: 0.476 ± 0.171
1.143ProGln: 1.143 ± 0.308
0.572ProArg: 0.572 ± 0.245
1.238ProSer: 1.238 ± 0.345
2.001ProThr: 2.001 ± 0.461
2.001ProVal: 2.001 ± 0.443
0.191ProTrp: 0.191 ± 0.143
0.953ProTyr: 0.953 ± 0.266
0.0ProXaa: 0.0 ± 0.0
Gln
4.096GlnAla: 4.096 ± 0.598
0.476GlnCys: 0.476 ± 0.212
0.953GlnAsp: 0.953 ± 0.398
3.906GlnGlu: 3.906 ± 0.53
1.429GlnPhe: 1.429 ± 0.385
2.572GlnGly: 2.572 ± 0.655
0.191GlnHis: 0.191 ± 0.116
3.144GlnIle: 3.144 ± 0.554
3.334GlnLys: 3.334 ± 0.611
3.144GlnLeu: 3.144 ± 0.506
1.238GlnMet: 1.238 ± 0.375
2.572GlnAsn: 2.572 ± 0.437
1.524GlnPro: 1.524 ± 0.404
2.286GlnGln: 2.286 ± 0.481
1.62GlnArg: 1.62 ± 0.379
1.81GlnSer: 1.81 ± 0.542
2.953GlnThr: 2.953 ± 0.525
3.239GlnVal: 3.239 ± 0.582
0.667GlnTrp: 0.667 ± 0.228
1.81GlnTyr: 1.81 ± 0.338
0.0GlnXaa: 0.0 ± 0.0
Arg
2.572ArgAla: 2.572 ± 0.448
0.286ArgCys: 0.286 ± 0.214
2.382ArgAsp: 2.382 ± 0.497
2.953ArgGlu: 2.953 ± 0.674
2.001ArgPhe: 2.001 ± 0.444
1.81ArgGly: 1.81 ± 0.441
0.191ArgHis: 0.191 ± 0.127
2.382ArgIle: 2.382 ± 0.421
4.287ArgLys: 4.287 ± 0.556
4.192ArgLeu: 4.192 ± 0.805
1.334ArgMet: 1.334 ± 0.309
1.905ArgAsn: 1.905 ± 0.43
0.857ArgPro: 0.857 ± 0.343
1.143ArgGln: 1.143 ± 0.288
1.524ArgArg: 1.524 ± 0.47
2.191ArgSer: 2.191 ± 0.487
1.905ArgThr: 1.905 ± 0.368
2.286ArgVal: 2.286 ± 0.435
0.381ArgTrp: 0.381 ± 0.194
1.143ArgTyr: 1.143 ± 0.391
0.0ArgXaa: 0.0 ± 0.0
Ser
4.001SerAla: 4.001 ± 0.955
0.572SerCys: 0.572 ± 0.279
5.811SerAsp: 5.811 ± 0.501
4.573SerGlu: 4.573 ± 0.811
3.62SerPhe: 3.62 ± 0.644
4.763SerGly: 4.763 ± 0.76
1.143SerHis: 1.143 ± 0.291
3.334SerIle: 3.334 ± 0.475
4.001SerLys: 4.001 ± 0.611
4.096SerLeu: 4.096 ± 0.593
1.429SerMet: 1.429 ± 0.361
4.096SerAsn: 4.096 ± 0.602
1.143SerPro: 1.143 ± 0.347
2.572SerGln: 2.572 ± 0.491
2.191SerArg: 2.191 ± 0.362
4.287SerSer: 4.287 ± 0.803
3.906SerThr: 3.906 ± 0.495
4.477SerVal: 4.477 ± 0.524
0.857SerTrp: 0.857 ± 0.227
2.477SerTyr: 2.477 ± 0.445
0.0SerXaa: 0.0 ± 0.0
Thr
4.763ThrAla: 4.763 ± 0.709
0.476ThrCys: 0.476 ± 0.243
4.001ThrAsp: 4.001 ± 0.521
4.763ThrGlu: 4.763 ± 0.726
3.144ThrPhe: 3.144 ± 0.498
4.859ThrGly: 4.859 ± 0.749
0.572ThrHis: 0.572 ± 0.247
4.668ThrIle: 4.668 ± 0.846
5.811ThrLys: 5.811 ± 0.692
4.763ThrLeu: 4.763 ± 0.575
1.238ThrMet: 1.238 ± 0.384
3.239ThrAsn: 3.239 ± 0.554
1.429ThrPro: 1.429 ± 0.324
1.715ThrGln: 1.715 ± 0.396
3.048ThrArg: 3.048 ± 0.436
3.144ThrSer: 3.144 ± 0.541
4.287ThrThr: 4.287 ± 0.486
5.049ThrVal: 5.049 ± 0.838
0.667ThrTrp: 0.667 ± 0.281
1.81ThrTyr: 1.81 ± 0.473
0.0ThrXaa: 0.0 ± 0.0
Val
3.048ValAla: 3.048 ± 0.519
0.286ValCys: 0.286 ± 0.149
4.382ValAsp: 4.382 ± 0.853
5.049ValGlu: 5.049 ± 0.805
2.763ValPhe: 2.763 ± 0.476
3.144ValGly: 3.144 ± 0.63
1.048ValHis: 1.048 ± 0.334
3.715ValIle: 3.715 ± 0.494
6.192ValLys: 6.192 ± 0.836
4.477ValLeu: 4.477 ± 0.579
1.334ValMet: 1.334 ± 0.393
3.715ValAsn: 3.715 ± 0.652
1.524ValPro: 1.524 ± 0.359
2.286ValGln: 2.286 ± 0.524
2.001ValArg: 2.001 ± 0.466
4.573ValSer: 4.573 ± 0.6
4.192ValThr: 4.192 ± 0.861
4.573ValVal: 4.573 ± 0.688
0.762ValTrp: 0.762 ± 0.199
1.62ValTyr: 1.62 ± 0.403
0.0ValXaa: 0.0 ± 0.0
Trp
1.238TrpAla: 1.238 ± 0.272
0.095TrpCys: 0.095 ± 0.096
0.953TrpAsp: 0.953 ± 0.477
1.048TrpGlu: 1.048 ± 0.336
0.667TrpPhe: 0.667 ± 0.199
0.572TrpGly: 0.572 ± 0.281
0.286TrpHis: 0.286 ± 0.16
1.524TrpIle: 1.524 ± 0.303
1.334TrpLys: 1.334 ± 0.402
0.857TrpLeu: 0.857 ± 0.298
0.095TrpMet: 0.095 ± 0.106
1.143TrpAsn: 1.143 ± 0.486
0.095TrpPro: 0.095 ± 0.083
0.953TrpGln: 0.953 ± 0.305
0.953TrpArg: 0.953 ± 0.266
0.762TrpSer: 0.762 ± 0.231
0.572TrpThr: 0.572 ± 0.211
0.667TrpVal: 0.667 ± 0.32
0.381TrpTrp: 0.381 ± 0.185
0.286TrpTyr: 0.286 ± 0.187
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.334TyrAla: 1.334 ± 0.362
0.286TyrCys: 0.286 ± 0.154
2.572TyrAsp: 2.572 ± 0.483
2.096TyrGlu: 2.096 ± 0.504
2.001TyrPhe: 2.001 ± 0.431
2.477TyrGly: 2.477 ± 0.571
0.572TyrHis: 0.572 ± 0.197
2.572TyrIle: 2.572 ± 0.497
2.953TyrLys: 2.953 ± 0.523
3.334TyrLeu: 3.334 ± 0.757
1.429TyrMet: 1.429 ± 0.304
2.096TyrAsn: 2.096 ± 0.512
1.524TyrPro: 1.524 ± 0.305
2.191TyrGln: 2.191 ± 0.506
1.905TyrArg: 1.905 ± 0.466
2.477TyrSer: 2.477 ± 0.385
1.524TyrThr: 1.524 ± 0.387
1.81TyrVal: 1.81 ± 0.379
0.476TyrTrp: 0.476 ± 0.231
1.238TyrTyr: 1.238 ± 0.346
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 55 proteins (10498 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski