Amino acid dipepetide frequency for Lactococcus phage phismq86

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.244AlaAla: 4.244 ± 0.853
0.283AlaCys: 0.283 ± 0.17
4.15AlaAsp: 4.15 ± 0.626
3.773AlaGlu: 3.773 ± 0.656
1.792AlaPhe: 1.792 ± 0.336
5.282AlaGly: 5.282 ± 0.777
1.226AlaHis: 1.226 ± 0.331
4.527AlaIle: 4.527 ± 0.958
4.81AlaLys: 4.81 ± 0.509
6.979AlaLeu: 6.979 ± 1.265
1.698AlaMet: 1.698 ± 0.38
3.773AlaAsn: 3.773 ± 0.572
2.169AlaPro: 2.169 ± 0.465
3.018AlaGln: 3.018 ± 0.47
1.792AlaArg: 1.792 ± 0.427
3.49AlaSer: 3.49 ± 0.536
4.15AlaThr: 4.15 ± 0.837
4.621AlaVal: 4.621 ± 0.485
0.755AlaTrp: 0.755 ± 0.22
2.075AlaTyr: 2.075 ± 0.464
0.0AlaXaa: 0.0 ± 0.0
Cys
0.094CysAla: 0.094 ± 0.094
0.0CysCys: 0.0 ± 0.0
0.66CysAsp: 0.66 ± 0.273
0.943CysGlu: 0.943 ± 0.273
0.472CysPhe: 0.472 ± 0.2
0.755CysGly: 0.755 ± 0.376
0.377CysHis: 0.377 ± 0.318
0.189CysIle: 0.189 ± 0.124
0.283CysLys: 0.283 ± 0.189
0.189CysLeu: 0.189 ± 0.129
0.094CysMet: 0.094 ± 0.103
0.0CysAsn: 0.0 ± 0.0
0.094CysPro: 0.094 ± 0.095
0.0CysGln: 0.0 ± 0.0
0.283CysArg: 0.283 ± 0.157
0.472CysSer: 0.472 ± 0.235
0.283CysThr: 0.283 ± 0.176
0.094CysVal: 0.094 ± 0.088
0.094CysTrp: 0.094 ± 0.103
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.301AspAla: 3.301 ± 0.56
0.566AspCys: 0.566 ± 0.257
3.961AspAsp: 3.961 ± 0.574
4.999AspGlu: 4.999 ± 0.568
3.49AspPhe: 3.49 ± 0.526
6.13AspGly: 6.13 ± 1.235
0.566AspHis: 0.566 ± 0.233
4.244AspIle: 4.244 ± 0.563
5.942AspLys: 5.942 ± 0.749
3.961AspLeu: 3.961 ± 0.632
1.792AspMet: 1.792 ± 0.391
3.867AspAsn: 3.867 ± 0.449
1.226AspPro: 1.226 ± 0.388
0.943AspGln: 0.943 ± 0.321
2.264AspArg: 2.264 ± 0.331
4.621AspSer: 4.621 ± 0.632
4.244AspThr: 4.244 ± 0.567
4.15AspVal: 4.15 ± 0.596
0.472AspTrp: 0.472 ± 0.2
3.018AspTyr: 3.018 ± 0.575
0.0AspXaa: 0.0 ± 0.0
Glu
4.055GluAla: 4.055 ± 0.689
0.377GluCys: 0.377 ± 0.199
3.395GluAsp: 3.395 ± 0.651
6.225GluGlu: 6.225 ± 1.19
3.867GluPhe: 3.867 ± 0.622
2.735GluGly: 2.735 ± 0.431
1.226GluHis: 1.226 ± 0.335
5.753GluIle: 5.753 ± 0.738
6.13GluLys: 6.13 ± 1.157
7.451GluLeu: 7.451 ± 0.892
2.452GluMet: 2.452 ± 0.526
4.244GluAsn: 4.244 ± 0.785
1.792GluPro: 1.792 ± 0.528
3.49GluGln: 3.49 ± 0.567
3.207GluArg: 3.207 ± 0.724
4.716GluSer: 4.716 ± 0.847
4.244GluThr: 4.244 ± 0.598
4.81GluVal: 4.81 ± 0.964
1.32GluTrp: 1.32 ± 0.346
2.735GluTyr: 2.735 ± 0.483
0.0GluXaa: 0.0 ± 0.0
Phe
2.169PheAla: 2.169 ± 0.446
0.189PheCys: 0.189 ± 0.136
2.924PheAsp: 2.924 ± 0.569
3.678PheGlu: 3.678 ± 0.652
1.603PhePhe: 1.603 ± 0.359
2.452PheGly: 2.452 ± 0.501
0.283PheHis: 0.283 ± 0.161
3.301PheIle: 3.301 ± 0.486
3.49PheLys: 3.49 ± 0.447
2.641PheLeu: 2.641 ± 0.514
1.32PheMet: 1.32 ± 0.378
3.018PheAsn: 3.018 ± 0.405
1.415PhePro: 1.415 ± 0.335
1.698PheGln: 1.698 ± 0.485
1.132PheArg: 1.132 ± 0.401
2.358PheSer: 2.358 ± 0.416
2.264PheThr: 2.264 ± 0.397
2.829PheVal: 2.829 ± 0.43
0.566PheTrp: 0.566 ± 0.213
2.075PheTyr: 2.075 ± 0.396
0.0PheXaa: 0.0 ± 0.0
Gly
3.018GlyAla: 3.018 ± 0.519
0.472GlyCys: 0.472 ± 0.194
3.961GlyAsp: 3.961 ± 0.525
3.395GlyGlu: 3.395 ± 0.644
3.584GlyPhe: 3.584 ± 0.641
4.15GlyGly: 4.15 ± 0.664
1.226GlyHis: 1.226 ± 0.316
6.225GlyIle: 6.225 ± 0.732
5.187GlyLys: 5.187 ± 0.654
5.659GlyLeu: 5.659 ± 0.868
1.037GlyMet: 1.037 ± 0.297
3.773GlyAsn: 3.773 ± 0.504
1.32GlyPro: 1.32 ± 0.411
2.829GlyGln: 2.829 ± 0.478
2.169GlyArg: 2.169 ± 0.403
5.093GlySer: 5.093 ± 0.772
5.847GlyThr: 5.847 ± 1.007
3.867GlyVal: 3.867 ± 0.57
0.755GlyTrp: 0.755 ± 0.282
2.264GlyTyr: 2.264 ± 0.42
0.0GlyXaa: 0.0 ± 0.0
His
0.566HisAla: 0.566 ± 0.265
0.189HisCys: 0.189 ± 0.133
0.849HisAsp: 0.849 ± 0.251
1.603HisGlu: 1.603 ± 0.351
0.377HisPhe: 0.377 ± 0.173
1.037HisGly: 1.037 ± 0.271
0.377HisHis: 0.377 ± 0.172
0.943HisIle: 0.943 ± 0.306
0.849HisLys: 0.849 ± 0.242
0.943HisLeu: 0.943 ± 0.274
0.377HisMet: 0.377 ± 0.184
0.566HisAsn: 0.566 ± 0.233
0.566HisPro: 0.566 ± 0.23
0.377HisGln: 0.377 ± 0.197
0.472HisArg: 0.472 ± 0.176
1.037HisSer: 1.037 ± 0.338
0.566HisThr: 0.566 ± 0.213
0.943HisVal: 0.943 ± 0.36
0.094HisTrp: 0.094 ± 0.084
0.849HisTyr: 0.849 ± 0.34
0.0HisXaa: 0.0 ± 0.0
Ile
5.187IleAla: 5.187 ± 0.668
0.283IleCys: 0.283 ± 0.172
5.47IleAsp: 5.47 ± 0.675
5.282IleGlu: 5.282 ± 0.875
2.075IlePhe: 2.075 ± 0.428
4.244IleGly: 4.244 ± 0.808
1.32IleHis: 1.32 ± 0.402
5.753IleIle: 5.753 ± 1.594
6.602IleLys: 6.602 ± 0.785
5.47IleLeu: 5.47 ± 0.688
1.603IleMet: 1.603 ± 0.457
4.999IleAsn: 4.999 ± 0.633
3.018IlePro: 3.018 ± 0.663
3.018IleGln: 3.018 ± 0.497
3.018IleArg: 3.018 ± 0.597
5.942IleSer: 5.942 ± 0.724
4.433IleThr: 4.433 ± 0.884
3.773IleVal: 3.773 ± 0.668
0.283IleTrp: 0.283 ± 0.157
1.415IleTyr: 1.415 ± 0.32
0.0IleXaa: 0.0 ± 0.0
Lys
6.319LysAla: 6.319 ± 0.956
0.472LysCys: 0.472 ± 0.242
4.527LysAsp: 4.527 ± 0.643
7.545LysGlu: 7.545 ± 1.327
3.584LysPhe: 3.584 ± 0.627
4.716LysGly: 4.716 ± 0.601
0.66LysHis: 0.66 ± 0.281
5.847LysIle: 5.847 ± 0.691
7.168LysLys: 7.168 ± 1.249
7.168LysLeu: 7.168 ± 0.863
3.112LysMet: 3.112 ± 0.497
4.621LysAsn: 4.621 ± 0.669
1.792LysPro: 1.792 ± 0.509
3.773LysGln: 3.773 ± 0.716
3.018LysArg: 3.018 ± 0.501
5.282LysSer: 5.282 ± 0.817
4.15LysThr: 4.15 ± 0.586
4.433LysVal: 4.433 ± 0.52
1.415LysTrp: 1.415 ± 0.401
2.641LysTyr: 2.641 ± 0.437
0.0LysXaa: 0.0 ± 0.0
Leu
5.093LeuAla: 5.093 ± 0.67
0.377LeuCys: 0.377 ± 0.185
4.15LeuAsp: 4.15 ± 0.577
6.225LeuGlu: 6.225 ± 0.885
3.395LeuPhe: 3.395 ± 0.478
5.093LeuGly: 5.093 ± 0.781
0.283LeuHis: 0.283 ± 0.156
5.564LeuIle: 5.564 ± 1.1
6.319LeuLys: 6.319 ± 1.039
5.753LeuLeu: 5.753 ± 0.894
1.886LeuMet: 1.886 ± 0.474
4.338LeuAsn: 4.338 ± 0.696
3.773LeuPro: 3.773 ± 0.97
2.641LeuGln: 2.641 ± 0.508
3.678LeuArg: 3.678 ± 0.655
6.508LeuSer: 6.508 ± 0.756
6.696LeuThr: 6.696 ± 0.733
4.716LeuVal: 4.716 ± 0.713
1.037LeuTrp: 1.037 ± 0.322
1.603LeuTyr: 1.603 ± 0.462
0.0LeuXaa: 0.0 ± 0.0
Met
1.603MetAla: 1.603 ± 0.426
0.0MetCys: 0.0 ± 0.0
1.698MetAsp: 1.698 ± 0.41
1.603MetGlu: 1.603 ± 0.363
1.132MetPhe: 1.132 ± 0.303
1.981MetGly: 1.981 ± 0.37
0.283MetHis: 0.283 ± 0.154
1.792MetIle: 1.792 ± 0.407
2.075MetLys: 2.075 ± 0.468
1.32MetLeu: 1.32 ± 0.343
0.283MetMet: 0.283 ± 0.163
1.698MetAsn: 1.698 ± 0.351
0.189MetPro: 0.189 ± 0.103
1.226MetGln: 1.226 ± 0.352
0.943MetArg: 0.943 ± 0.256
1.886MetSer: 1.886 ± 0.495
3.018MetThr: 3.018 ± 0.398
0.849MetVal: 0.849 ± 0.274
0.283MetTrp: 0.283 ± 0.142
0.943MetTyr: 0.943 ± 0.327
0.0MetXaa: 0.0 ± 0.0
Asn
3.678AsnAla: 3.678 ± 0.724
0.094AsnCys: 0.094 ± 0.073
3.773AsnAsp: 3.773 ± 0.595
3.584AsnGlu: 3.584 ± 0.624
3.301AsnPhe: 3.301 ± 0.508
5.753AsnGly: 5.753 ± 0.915
0.66AsnHis: 0.66 ± 0.263
4.904AsnIle: 4.904 ± 0.678
4.055AsnLys: 4.055 ± 0.533
4.055AsnLeu: 4.055 ± 0.546
0.755AsnMet: 0.755 ± 0.294
4.244AsnAsn: 4.244 ± 0.682
2.358AsnPro: 2.358 ± 0.558
2.641AsnGln: 2.641 ± 0.489
2.169AsnArg: 2.169 ± 0.448
3.49AsnSer: 3.49 ± 0.635
3.112AsnThr: 3.112 ± 0.513
2.829AsnVal: 2.829 ± 0.419
0.66AsnTrp: 0.66 ± 0.2
2.169AsnTyr: 2.169 ± 0.419
0.0AsnXaa: 0.0 ± 0.0
Pro
1.415ProAla: 1.415 ± 0.416
0.094ProCys: 0.094 ± 0.105
1.792ProAsp: 1.792 ± 0.396
1.698ProGlu: 1.698 ± 0.399
1.037ProPhe: 1.037 ± 0.301
0.66ProGly: 0.66 ± 0.225
0.566ProHis: 0.566 ± 0.193
1.698ProIle: 1.698 ± 0.325
1.981ProLys: 1.981 ± 0.559
2.075ProLeu: 2.075 ± 0.592
0.755ProMet: 0.755 ± 0.247
1.509ProAsn: 1.509 ± 0.347
1.037ProPro: 1.037 ± 0.348
2.829ProGln: 2.829 ± 0.62
1.226ProArg: 1.226 ± 0.397
2.452ProSer: 2.452 ± 0.383
2.829ProThr: 2.829 ± 0.804
1.792ProVal: 1.792 ± 0.413
0.566ProTrp: 0.566 ± 0.227
1.415ProTyr: 1.415 ± 0.366
0.0ProXaa: 0.0 ± 0.0
Gln
4.055GlnAla: 4.055 ± 0.865
0.189GlnCys: 0.189 ± 0.124
1.698GlnAsp: 1.698 ± 0.382
2.924GlnGlu: 2.924 ± 0.55
1.603GlnPhe: 1.603 ± 0.38
2.924GlnGly: 2.924 ± 0.581
0.566GlnHis: 0.566 ± 0.183
3.207GlnIle: 3.207 ± 0.636
3.395GlnLys: 3.395 ± 0.467
3.49GlnLeu: 3.49 ± 0.629
1.886GlnMet: 1.886 ± 0.378
2.264GlnAsn: 2.264 ± 0.604
1.32GlnPro: 1.32 ± 0.31
1.886GlnGln: 1.886 ± 0.427
1.981GlnArg: 1.981 ± 0.637
2.264GlnSer: 2.264 ± 0.354
2.264GlnThr: 2.264 ± 0.398
2.264GlnVal: 2.264 ± 0.425
0.283GlnTrp: 0.283 ± 0.18
0.755GlnTyr: 0.755 ± 0.242
0.0GlnXaa: 0.0 ± 0.0
Arg
2.075ArgAla: 2.075 ± 0.467
0.566ArgCys: 0.566 ± 0.27
3.018ArgAsp: 3.018 ± 0.537
3.207ArgGlu: 3.207 ± 0.537
1.132ArgPhe: 1.132 ± 0.291
1.792ArgGly: 1.792 ± 0.458
0.66ArgHis: 0.66 ± 0.249
2.735ArgIle: 2.735 ± 0.503
4.244ArgLys: 4.244 ± 0.751
3.49ArgLeu: 3.49 ± 0.609
1.32ArgMet: 1.32 ± 0.304
1.603ArgAsn: 1.603 ± 0.392
0.849ArgPro: 0.849 ± 0.297
1.698ArgGln: 1.698 ± 0.395
1.981ArgArg: 1.981 ± 0.391
1.415ArgSer: 1.415 ± 0.361
1.792ArgThr: 1.792 ± 0.422
2.641ArgVal: 2.641 ± 0.517
0.943ArgTrp: 0.943 ± 0.31
1.226ArgTyr: 1.226 ± 0.384
0.0ArgXaa: 0.0 ± 0.0
Ser
4.338SerAla: 4.338 ± 0.686
0.189SerCys: 0.189 ± 0.133
5.282SerAsp: 5.282 ± 0.835
4.055SerGlu: 4.055 ± 0.63
3.49SerPhe: 3.49 ± 0.466
5.187SerGly: 5.187 ± 0.593
1.32SerHis: 1.32 ± 0.401
4.433SerIle: 4.433 ± 0.769
6.036SerLys: 6.036 ± 0.781
5.187SerLeu: 5.187 ± 0.668
1.132SerMet: 1.132 ± 0.301
4.433SerAsn: 4.433 ± 0.584
1.509SerPro: 1.509 ± 0.401
1.603SerGln: 1.603 ± 0.432
2.641SerArg: 2.641 ± 0.538
5.376SerSer: 5.376 ± 0.889
3.961SerThr: 3.961 ± 0.69
4.244SerVal: 4.244 ± 0.66
0.849SerTrp: 0.849 ± 0.273
3.207SerTyr: 3.207 ± 0.521
0.0SerXaa: 0.0 ± 0.0
Thr
5.564ThrAla: 5.564 ± 1.242
0.377ThrCys: 0.377 ± 0.226
3.49ThrAsp: 3.49 ± 0.652
4.716ThrGlu: 4.716 ± 0.659
1.981ThrPhe: 1.981 ± 0.534
5.282ThrGly: 5.282 ± 0.702
1.132ThrHis: 1.132 ± 0.282
4.621ThrIle: 4.621 ± 0.721
4.904ThrLys: 4.904 ± 0.757
5.187ThrLeu: 5.187 ± 0.728
1.037ThrMet: 1.037 ± 0.338
4.716ThrAsn: 4.716 ± 0.781
2.829ThrPro: 2.829 ± 0.52
3.207ThrGln: 3.207 ± 0.453
2.452ThrArg: 2.452 ± 0.44
3.773ThrSer: 3.773 ± 0.568
5.847ThrThr: 5.847 ± 1.255
4.716ThrVal: 4.716 ± 1.155
0.66ThrTrp: 0.66 ± 0.251
2.358ThrTyr: 2.358 ± 0.456
0.0ThrXaa: 0.0 ± 0.0
Val
4.621ValAla: 4.621 ± 0.566
0.283ValCys: 0.283 ± 0.18
4.999ValAsp: 4.999 ± 0.404
5.376ValGlu: 5.376 ± 0.832
1.698ValPhe: 1.698 ± 0.341
3.112ValGly: 3.112 ± 0.462
0.377ValHis: 0.377 ± 0.168
4.716ValIle: 4.716 ± 0.713
4.716ValLys: 4.716 ± 0.802
3.207ValLeu: 3.207 ± 0.681
1.037ValMet: 1.037 ± 0.26
2.358ValAsn: 2.358 ± 0.564
1.415ValPro: 1.415 ± 0.482
2.924ValGln: 2.924 ± 0.756
1.886ValArg: 1.886 ± 0.446
4.621ValSer: 4.621 ± 0.516
5.942ValThr: 5.942 ± 0.764
3.961ValVal: 3.961 ± 0.738
0.66ValTrp: 0.66 ± 0.231
1.981ValTyr: 1.981 ± 0.434
0.0ValXaa: 0.0 ± 0.0
Trp
1.226TrpAla: 1.226 ± 0.344
0.094TrpCys: 0.094 ± 0.083
0.943TrpAsp: 0.943 ± 0.353
0.755TrpGlu: 0.755 ± 0.255
0.283TrpPhe: 0.283 ± 0.148
0.377TrpGly: 0.377 ± 0.177
0.189TrpHis: 0.189 ± 0.11
0.66TrpIle: 0.66 ± 0.19
1.226TrpLys: 1.226 ± 0.277
1.132TrpLeu: 1.132 ± 0.356
0.566TrpMet: 0.566 ± 0.228
0.943TrpAsn: 0.943 ± 0.298
0.094TrpPro: 0.094 ± 0.086
0.377TrpGln: 0.377 ± 0.152
0.566TrpArg: 0.566 ± 0.199
1.037TrpSer: 1.037 ± 0.346
0.66TrpThr: 0.66 ± 0.343
0.472TrpVal: 0.472 ± 0.199
0.283TrpTrp: 0.283 ± 0.149
0.566TrpTyr: 0.566 ± 0.208
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.075TyrAla: 2.075 ± 0.517
0.283TyrCys: 0.283 ± 0.263
3.018TyrAsp: 3.018 ± 0.498
2.641TyrGlu: 2.641 ± 0.458
1.509TyrPhe: 1.509 ± 0.366
2.169TyrGly: 2.169 ± 0.492
0.283TyrHis: 0.283 ± 0.183
2.075TyrIle: 2.075 ± 0.463
2.829TyrLys: 2.829 ± 0.468
3.773TyrLeu: 3.773 ± 0.691
0.566TyrMet: 0.566 ± 0.192
1.32TyrAsn: 1.32 ± 0.314
0.66TyrPro: 0.66 ± 0.289
1.037TyrGln: 1.037 ± 0.238
1.509TyrArg: 1.509 ± 0.322
2.641TyrSer: 2.641 ± 0.545
2.546TyrThr: 2.546 ± 0.476
1.886TyrVal: 1.886 ± 0.414
0.472TyrTrp: 0.472 ± 0.188
1.603TyrTyr: 1.603 ± 0.401
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (10604 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski