Amino acid dipeptide frequency for Enterococcus phage MSF2

Apart from single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs.
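A minimal sketch of how such frequencies could be counted is given here. It assumes overlapping residue pairs that never span two proteins and uses one-letter codes rather than the three-letter codes of the table; the function name and the toy sequences are made up for illustration and are not taken from Proteome-pI.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not real phage proteins.
toy = ["MKLA", "AKL"]
freqs = dipeptide_per_mille(toy)
```

With this toy input there are five dipeptides in total (MK, KL, LA, AK, KL), so KL accounts for two of the five.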

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (with all amino acids equally frequent), each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
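The expected value and the over/under comparison can be sketched as follows. The 2.5‰ threshold follows from 1/400; whether the site colours cells against exactly this threshold is an assumption, and `classify` is a hypothetical helper. The three example values are taken from the table.

```python
N_STANDARD = 20

# Uniform expectation: 1 / (20 * 20) of all dipeptides, expressed in per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)        # 2.5
# Same expectation if Xaa is counted as a 21st residue (441 dipeptides).
expected_with_xaa = 1000.0 / ((N_STANDARD + 1) ** 2)   # about 2.27


def classify(value_per_mille, expected=expected_per_mille):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"GluLeu": 8.978, "AlaAla": 0.243, "CysCys": 0.0}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this assumption GluLeu would count as overrepresented, while AlaAla and CysCys would count as underrepresented.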

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
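As a small worked conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical; the GluLeu value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


glu_leu = 8.978  # per mille, from the table
fraction = per_mille_to_fraction(glu_leu)
percent = per_mille_to_percent(glu_leu)
```

For GluLeu this gives a fraction of roughly 0.009, i.e. just under 0.9% of all dipeptides.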

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.243 ± 0.129
AlaCys: 0.243 ± 0.126
AlaAsp: 3.883 ± 0.545
AlaGlu: 4.853 ± 0.71
AlaPhe: 2.265 ± 0.452
AlaGly: 3.316 ± 0.626
AlaHis: 0.809 ± 0.252
AlaIle: 4.853 ± 0.804
AlaLys: 5.824 ± 0.84
AlaLeu: 4.691 ± 0.789
AlaMet: 2.507 ± 0.633
AlaAsn: 3.155 ± 0.537
AlaPro: 2.022 ± 0.314
AlaGln: 1.699 ± 0.405
AlaArg: 1.618 ± 0.29
AlaSer: 2.831 ± 0.503
AlaThr: 4.772 ± 0.613
AlaVal: 3.721 ± 0.627
AlaTrp: 0.728 ± 0.232
AlaTyr: 2.912 ± 0.407
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.404 ± 0.171
CysCys: 0.0 ± 0.0
CysAsp: 0.647 ± 0.248
CysGlu: 0.647 ± 0.261
CysPhe: 0.162 ± 0.122
CysGly: 0.404 ± 0.185
CysHis: 0.243 ± 0.149
CysIle: 0.404 ± 0.18
CysLys: 0.566 ± 0.22
CysLeu: 0.404 ± 0.2
CysMet: 0.162 ± 0.135
CysAsn: 0.566 ± 0.24
CysPro: 0.081 ± 0.097
CysGln: 0.243 ± 0.144
CysArg: 0.162 ± 0.116
CysSer: 0.566 ± 0.198
CysThr: 0.243 ± 0.156
CysVal: 0.081 ± 0.08
CysTrp: 0.081 ± 0.083
CysTyr: 0.243 ± 0.146
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.993 ± 0.541
AspCys: 0.162 ± 0.121
AspAsp: 2.993 ± 0.557
AspGlu: 4.449 ± 0.773
AspPhe: 3.235 ± 0.627
AspGly: 5.419 ± 0.739
AspHis: 0.809 ± 0.279
AspIle: 4.53 ± 0.745
AspLys: 5.986 ± 0.816
AspLeu: 6.228 ± 0.873
AspMet: 1.941 ± 0.361
AspAsn: 4.772 ± 0.594
AspPro: 2.103 ± 0.512
AspGln: 1.78 ± 0.367
AspArg: 1.78 ± 0.363
AspSer: 3.235 ± 0.544
AspThr: 3.397 ± 0.74
AspVal: 4.287 ± 0.537
AspTrp: 0.809 ± 0.289
AspTyr: 2.669 ± 0.565
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.611 ± 0.673
GluCys: 0.324 ± 0.206
GluAsp: 5.096 ± 0.684
GluGlu: 6.228 ± 0.963
GluPhe: 2.912 ± 0.454
GluGly: 4.772 ± 0.723
GluHis: 1.618 ± 0.413
GluIle: 3.721 ± 0.547
GluLys: 5.258 ± 0.458
GluLeu: 8.978 ± 0.989
GluMet: 2.993 ± 0.423
GluAsn: 4.287 ± 0.586
GluPro: 2.912 ± 0.616
GluGln: 3.64 ± 0.607
GluArg: 3.478 ± 0.498
GluSer: 3.397 ± 0.522
GluThr: 4.53 ± 0.463
GluVal: 6.714 ± 0.982
GluTrp: 1.618 ± 0.407
GluTyr: 2.993 ± 0.699
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.618 ± 0.329
PheCys: 0.243 ± 0.128
PheAsp: 2.427 ± 0.523
PheGlu: 2.507 ± 0.546
PhePhe: 0.971 ± 0.317
PheGly: 2.993 ± 0.603
PheHis: 0.404 ± 0.191
PheIle: 3.721 ± 0.805
PheLys: 4.368 ± 0.715
PheLeu: 1.618 ± 0.384
PheMet: 0.971 ± 0.233
PheAsn: 3.316 ± 0.47
PhePro: 0.404 ± 0.163
PheGln: 1.294 ± 0.357
PheArg: 1.86 ± 0.384
PheSer: 2.993 ± 0.468
PheThr: 4.287 ± 0.764
PheVal: 2.993 ± 0.614
PheTrp: 0.566 ± 0.225
PheTyr: 1.618 ± 0.351
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.368 ± 1.579
GlyCys: 0.243 ± 0.159
GlyAsp: 3.478 ± 0.494
GlyGlu: 3.478 ± 0.479
GlyPhe: 2.993 ± 0.444
GlyGly: 4.53 ± 1.155
GlyHis: 0.809 ± 0.254
GlyIle: 5.662 ± 1.054
GlyLys: 6.066 ± 0.603
GlyLeu: 5.5 ± 0.941
GlyMet: 2.265 ± 0.573
GlyAsn: 4.368 ± 0.599
GlyPro: 0.89 ± 0.409
GlyGln: 2.265 ± 0.586
GlyArg: 2.346 ± 0.397
GlySer: 2.912 ± 0.631
GlyThr: 4.368 ± 0.754
GlyVal: 4.125 ± 0.611
GlyTrp: 0.89 ± 0.24
GlyTyr: 2.75 ± 0.607
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.89 ± 0.298
HisCys: 0.162 ± 0.123
HisAsp: 0.809 ± 0.273
HisGlu: 1.375 ± 0.328
HisPhe: 1.375 ± 0.4
HisGly: 1.052 ± 0.306
HisHis: 0.324 ± 0.17
HisIle: 0.485 ± 0.2
HisLys: 1.699 ± 0.422
HisLeu: 1.213 ± 0.383
HisMet: 0.404 ± 0.196
HisAsn: 1.537 ± 0.363
HisPro: 0.404 ± 0.195
HisGln: 0.566 ± 0.268
HisArg: 0.404 ± 0.171
HisSer: 0.728 ± 0.213
HisThr: 0.809 ± 0.359
HisVal: 0.809 ± 0.311
HisTrp: 0.0 ± 0.0
HisTyr: 0.971 ± 0.297
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.883 ± 0.635
IleCys: 0.566 ± 0.214
IleAsp: 5.258 ± 0.678
IleGlu: 7.361 ± 1.033
IlePhe: 2.022 ± 0.419
IleGly: 5.015 ± 0.678
IleHis: 0.89 ± 0.32
IleIle: 4.044 ± 0.564
IleLys: 5.905 ± 0.633
IleLeu: 5.339 ± 0.601
IleMet: 1.699 ± 0.452
IleAsn: 3.963 ± 0.455
IlePro: 2.346 ± 0.415
IleGln: 2.831 ± 0.472
IleArg: 1.86 ± 0.311
IleSer: 4.044 ± 0.62
IleThr: 4.368 ± 0.708
IleVal: 4.125 ± 0.609
IleTrp: 0.89 ± 0.323
IleTyr: 2.507 ± 0.554
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.743 ± 0.793
LysCys: 0.566 ± 0.249
LysAsp: 5.743 ± 0.837
LysGlu: 8.412 ± 0.906
LysPhe: 3.64 ± 0.493
LysGly: 5.5 ± 1.077
LysHis: 1.294 ± 0.297
LysIle: 4.611 ± 0.6
LysLys: 6.875 ± 0.807
LysLeu: 6.228 ± 0.646
LysMet: 3.316 ± 0.429
LysAsn: 4.934 ± 0.556
LysPro: 3.155 ± 0.672
LysGln: 4.287 ± 0.585
LysArg: 3.802 ± 0.611
LysSer: 4.287 ± 0.674
LysThr: 5.581 ± 0.796
LysVal: 5.339 ± 0.664
LysTrp: 1.294 ± 0.385
LysTyr: 3.883 ± 0.484
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.258 ± 0.819
LeuCys: 0.566 ± 0.258
LeuAsp: 5.662 ± 0.603
LeuGlu: 8.008 ± 0.724
LeuPhe: 2.993 ± 0.452
LeuGly: 4.611 ± 1.027
LeuHis: 1.052 ± 0.29
LeuIle: 5.096 ± 0.759
LeuLys: 7.522 ± 0.78
LeuLeu: 6.39 ± 1.01
LeuMet: 1.537 ± 0.338
LeuAsn: 5.905 ± 0.657
LeuPro: 2.993 ± 0.502
LeuGln: 3.963 ± 0.611
LeuArg: 2.669 ± 0.458
LeuSer: 3.883 ± 0.532
LeuThr: 4.934 ± 0.536
LeuVal: 5.824 ± 0.654
LeuTrp: 0.809 ± 0.263
LeuTyr: 2.588 ± 0.449
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.022 ± 0.707
MetCys: 0.324 ± 0.179
MetAsp: 1.78 ± 0.386
MetGlu: 2.831 ± 0.675
MetPhe: 1.213 ± 0.455
MetGly: 1.78 ± 0.463
MetHis: 0.162 ± 0.122
MetIle: 1.86 ± 0.434
MetLys: 2.427 ± 0.437
MetLeu: 2.346 ± 0.479
MetMet: 0.647 ± 0.276
MetAsn: 1.699 ± 0.376
MetPro: 0.485 ± 0.207
MetGln: 1.132 ± 0.29
MetArg: 1.456 ± 0.4
MetSer: 1.294 ± 0.336
MetThr: 2.588 ± 0.381
MetVal: 1.294 ± 0.381
MetTrp: 0.404 ± 0.253
MetTyr: 1.294 ± 0.322
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.449 ± 0.817
AsnCys: 0.404 ± 0.192
AsnAsp: 4.044 ± 0.581
AsnGlu: 5.015 ± 0.626
AsnPhe: 1.86 ± 0.43
AsnGly: 5.986 ± 0.711
AsnHis: 0.89 ± 0.279
AsnIle: 4.853 ± 0.821
AsnLys: 5.662 ± 0.766
AsnLeu: 4.691 ± 0.644
AsnMet: 2.022 ± 0.365
AsnAsn: 3.883 ± 0.535
AsnPro: 2.022 ± 0.339
AsnGln: 1.618 ± 0.436
AsnArg: 1.537 ± 0.404
AsnSer: 2.831 ± 0.553
AsnThr: 4.853 ± 0.682
AsnVal: 4.044 ± 0.607
AsnTrp: 0.809 ± 0.319
AsnTyr: 2.588 ± 0.56
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.346 ± 0.548
ProCys: 0.162 ± 0.123
ProAsp: 1.618 ± 0.434
ProGlu: 2.831 ± 0.413
ProPhe: 1.294 ± 0.268
ProGly: 0.081 ± 0.075
ProHis: 0.485 ± 0.216
ProIle: 2.022 ± 0.339
ProLys: 2.588 ± 0.572
ProLeu: 3.316 ± 0.628
ProMet: 0.809 ± 0.277
ProAsn: 2.022 ± 0.416
ProPro: 0.485 ± 0.259
ProGln: 1.294 ± 0.405
ProArg: 0.485 ± 0.201
ProSer: 1.456 ± 0.324
ProThr: 1.699 ± 0.426
ProVal: 2.507 ± 0.462
ProTrp: 0.243 ± 0.141
ProTyr: 1.941 ± 0.418
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.75 ± 0.865
GlnCys: 0.404 ± 0.184
GlnAsp: 1.78 ± 0.297
GlnGlu: 2.507 ± 0.527
GlnPhe: 1.699 ± 0.46
GlnGly: 2.022 ± 0.409
GlnHis: 0.728 ± 0.221
GlnIle: 2.831 ± 0.47
GlnLys: 2.184 ± 0.477
GlnLeu: 3.397 ± 0.414
GlnMet: 0.809 ± 0.289
GlnAsn: 1.456 ± 0.276
GlnPro: 1.294 ± 0.228
GlnGln: 2.184 ± 0.471
GlnArg: 2.022 ± 0.425
GlnSer: 2.265 ± 0.448
GlnThr: 2.588 ± 0.557
GlnVal: 2.103 ± 0.427
GlnTrp: 0.566 ± 0.221
GlnTyr: 2.427 ± 0.371
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.375 ± 0.473
ArgCys: 0.324 ± 0.158
ArgAsp: 2.669 ± 0.42
ArgGlu: 1.941 ± 0.395
ArgPhe: 1.456 ± 0.23
ArgGly: 1.618 ± 0.469
ArgHis: 0.647 ± 0.292
ArgIle: 2.346 ± 0.393
ArgLys: 2.75 ± 0.607
ArgLeu: 2.912 ± 0.441
ArgMet: 0.89 ± 0.2
ArgAsn: 2.669 ± 0.527
ArgPro: 1.052 ± 0.276
ArgGln: 0.971 ± 0.344
ArgArg: 0.971 ± 0.29
ArgSer: 1.537 ± 0.247
ArgThr: 2.427 ± 0.462
ArgVal: 2.103 ± 0.446
ArgTrp: 0.404 ± 0.165
ArgTyr: 2.022 ± 0.423
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.912 ± 0.548
SerCys: 0.081 ± 0.082
SerAsp: 2.669 ± 0.452
SerGlu: 3.559 ± 0.546
SerPhe: 2.831 ± 0.457
SerGly: 4.853 ± 0.883
SerHis: 1.699 ± 0.443
SerIle: 3.721 ± 0.58
SerLys: 3.963 ± 0.618
SerLeu: 3.559 ± 0.544
SerMet: 1.456 ± 0.328
SerAsn: 3.155 ± 0.493
SerPro: 0.809 ± 0.27
SerGln: 1.78 ± 0.492
SerArg: 1.456 ± 0.348
SerSer: 2.507 ± 0.486
SerThr: 3.397 ± 0.714
SerVal: 3.074 ± 0.501
SerTrp: 0.566 ± 0.241
SerTyr: 2.669 ± 0.623
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.963 ± 0.576
ThrCys: 0.324 ± 0.175
ThrAsp: 3.963 ± 0.575
ThrGlu: 3.721 ± 0.556
ThrPhe: 3.155 ± 0.466
ThrGly: 3.64 ± 0.494
ThrHis: 1.375 ± 0.313
ThrIle: 5.581 ± 0.595
ThrLys: 7.28 ± 0.842
ThrLeu: 5.662 ± 1.108
ThrMet: 1.618 ± 0.478
ThrAsn: 4.044 ± 0.624
ThrPro: 2.669 ± 0.523
ThrGln: 3.235 ± 0.571
ThrArg: 1.618 ± 0.331
ThrSer: 2.669 ± 0.593
ThrThr: 4.611 ± 0.764
ThrVal: 3.64 ± 0.558
ThrTrp: 0.809 ± 0.238
ThrTyr: 2.346 ± 0.488
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.772 ± 0.594
ValCys: 0.404 ± 0.176
ValAsp: 4.611 ± 0.572
ValGlu: 4.934 ± 0.763
ValPhe: 2.831 ± 0.484
ValGly: 3.235 ± 0.563
ValHis: 0.728 ± 0.276
ValIle: 4.125 ± 0.702
ValLys: 6.309 ± 0.909
ValLeu: 5.015 ± 0.715
ValMet: 1.86 ± 0.331
ValAsn: 4.53 ± 0.488
ValPro: 2.346 ± 0.454
ValGln: 2.022 ± 0.429
ValArg: 1.618 ± 0.3
ValSer: 4.691 ± 0.396
ValThr: 3.478 ± 0.642
ValVal: 4.206 ± 0.718
ValTrp: 0.89 ± 0.356
ValTyr: 2.507 ± 0.52
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.324 ± 0.192
TrpCys: 0.081 ± 0.082
TrpAsp: 0.971 ± 0.455
TrpGlu: 1.456 ± 0.301
TrpPhe: 0.809 ± 0.217
TrpGly: 0.971 ± 0.308
TrpHis: 0.324 ± 0.174
TrpIle: 1.052 ± 0.291
TrpLys: 1.294 ± 0.301
TrpLeu: 0.971 ± 0.271
TrpMet: 0.081 ± 0.074
TrpAsn: 0.485 ± 0.205
TrpPro: 0.0 ± 0.0
TrpGln: 0.404 ± 0.145
TrpArg: 0.728 ± 0.262
TrpSer: 0.566 ± 0.183
TrpThr: 0.728 ± 0.205
TrpVal: 1.213 ± 0.334
TrpTrp: 0.324 ± 0.142
TrpTyr: 0.162 ± 0.101
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.022 ± 0.456
TyrCys: 0.728 ± 0.266
TyrAsp: 3.478 ± 0.613
TyrGlu: 3.963 ± 0.679
TyrPhe: 1.456 ± 0.284
TyrGly: 2.507 ± 0.537
TyrHis: 0.728 ± 0.224
TyrIle: 3.397 ± 0.545
TyrLys: 3.963 ± 0.53
TyrLeu: 3.883 ± 0.703
TyrMet: 0.971 ± 0.323
TyrAsn: 3.316 ± 0.531
TyrPro: 1.213 ± 0.394
TyrGln: 0.809 ± 0.223
TyrArg: 1.213 ± 0.386
TyrSer: 2.022 ± 0.452
TyrThr: 2.265 ± 0.467
TyrVal: 2.912 ± 0.593
TyrTrp: 0.243 ± 0.127
TyrTyr: 2.265 ± 0.478
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (12364 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
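One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported. Using the sample standard deviation as the error, the fixed seed, one-letter codes, and the helper names are all assumptions; the source only states that bootstrapping (x100) was done at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Std. dev. of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the real phage proteins.
toy = ["MKLAKL", "AKLMK", "GGSGG", "MKKLA"]
err = bootstrap_error(toy, "KL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so a dipeptide concentrated in a few proteins yields a larger error than one spread evenly across the proteome.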

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski