Amino acid dipeptide frequency for Pectobacterium phage Wc4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
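As a rough illustration of what such a calculation could look like, the sketch below counts adjacent residue pairs in a set of protein sequences and reports each dipeptide in per mille. It is not the database's own code: the function name `dipeptide_frequencies`, the use of overlapping pairs that never span two proteins, the mapping of every non-standard letter to X (Xaa), and the normalisation by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids
ALPHABET = STANDARD + "X"           # X stands for Xaa (assumed encoding)


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X.
        cleaned = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    all_pairs = [a + b for a, b in product(ALPHABET, repeat=2)]
    if total == 0:
        return {pair: 0.0 for pair in all_pairs}
    # Assumption: normalise by the total number of counted pairs.
    return {pair: 1000.0 * counts[pair] / total for pair in all_pairs}


# Two toy sequences (hypothetical, not taken from the proteome).
freqs = dipeptide_frequencies(["MAAK", "MAGP"])
```

With the two toy sequences there are six pairs in total, so `freqs["MA"]` is two sixths of 1000 and `freqs["XX"]` is zero.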

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would have a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
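The comparison against the uniform expectation can be sketched as follows. The helper `representation` and its strict greater-than/less-than rule are assumptions for illustration; the exact thresholds used for the red and blue marking on the page are not stated. The three example values are taken from the table.

```python
N_AMINO_ACIDS = 20
# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / N_AMINO_ACIDS ** 2


def representation(observed_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


# Values in per mille, copied from the table for this proteome.
examples = {"AlaAla": 7.932, "GlyPro": 0.037, "AlaPro": 2.507}
labels = {name: representation(value) for name, value in examples.items()}
```

Under this simple rule AlaAla counts as overrepresented and GlyPro as underrepresented, while AlaPro sits almost exactly at the expected 2.5‰.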

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information see the sequence space article on Wikipedia.

| First / second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 7.932 ± 0.774 | 0.748 ± 0.161 | 3.891 ± 0.39 | 4.565 ± 0.543 | 3.106 ± 0.344 | 5.238 ± 0.449 | 1.497 ± 0.257 | 5.313 ± 0.461 | 5.126 ± 0.483 | 7.259 ± 0.634 | 2.245 ± 0.312 | 4.191 ± 0.481 | 2.507 ± 0.287 | 3.517 ± 0.375 | 3.667 ± 0.327 | 5.126 ± 0.677 | 4.864 ± 0.605 | 5.126 ± 0.374 | 0.973 ± 0.181 | 2.507 ± 0.277 | 0.0 ± 0.0 |
| Cys | 0.823 ± 0.173 | 0.15 ± 0.086 | 0.823 ± 0.191 | 1.197 ± 0.193 | 0.561 ± 0.176 | 1.16 ± 0.234 | 0.449 ± 0.146 | 0.599 ± 0.149 | 1.31 ± 0.222 | 0.973 ± 0.211 | 0.225 ± 0.095 | 0.412 ± 0.129 | 0.412 ± 0.109 | 0.412 ± 0.155 | 0.412 ± 0.109 | 1.01 ± 0.183 | 0.711 ± 0.159 | 0.823 ± 0.156 | 0.225 ± 0.089 | 0.524 ± 0.143 | 0.0 ± 0.0 |
| Asp | 5.425 ± 0.54 | 0.786 ± 0.175 | 3.891 ± 0.464 | 4.827 ± 0.452 | 2.694 ± 0.285 | 5.575 ± 0.383 | 0.861 ± 0.215 | 3.742 ± 0.393 | 3.48 ± 0.423 | 3.704 ± 0.392 | 1.759 ± 0.255 | 3.068 ± 0.245 | 1.384 ± 0.262 | 1.646 ± 0.323 | 2.282 ± 0.307 | 4.153 ± 0.383 | 2.881 ± 0.254 | 4.378 ± 0.36 | 1.16 ± 0.202 | 1.721 ± 0.279 | 0.0 ± 0.0 |
| Glu | 4.902 ± 0.549 | 0.674 ± 0.175 | 3.629 ± 0.414 | 3.704 ± 0.498 | 2.095 ± 0.23 | 4.191 ± 0.368 | 1.609 ± 0.229 | 3.33 ± 0.415 | 4.116 ± 0.454 | 6.136 ± 0.466 | 2.806 ± 0.33 | 2.919 ± 0.341 | 2.47 ± 0.304 | 2.694 ± 0.305 | 3.18 ± 0.401 | 2.806 ± 0.38 | 2.993 ± 0.314 | 5.014 ± 0.388 | 0.973 ± 0.169 | 1.534 ± 0.238 | 0.0 ± 0.0 |
| Phe | 2.731 ± 0.329 | 0.524 ± 0.157 | 2.507 ± 0.298 | 2.769 ± 0.338 | 1.572 ± 0.253 | 3.442 ± 0.382 | 0.636 ± 0.166 | 2.245 ± 0.247 | 2.582 ± 0.296 | 2.881 ± 0.373 | 1.123 ± 0.196 | 2.619 ± 0.326 | 1.085 ± 0.186 | 1.946 ± 0.277 | 1.796 ± 0.212 | 3.48 ± 0.376 | 3.405 ± 0.373 | 2.395 ± 0.349 | 0.412 ± 0.119 | 1.16 ± 0.214 | 0.0 ± 0.0 |
| Gly | 5.5 ± 0.517 | 1.235 ± 0.222 | 4.116 ± 0.326 | 4.976 ± 0.512 | 2.919 ± 0.41 | 4.827 ± 0.472 | 1.272 ± 0.241 | 4.565 ± 0.455 | 4.864 ± 0.497 | 5.575 ± 0.518 | 1.721 ± 0.27 | 3.031 ± 0.339 | 0.037 ± 0.036 | 2.282 ± 0.295 | 3.255 ± 0.337 | 5.463 ± 0.603 | 4.677 ± 0.603 | 5.8 ± 0.528 | 1.085 ± 0.217 | 2.993 ± 0.325 | 0.0 ± 0.0 |
| His | 1.048 ± 0.181 | 0.449 ± 0.118 | 1.123 ± 0.206 | 0.935 ± 0.171 | 0.786 ± 0.149 | 1.609 ± 0.233 | 0.524 ± 0.143 | 1.384 ± 0.238 | 1.347 ± 0.219 | 1.833 ± 0.286 | 0.524 ± 0.127 | 0.973 ± 0.185 | 0.561 ± 0.141 | 0.561 ± 0.147 | 0.748 ± 0.146 | 1.534 ± 0.283 | 1.384 ± 0.279 | 1.609 ± 0.219 | 0.412 ± 0.131 | 0.711 ± 0.193 | 0.0 ± 0.0 |
| Ile | 5.164 ± 0.526 | 0.711 ± 0.168 | 3.143 ± 0.286 | 3.704 ± 0.362 | 2.432 ± 0.332 | 3.255 ± 0.341 | 0.935 ± 0.171 | 3.405 ± 0.395 | 4.228 ± 0.457 | 3.929 ± 0.335 | 0.973 ± 0.227 | 3.031 ± 0.267 | 2.021 ± 0.238 | 2.17 ± 0.272 | 3.106 ± 0.374 | 3.891 ± 0.453 | 4.378 ± 0.378 | 4.078 ± 0.448 | 0.898 ± 0.187 | 2.657 ± 0.342 | 0.0 ± 0.0 |
| Lys | 5.613 ± 0.444 | 0.861 ± 0.193 | 3.817 ± 0.501 | 3.742 ± 0.418 | 2.095 ± 0.273 | 4.49 ± 0.438 | 1.16 ± 0.227 | 3.068 ± 0.355 | 4.153 ± 0.407 | 5.238 ± 0.369 | 2.32 ± 0.256 | 2.582 ± 0.322 | 2.619 ± 0.293 | 2.17 ± 0.354 | 3.555 ± 0.413 | 3.891 ± 0.362 | 3.592 ± 0.349 | 5.425 ± 0.505 | 0.711 ± 0.157 | 2.47 ± 0.319 | 0.0 ± 0.0 |
| Leu | 5.65 ± 0.601 | 1.347 ± 0.226 | 5.313 ± 0.431 | 5.425 ± 0.549 | 3.255 ± 0.367 | 4.191 ± 0.426 | 1.646 ± 0.255 | 4.677 ± 0.487 | 5.164 ± 0.452 | 5.5 ± 0.566 | 2.582 ± 0.318 | 4.34 ± 0.359 | 3.218 ± 0.344 | 3.143 ± 0.365 | 4.602 ± 0.352 | 5.949 ± 0.532 | 5.65 ± 0.439 | 5.164 ± 0.453 | 1.459 ± 0.228 | 2.657 ± 0.319 | 0.0 ± 0.0 |
| Met | 2.956 ± 0.365 | 0.15 ± 0.061 | 1.272 ± 0.228 | 1.422 ± 0.247 | 1.235 ± 0.282 | 1.497 ± 0.247 | 0.449 ± 0.124 | 1.422 ± 0.211 | 2.619 ± 0.279 | 2.17 ± 0.33 | 0.823 ± 0.169 | 1.459 ± 0.233 | 0.973 ± 0.186 | 1.497 ± 0.225 | 1.684 ± 0.253 | 1.908 ± 0.236 | 2.245 ± 0.281 | 1.646 ± 0.276 | 0.225 ± 0.096 | 0.748 ± 0.146 | 0.0 ± 0.0 |
| Asn | 3.442 ± 0.4 | 0.674 ± 0.157 | 2.133 ± 0.237 | 2.32 ± 0.288 | 1.983 ± 0.236 | 4.378 ± 0.52 | 0.823 ± 0.161 | 3.18 ± 0.378 | 3.143 ± 0.307 | 4.677 ± 0.392 | 0.711 ± 0.16 | 2.245 ± 0.343 | 2.544 ± 0.317 | 1.871 ± 0.264 | 2.32 ± 0.298 | 3.704 ± 0.302 | 2.881 ± 0.352 | 3.33 ± 0.397 | 0.861 ± 0.172 | 1.646 ± 0.232 | 0.0 ± 0.0 |
| Pro | 2.32 ± 0.263 | 0.262 ± 0.085 | 3.068 ± 0.317 | 2.694 ± 0.325 | 1.609 ± 0.215 | 0.075 ± 0.056 | 0.599 ± 0.151 | 2.357 ± 0.279 | 1.422 ± 0.325 | 2.657 ± 0.264 | 1.123 ± 0.242 | 1.646 ± 0.285 | 1.384 ± 0.266 | 1.16 ± 0.268 | 1.684 ± 0.246 | 2.021 ± 0.327 | 2.395 ± 0.286 | 2.806 ± 0.342 | 0.299 ± 0.101 | 1.16 ± 0.172 | 0.0 ± 0.0 |
| Gln | 3.368 ± 0.381 | 0.524 ± 0.131 | 1.721 ± 0.265 | 2.17 ± 0.307 | 1.796 ± 0.261 | 2.919 ± 0.325 | 0.861 ± 0.189 | 2.507 ± 0.348 | 1.796 ± 0.269 | 3.592 ± 0.443 | 0.674 ± 0.16 | 1.759 ± 0.219 | 1.534 ± 0.314 | 1.983 ± 0.297 | 2.357 ± 0.322 | 2.058 ± 0.293 | 2.731 ± 0.306 | 2.956 ± 0.348 | 0.898 ± 0.176 | 1.646 ± 0.207 | 0.0 ± 0.0 |
| Arg | 3.592 ± 0.391 | 0.524 ± 0.111 | 3.368 ± 0.302 | 2.619 ± 0.327 | 2.208 ± 0.314 | 3.442 ± 0.338 | 1.01 ± 0.159 | 3.33 ± 0.336 | 3.031 ± 0.338 | 3.929 ± 0.411 | 1.908 ± 0.24 | 2.432 ± 0.338 | 1.459 ± 0.204 | 2.245 ± 0.281 | 3.218 ± 0.349 | 2.395 ± 0.301 | 2.619 ± 0.337 | 2.919 ± 0.348 | 0.786 ± 0.215 | 2.357 ± 0.395 | 0.0 ± 0.0 |
| Ser | 5.463 ± 0.537 | 0.748 ± 0.151 | 4.266 ± 0.415 | 3.704 ± 0.434 | 2.881 ± 0.352 | 5.613 ± 0.446 | 1.459 ± 0.273 | 3.368 ± 0.408 | 3.966 ± 0.354 | 5.874 ± 0.446 | 1.871 ± 0.266 | 2.582 ± 0.394 | 2.058 ± 0.264 | 2.806 ± 0.325 | 2.619 ± 0.274 | 4.49 ± 0.616 | 4.153 ± 0.572 | 5.425 ± 0.509 | 0.973 ± 0.208 | 2.582 ± 0.336 | 0.0 ± 0.0 |
| Thr | 4.902 ± 0.485 | 0.786 ± 0.166 | 3.48 ± 0.39 | 4.266 ± 0.466 | 2.507 ± 0.293 | 6.249 ± 0.548 | 1.347 ± 0.184 | 3.18 ± 0.318 | 3.405 ± 0.307 | 5.201 ± 0.49 | 1.384 ± 0.247 | 2.844 ± 0.38 | 2.731 ± 0.286 | 2.619 ± 0.328 | 2.919 ± 0.384 | 4.078 ± 0.399 | 3.704 ± 0.502 | 5.425 ± 0.497 | 0.486 ± 0.133 | 2.133 ± 0.308 | 0.0 ± 0.0 |
| Val | 5.201 ± 0.468 | 1.235 ± 0.246 | 4.789 ± 0.44 | 4.078 ± 0.431 | 3.293 ± 0.357 | 4.303 ± 0.425 | 1.833 ± 0.251 | 3.704 ± 0.423 | 4.715 ± 0.526 | 5.014 ± 0.527 | 1.946 ± 0.254 | 3.929 ± 0.413 | 2.208 ± 0.325 | 3.068 ± 0.397 | 3.592 ± 0.322 | 5.65 ± 0.533 | 5.388 ± 0.63 | 4.677 ± 0.514 | 0.748 ± 0.175 | 2.993 ± 0.314 | 0.0 ± 0.0 |
| Trp | 1.01 ± 0.202 | 0.15 ± 0.077 | 0.861 ± 0.173 | 0.748 ± 0.168 | 0.599 ± 0.134 | 0.861 ± 0.193 | 0.225 ± 0.082 | 0.486 ± 0.138 | 1.16 ± 0.229 | 1.646 ± 0.233 | 0.486 ± 0.143 | 0.898 ± 0.194 | 0.262 ± 0.107 | 0.486 ± 0.133 | 0.636 ± 0.181 | 0.898 ± 0.18 | 0.711 ± 0.162 | 1.31 ± 0.221 | 0.15 ± 0.065 | 0.561 ± 0.128 | 0.0 ± 0.0 |
| Tyr | 2.357 ± 0.299 | 0.674 ± 0.14 | 2.208 ± 0.287 | 1.908 ± 0.272 | 1.684 ± 0.282 | 2.919 ± 0.348 | 0.861 ± 0.171 | 2.245 ± 0.279 | 1.946 ± 0.296 | 3.143 ± 0.326 | 1.16 ± 0.198 | 1.908 ± 0.226 | 1.272 ± 0.2 | 1.572 ± 0.231 | 1.759 ± 0.249 | 2.395 ± 0.337 | 2.282 ± 0.276 | 2.058 ± 0.23 | 0.412 ± 0.154 | 0.973 ± 0.168 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 145 proteins (26727 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
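A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the estimates is reported. This is only an illustration under stated assumptions: the names `per_mille` and `bootstrap_sd` are hypothetical, the reported ± is assumed to be the standard deviation across replicates, and the database's actual procedure may differ.

```python
import random
import statistics


def per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_sd(proteins, dipeptide, n_replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


# Toy sequences (hypothetical, not taken from the proteome).
toy = ["MAAK", "MAGP", "MKKA"]
sd_aa = bootstrap_sd(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.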

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski