Amino acid dipepetide frequency for Pseudomonas phage vB_PaeM_C2-10_Ab08

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.562AlaAla: 6.562 ± 0.684
1.233AlaCys: 1.233 ± 0.19
4.242AlaAsp: 4.242 ± 0.423
5.874AlaGlu: 5.874 ± 0.453
3.191AlaPhe: 3.191 ± 0.322
5.475AlaGly: 5.475 ± 0.497
1.885AlaHis: 1.885 ± 0.274
5.439AlaIle: 5.439 ± 0.462
5.221AlaLys: 5.221 ± 0.52
6.671AlaLeu: 6.671 ± 0.664
2.828AlaMet: 2.828 ± 0.328
3.227AlaAsn: 3.227 ± 0.484
2.103AlaPro: 2.103 ± 0.304
3.082AlaGln: 3.082 ± 0.372
4.786AlaArg: 4.786 ± 0.413
3.698AlaSer: 3.698 ± 0.358
4.931AlaThr: 4.931 ± 0.539
5.547AlaVal: 5.547 ± 0.495
1.559AlaTrp: 1.559 ± 0.249
2.901AlaTyr: 2.901 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
0.979CysAla: 0.979 ± 0.244
0.29CysCys: 0.29 ± 0.117
1.015CysAsp: 1.015 ± 0.228
1.196CysGlu: 1.196 ± 0.23
0.435CysPhe: 0.435 ± 0.124
1.124CysGly: 1.124 ± 0.226
0.363CysHis: 0.363 ± 0.129
0.725CysIle: 0.725 ± 0.169
0.979CysLys: 0.979 ± 0.183
0.725CysLeu: 0.725 ± 0.144
0.616CysMet: 0.616 ± 0.152
0.979CysAsn: 0.979 ± 0.184
0.471CysPro: 0.471 ± 0.134
0.471CysGln: 0.471 ± 0.111
0.979CysArg: 0.979 ± 0.231
0.761CysSer: 0.761 ± 0.138
0.29CysThr: 0.29 ± 0.116
0.87CysVal: 0.87 ± 0.183
0.29CysTrp: 0.29 ± 0.093
0.689CysTyr: 0.689 ± 0.149
0.0CysXaa: 0.0 ± 0.0
Asp
4.46AspAla: 4.46 ± 0.416
1.015AspCys: 1.015 ± 0.216
3.408AspAsp: 3.408 ± 0.303
4.025AspGlu: 4.025 ± 0.427
2.756AspPhe: 2.756 ± 0.31
4.605AspGly: 4.605 ± 0.371
1.269AspHis: 1.269 ± 0.234
3.879AspIle: 3.879 ± 0.394
3.372AspLys: 3.372 ± 0.313
5.366AspLeu: 5.366 ± 0.467
1.922AspMet: 1.922 ± 0.261
2.61AspAsn: 2.61 ± 0.307
2.864AspPro: 2.864 ± 0.32
1.777AspGln: 1.777 ± 0.284
3.771AspArg: 3.771 ± 0.321
3.009AspSer: 3.009 ± 0.394
3.227AspThr: 3.227 ± 0.332
3.988AspVal: 3.988 ± 0.405
1.487AspTrp: 1.487 ± 0.249
2.574AspTyr: 2.574 ± 0.307
0.0AspXaa: 0.0 ± 0.0
Glu
7.034GluAla: 7.034 ± 0.581
1.088GluCys: 1.088 ± 0.191
5.076GluAsp: 5.076 ± 0.418
7.759GluGlu: 7.759 ± 0.67
3.154GluPhe: 3.154 ± 0.304
5.475GluGly: 5.475 ± 0.421
1.269GluHis: 1.269 ± 0.211
4.17GluIle: 4.17 ± 0.352
3.952GluLys: 3.952 ± 0.442
6.744GluLeu: 6.744 ± 0.552
2.067GluMet: 2.067 ± 0.269
2.756GluAsn: 2.756 ± 0.29
1.777GluPro: 1.777 ± 0.267
2.792GluGln: 2.792 ± 0.296
4.242GluArg: 4.242 ± 0.362
3.444GluSer: 3.444 ± 0.323
3.698GluThr: 3.698 ± 0.317
5.475GluVal: 5.475 ± 0.523
1.74GluTrp: 1.74 ± 0.247
3.481GluTyr: 3.481 ± 0.326
0.0GluXaa: 0.0 ± 0.0
Phe
2.864PheAla: 2.864 ± 0.282
0.58PheCys: 0.58 ± 0.13
3.009PheAsp: 3.009 ± 0.39
3.336PheGlu: 3.336 ± 0.315
1.777PhePhe: 1.777 ± 0.276
2.61PheGly: 2.61 ± 0.298
0.689PheHis: 0.689 ± 0.142
1.885PheIle: 1.885 ± 0.284
3.227PheLys: 3.227 ± 0.298
2.864PheLeu: 2.864 ± 0.255
1.088PheMet: 1.088 ± 0.189
2.212PheAsn: 2.212 ± 0.263
1.487PhePro: 1.487 ± 0.165
1.342PheGln: 1.342 ± 0.207
2.139PheArg: 2.139 ± 0.235
2.756PheSer: 2.756 ± 0.299
2.719PheThr: 2.719 ± 0.354
2.864PheVal: 2.864 ± 0.329
0.761PheTrp: 0.761 ± 0.152
1.414PheTyr: 1.414 ± 0.203
0.0PheXaa: 0.0 ± 0.0
Gly
4.423GlyAla: 4.423 ± 0.402
1.196GlyCys: 1.196 ± 0.222
4.278GlyAsp: 4.278 ± 0.384
4.786GlyGlu: 4.786 ± 0.46
3.626GlyPhe: 3.626 ± 0.351
5.366GlyGly: 5.366 ± 0.609
1.196GlyHis: 1.196 ± 0.23
3.553GlyIle: 3.553 ± 0.437
4.315GlyLys: 4.315 ± 0.473
5.185GlyLeu: 5.185 ± 0.509
2.212GlyMet: 2.212 ± 0.303
3.118GlyAsn: 3.118 ± 0.443
1.559GlyPro: 1.559 ± 0.254
2.937GlyGln: 2.937 ± 0.308
3.879GlyArg: 3.879 ± 0.331
4.496GlySer: 4.496 ± 0.533
3.553GlyThr: 3.553 ± 0.412
5.402GlyVal: 5.402 ± 0.394
1.487GlyTrp: 1.487 ± 0.223
3.444GlyTyr: 3.444 ± 0.37
0.0GlyXaa: 0.0 ± 0.0
His
1.523HisAla: 1.523 ± 0.253
0.399HisCys: 0.399 ± 0.122
1.196HisAsp: 1.196 ± 0.206
0.834HisGlu: 0.834 ± 0.189
0.87HisPhe: 0.87 ± 0.22
1.523HisGly: 1.523 ± 0.236
0.508HisHis: 0.508 ± 0.145
1.233HisIle: 1.233 ± 0.234
1.233HisLys: 1.233 ± 0.24
1.523HisLeu: 1.523 ± 0.248
0.653HisMet: 0.653 ± 0.137
0.761HisAsn: 0.761 ± 0.183
0.87HisPro: 0.87 ± 0.186
0.616HisGln: 0.616 ± 0.134
1.015HisArg: 1.015 ± 0.203
1.051HisSer: 1.051 ± 0.187
1.196HisThr: 1.196 ± 0.237
1.124HisVal: 1.124 ± 0.183
0.399HisTrp: 0.399 ± 0.125
0.689HisTyr: 0.689 ± 0.152
0.0HisXaa: 0.0 ± 0.0
Ile
4.46IleAla: 4.46 ± 0.408
0.471IleCys: 0.471 ± 0.125
4.097IleAsp: 4.097 ± 0.367
4.097IleGlu: 4.097 ± 0.399
1.559IlePhe: 1.559 ± 0.212
3.517IleGly: 3.517 ± 0.321
1.559IleHis: 1.559 ± 0.251
2.393IleIle: 2.393 ± 0.282
3.988IleLys: 3.988 ± 0.38
4.46IleLeu: 4.46 ± 0.4
1.414IleMet: 1.414 ± 0.239
2.683IleAsn: 2.683 ± 0.345
2.574IlePro: 2.574 ± 0.321
2.03IleGln: 2.03 ± 0.273
3.662IleArg: 3.662 ± 0.462
3.481IleSer: 3.481 ± 0.378
2.683IleThr: 2.683 ± 0.326
4.025IleVal: 4.025 ± 0.346
0.798IleTrp: 0.798 ± 0.159
1.487IleTyr: 1.487 ± 0.218
0.0IleXaa: 0.0 ± 0.0
Lys
6.019LysAla: 6.019 ± 0.548
0.834LysCys: 0.834 ± 0.173
3.916LysAsp: 3.916 ± 0.414
5.765LysGlu: 5.765 ± 0.565
1.958LysPhe: 1.958 ± 0.284
4.496LysGly: 4.496 ± 0.424
1.124LysHis: 1.124 ± 0.225
3.263LysIle: 3.263 ± 0.362
3.734LysLys: 3.734 ± 0.342
5.148LysLeu: 5.148 ± 0.437
2.03LysMet: 2.03 ± 0.231
2.212LysAsn: 2.212 ± 0.325
1.994LysPro: 1.994 ± 0.269
2.248LysGln: 2.248 ± 0.277
3.336LysArg: 3.336 ± 0.466
3.734LysSer: 3.734 ± 0.467
3.082LysThr: 3.082 ± 0.327
4.206LysVal: 4.206 ± 0.358
1.196LysTrp: 1.196 ± 0.245
2.248LysTyr: 2.248 ± 0.248
0.0LysXaa: 0.0 ± 0.0
Leu
5.874LeuAla: 5.874 ± 0.526
1.051LeuCys: 1.051 ± 0.222
5.729LeuAsp: 5.729 ± 0.434
6.019LeuGlu: 6.019 ± 0.465
2.647LeuPhe: 2.647 ± 0.328
5.439LeuGly: 5.439 ± 0.456
1.632LeuHis: 1.632 ± 0.249
4.351LeuIle: 4.351 ± 0.441
5.692LeuLys: 5.692 ± 0.563
5.148LeuLeu: 5.148 ± 0.5
2.502LeuMet: 2.502 ± 0.255
3.118LeuAsn: 3.118 ± 0.432
3.988LeuPro: 3.988 ± 0.35
2.719LeuGln: 2.719 ± 0.275
4.387LeuArg: 4.387 ± 0.421
5.293LeuSer: 5.293 ± 0.526
4.677LeuThr: 4.677 ± 0.445
4.713LeuVal: 4.713 ± 0.424
1.233LeuTrp: 1.233 ± 0.201
2.864LeuTyr: 2.864 ± 0.372
0.0LeuXaa: 0.0 ± 0.0
Met
3.408MetAla: 3.408 ± 0.331
0.616MetCys: 0.616 ± 0.159
1.45MetAsp: 1.45 ± 0.259
2.792MetGlu: 2.792 ± 0.305
1.015MetPhe: 1.015 ± 0.192
2.03MetGly: 2.03 ± 0.272
0.218MetHis: 0.218 ± 0.086
1.269MetIle: 1.269 ± 0.177
2.284MetLys: 2.284 ± 0.255
1.813MetLeu: 1.813 ± 0.259
1.233MetMet: 1.233 ± 0.237
1.487MetAsn: 1.487 ± 0.256
1.015MetPro: 1.015 ± 0.209
1.559MetGln: 1.559 ± 0.27
1.523MetArg: 1.523 ± 0.239
1.922MetSer: 1.922 ± 0.252
2.139MetThr: 2.139 ± 0.246
1.414MetVal: 1.414 ± 0.22
0.363MetTrp: 0.363 ± 0.102
0.979MetTyr: 0.979 ± 0.159
0.0MetXaa: 0.0 ± 0.0
Asn
3.263AsnAla: 3.263 ± 0.509
0.326AsnCys: 0.326 ± 0.104
2.212AsnAsp: 2.212 ± 0.268
2.357AsnGlu: 2.357 ± 0.283
1.885AsnPhe: 1.885 ± 0.267
3.517AsnGly: 3.517 ± 0.387
1.051AsnHis: 1.051 ± 0.217
2.828AsnIle: 2.828 ± 0.335
2.756AsnLys: 2.756 ± 0.305
3.698AsnLeu: 3.698 ± 0.52
1.124AsnMet: 1.124 ± 0.186
1.922AsnAsn: 1.922 ± 0.312
2.103AsnPro: 2.103 ± 0.29
1.269AsnGln: 1.269 ± 0.22
1.885AsnArg: 1.885 ± 0.258
2.683AsnSer: 2.683 ± 0.334
2.175AsnThr: 2.175 ± 0.296
3.009AsnVal: 3.009 ± 0.314
0.761AsnTrp: 0.761 ± 0.177
1.305AsnTyr: 1.305 ± 0.18
0.0AsnXaa: 0.0 ± 0.0
Pro
3.191ProAla: 3.191 ± 0.356
0.471ProCys: 0.471 ± 0.151
2.248ProAsp: 2.248 ± 0.337
4.025ProGlu: 4.025 ± 0.342
1.632ProPhe: 1.632 ± 0.242
2.502ProGly: 2.502 ± 0.311
0.87ProHis: 0.87 ± 0.192
1.668ProIle: 1.668 ± 0.272
1.885ProLys: 1.885 ± 0.242
2.502ProLeu: 2.502 ± 0.292
0.943ProMet: 0.943 ± 0.195
1.305ProAsn: 1.305 ± 0.262
1.051ProPro: 1.051 ± 0.178
0.906ProGln: 0.906 ± 0.181
1.378ProArg: 1.378 ± 0.214
2.103ProSer: 2.103 ± 0.271
2.393ProThr: 2.393 ± 0.314
3.444ProVal: 3.444 ± 0.345
0.435ProTrp: 0.435 ± 0.12
1.269ProTyr: 1.269 ± 0.205
0.0ProXaa: 0.0 ± 0.0
Gln
3.299GlnAla: 3.299 ± 0.355
0.58GlnCys: 0.58 ± 0.159
1.74GlnAsp: 1.74 ± 0.226
2.502GlnGlu: 2.502 ± 0.301
1.813GlnPhe: 1.813 ± 0.247
2.067GlnGly: 2.067 ± 0.351
0.653GlnHis: 0.653 ± 0.159
2.248GlnIle: 2.248 ± 0.309
1.559GlnLys: 1.559 ± 0.242
3.227GlnLeu: 3.227 ± 0.383
1.487GlnMet: 1.487 ± 0.285
1.088GlnAsn: 1.088 ± 0.187
0.906GlnPro: 0.906 ± 0.232
0.943GlnGln: 0.943 ± 0.179
1.994GlnArg: 1.994 ± 0.228
2.03GlnSer: 2.03 ± 0.279
1.813GlnThr: 1.813 ± 0.227
2.756GlnVal: 2.756 ± 0.337
0.689GlnTrp: 0.689 ± 0.152
1.595GlnTyr: 1.595 ± 0.218
0.0GlnXaa: 0.0 ± 0.0
Arg
4.605ArgAla: 4.605 ± 0.413
0.725ArgCys: 0.725 ± 0.179
3.227ArgAsp: 3.227 ± 0.328
4.097ArgGlu: 4.097 ± 0.378
2.465ArgPhe: 2.465 ± 0.374
3.662ArgGly: 3.662 ± 0.383
1.051ArgHis: 1.051 ± 0.201
2.937ArgIle: 2.937 ± 0.281
3.988ArgLys: 3.988 ± 0.371
5.076ArgLeu: 5.076 ± 0.449
1.487ArgMet: 1.487 ± 0.231
2.139ArgAsn: 2.139 ± 0.25
2.103ArgPro: 2.103 ± 0.278
2.175ArgGln: 2.175 ± 0.275
3.299ArgArg: 3.299 ± 0.361
3.082ArgSer: 3.082 ± 0.31
2.538ArgThr: 2.538 ± 0.267
4.025ArgVal: 4.025 ± 0.292
1.196ArgTrp: 1.196 ± 0.236
1.849ArgTyr: 1.849 ± 0.271
0.0ArgXaa: 0.0 ± 0.0
Ser
3.988SerAla: 3.988 ± 0.511
0.653SerCys: 0.653 ± 0.186
3.481SerAsp: 3.481 ± 0.387
3.843SerGlu: 3.843 ± 0.399
2.538SerPhe: 2.538 ± 0.263
3.952SerGly: 3.952 ± 0.403
0.725SerHis: 0.725 ± 0.159
2.937SerIle: 2.937 ± 0.286
3.263SerLys: 3.263 ± 0.33
4.786SerLeu: 4.786 ± 0.415
1.74SerMet: 1.74 ± 0.24
2.61SerAsn: 2.61 ± 0.373
2.248SerPro: 2.248 ± 0.316
1.885SerGln: 1.885 ± 0.323
3.626SerArg: 3.626 ± 0.376
2.901SerSer: 2.901 ± 0.382
3.009SerThr: 3.009 ± 0.324
4.895SerVal: 4.895 ± 0.396
1.015SerTrp: 1.015 ± 0.163
2.212SerTyr: 2.212 ± 0.235
0.0SerXaa: 0.0 ± 0.0
Thr
4.387ThrAla: 4.387 ± 0.5
0.761ThrCys: 0.761 ± 0.158
2.647ThrAsp: 2.647 ± 0.334
4.133ThrGlu: 4.133 ± 0.362
2.574ThrPhe: 2.574 ± 0.26
4.677ThrGly: 4.677 ± 0.466
1.088ThrHis: 1.088 ± 0.202
3.553ThrIle: 3.553 ± 0.269
2.61ThrLys: 2.61 ± 0.312
5.076ThrLeu: 5.076 ± 0.439
1.051ThrMet: 1.051 ± 0.224
2.175ThrAsn: 2.175 ± 0.273
2.139ThrPro: 2.139 ± 0.262
1.885ThrGln: 1.885 ± 0.286
2.792ThrArg: 2.792 ± 0.3
2.756ThrSer: 2.756 ± 0.327
2.719ThrThr: 2.719 ± 0.315
4.641ThrVal: 4.641 ± 0.507
0.761ThrTrp: 0.761 ± 0.194
2.175ThrTyr: 2.175 ± 0.264
0.0ThrXaa: 0.0 ± 0.0
Val
5.982ValAla: 5.982 ± 0.473
0.798ValCys: 0.798 ± 0.224
4.75ValAsp: 4.75 ± 0.36
5.656ValGlu: 5.656 ± 0.432
3.227ValPhe: 3.227 ± 0.288
4.242ValGly: 4.242 ± 0.407
1.196ValHis: 1.196 ± 0.207
3.916ValIle: 3.916 ± 0.39
4.641ValLys: 4.641 ± 0.414
5.475ValLeu: 5.475 ± 0.458
2.32ValMet: 2.32 ± 0.341
3.009ValAsn: 3.009 ± 0.321
2.937ValPro: 2.937 ± 0.415
2.284ValGln: 2.284 ± 0.285
3.879ValArg: 3.879 ± 0.311
3.843ValSer: 3.843 ± 0.352
4.17ValThr: 4.17 ± 0.367
5.801ValVal: 5.801 ± 0.622
1.414ValTrp: 1.414 ± 0.285
2.647ValTyr: 2.647 ± 0.313
0.0ValXaa: 0.0 ± 0.0
Trp
1.088TrpAla: 1.088 ± 0.192
0.471TrpCys: 0.471 ± 0.135
1.269TrpAsp: 1.269 ± 0.191
1.595TrpGlu: 1.595 ± 0.243
0.689TrpPhe: 0.689 ± 0.15
0.943TrpGly: 0.943 ± 0.211
0.544TrpHis: 0.544 ± 0.122
0.906TrpIle: 0.906 ± 0.169
1.196TrpLys: 1.196 ± 0.208
1.342TrpLeu: 1.342 ± 0.211
0.689TrpMet: 0.689 ± 0.159
0.725TrpAsn: 0.725 ± 0.139
0.58TrpPro: 0.58 ± 0.157
0.544TrpGln: 0.544 ± 0.154
1.16TrpArg: 1.16 ± 0.198
1.051TrpSer: 1.051 ± 0.22
1.051TrpThr: 1.051 ± 0.211
1.487TrpVal: 1.487 ± 0.27
0.29TrpTrp: 0.29 ± 0.096
0.653TrpTyr: 0.653 ± 0.148
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.046TyrAla: 3.046 ± 0.35
0.689TyrCys: 0.689 ± 0.189
2.393TyrAsp: 2.393 ± 0.265
2.502TyrGlu: 2.502 ± 0.411
1.813TyrPhe: 1.813 ± 0.212
2.792TyrGly: 2.792 ± 0.338
0.326TyrHis: 0.326 ± 0.115
2.284TyrIle: 2.284 ± 0.315
2.647TyrLys: 2.647 ± 0.368
2.248TyrLeu: 2.248 ± 0.275
1.088TyrMet: 1.088 ± 0.195
1.994TyrAsn: 1.994 ± 0.238
1.45TyrPro: 1.45 ± 0.264
1.559TyrGln: 1.559 ± 0.267
1.994TyrArg: 1.994 ± 0.242
2.212TyrSer: 2.212 ± 0.324
2.538TyrThr: 2.538 ± 0.336
2.502TyrVal: 2.502 ± 0.27
0.399TyrTrp: 0.399 ± 0.115
1.378TyrTyr: 1.378 ± 0.23
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 173 proteins (27582 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski