Amino acid dipepetide frequency for Ralstonia phage RSY1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.364AlaAla: 20.364 ± 2.039
0.805AlaCys: 0.805 ± 0.254
9.015AlaAsp: 9.015 ± 1.205
6.359AlaGlu: 6.359 ± 0.623
3.059AlaPhe: 3.059 ± 0.483
11.59AlaGly: 11.59 ± 1.108
2.415AlaHis: 2.415 ± 0.441
4.588AlaIle: 4.588 ± 0.57
4.346AlaLys: 4.346 ± 0.789
11.188AlaLeu: 11.188 ± 0.953
3.139AlaMet: 3.139 ± 0.563
3.863AlaAsn: 3.863 ± 0.711
4.024AlaPro: 4.024 ± 0.632
5.473AlaGln: 5.473 ± 0.675
9.578AlaArg: 9.578 ± 0.684
7.486AlaSer: 7.486 ± 0.74
6.6AlaThr: 6.6 ± 0.738
8.451AlaVal: 8.451 ± 0.992
2.093AlaTrp: 2.093 ± 0.436
3.461AlaTyr: 3.461 ± 0.487
0.0AlaXaa: 0.0 ± 0.0
Cys
1.127CysAla: 1.127 ± 0.285
0.08CysCys: 0.08 ± 0.09
0.402CysAsp: 0.402 ± 0.168
0.724CysGlu: 0.724 ± 0.293
0.161CysPhe: 0.161 ± 0.105
0.805CysGly: 0.805 ± 0.248
0.0CysHis: 0.0 ± 0.0
0.241CysIle: 0.241 ± 0.149
0.0CysLys: 0.0 ± 0.0
0.644CysLeu: 0.644 ± 0.295
0.402CysMet: 0.402 ± 0.166
0.322CysAsn: 0.322 ± 0.198
0.966CysPro: 0.966 ± 0.333
0.402CysGln: 0.402 ± 0.185
0.483CysArg: 0.483 ± 0.218
0.644CysSer: 0.644 ± 0.188
0.563CysThr: 0.563 ± 0.226
0.885CysVal: 0.885 ± 0.352
0.161CysTrp: 0.161 ± 0.103
0.322CysTyr: 0.322 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
9.176AspAla: 9.176 ± 0.964
0.644AspCys: 0.644 ± 0.215
2.978AspAsp: 2.978 ± 0.467
2.898AspGlu: 2.898 ± 0.439
1.449AspPhe: 1.449 ± 0.378
5.393AspGly: 5.393 ± 0.613
0.966AspHis: 0.966 ± 0.345
2.737AspIle: 2.737 ± 0.47
1.529AspLys: 1.529 ± 0.315
4.185AspLeu: 4.185 ± 0.479
1.207AspMet: 1.207 ± 0.294
1.771AspAsn: 1.771 ± 0.485
3.944AspPro: 3.944 ± 0.572
2.012AspGln: 2.012 ± 0.356
4.507AspArg: 4.507 ± 0.582
1.771AspSer: 1.771 ± 0.435
4.266AspThr: 4.266 ± 0.555
3.622AspVal: 3.622 ± 0.457
1.046AspTrp: 1.046 ± 0.334
1.851AspTyr: 1.851 ± 0.473
0.0AspXaa: 0.0 ± 0.0
Glu
6.278GluAla: 6.278 ± 0.627
0.805GluCys: 0.805 ± 0.284
2.334GluAsp: 2.334 ± 0.454
2.334GluGlu: 2.334 ± 0.406
1.932GluPhe: 1.932 ± 0.398
4.024GluGly: 4.024 ± 0.47
1.288GluHis: 1.288 ± 0.33
1.61GluIle: 1.61 ± 0.461
2.093GluLys: 2.093 ± 0.326
6.922GluLeu: 6.922 ± 0.768
1.288GluMet: 1.288 ± 0.373
1.851GluAsn: 1.851 ± 0.368
2.898GluPro: 2.898 ± 0.41
2.334GluGln: 2.334 ± 0.495
4.91GluArg: 4.91 ± 0.736
2.334GluSer: 2.334 ± 0.417
2.334GluThr: 2.334 ± 0.402
4.346GluVal: 4.346 ± 0.632
0.805GluTrp: 0.805 ± 0.225
1.449GluTyr: 1.449 ± 0.255
0.0GluXaa: 0.0 ± 0.0
Phe
4.024PheAla: 4.024 ± 0.593
0.402PheCys: 0.402 ± 0.16
2.334PheAsp: 2.334 ± 0.519
1.288PheGlu: 1.288 ± 0.276
0.966PhePhe: 0.966 ± 0.244
2.576PheGly: 2.576 ± 0.544
0.483PheHis: 0.483 ± 0.146
1.288PheIle: 1.288 ± 0.333
1.449PheLys: 1.449 ± 0.366
2.012PheLeu: 2.012 ± 0.435
0.644PheMet: 0.644 ± 0.241
0.885PheAsn: 0.885 ± 0.26
1.932PhePro: 1.932 ± 0.383
1.207PheGln: 1.207 ± 0.336
3.059PheArg: 3.059 ± 0.534
2.093PheSer: 2.093 ± 0.422
2.093PheThr: 2.093 ± 0.357
2.012PheVal: 2.012 ± 0.471
0.724PheTrp: 0.724 ± 0.261
0.724PheTyr: 0.724 ± 0.231
0.0PheXaa: 0.0 ± 0.0
Gly
7.325GlyAla: 7.325 ± 0.947
0.724GlyCys: 0.724 ± 0.238
4.427GlyAsp: 4.427 ± 0.569
4.829GlyGlu: 4.829 ± 0.685
2.334GlyPhe: 2.334 ± 0.399
6.6GlyGly: 6.6 ± 0.861
2.173GlyHis: 2.173 ± 0.772
3.783GlyIle: 3.783 ± 0.598
3.381GlyLys: 3.381 ± 0.501
6.278GlyLeu: 6.278 ± 0.943
1.771GlyMet: 1.771 ± 0.364
3.059GlyAsn: 3.059 ± 0.518
2.737GlyPro: 2.737 ± 0.386
3.059GlyGln: 3.059 ± 0.42
6.842GlyArg: 6.842 ± 0.914
3.783GlySer: 3.783 ± 0.545
4.99GlyThr: 4.99 ± 0.79
5.795GlyVal: 5.795 ± 0.722
2.012GlyTrp: 2.012 ± 0.39
2.415GlyTyr: 2.415 ± 0.355
0.0GlyXaa: 0.0 ± 0.0
His
2.817HisAla: 2.817 ± 0.561
0.322HisCys: 0.322 ± 0.168
1.046HisAsp: 1.046 ± 0.246
1.61HisGlu: 1.61 ± 0.298
0.644HisPhe: 0.644 ± 0.248
2.012HisGly: 2.012 ± 0.432
0.805HisHis: 0.805 ± 0.213
0.885HisIle: 0.885 ± 0.199
0.966HisLys: 0.966 ± 0.321
1.932HisLeu: 1.932 ± 0.34
0.563HisMet: 0.563 ± 0.213
0.563HisAsn: 0.563 ± 0.202
1.127HisPro: 1.127 ± 0.232
1.288HisGln: 1.288 ± 0.366
1.61HisArg: 1.61 ± 0.397
1.127HisSer: 1.127 ± 0.274
1.046HisThr: 1.046 ± 0.342
1.529HisVal: 1.529 ± 0.339
0.241HisTrp: 0.241 ± 0.12
0.402HisTyr: 0.402 ± 0.143
0.0HisXaa: 0.0 ± 0.0
Ile
5.151IleAla: 5.151 ± 0.541
0.161IleCys: 0.161 ± 0.119
3.944IleAsp: 3.944 ± 0.706
3.059IleGlu: 3.059 ± 0.591
0.885IlePhe: 0.885 ± 0.217
3.783IleGly: 3.783 ± 0.549
0.724IleHis: 0.724 ± 0.224
1.046IleIle: 1.046 ± 0.319
2.173IleLys: 2.173 ± 0.405
2.254IleLeu: 2.254 ± 0.462
0.563IleMet: 0.563 ± 0.182
1.851IleAsn: 1.851 ± 0.29
1.207IlePro: 1.207 ± 0.305
0.885IleGln: 0.885 ± 0.264
2.576IleArg: 2.576 ± 0.326
1.61IleSer: 1.61 ± 0.343
2.737IleThr: 2.737 ± 0.516
3.139IleVal: 3.139 ± 0.379
0.241IleTrp: 0.241 ± 0.179
1.046IleTyr: 1.046 ± 0.251
0.0IleXaa: 0.0 ± 0.0
Lys
5.232LysAla: 5.232 ± 0.7
0.402LysCys: 0.402 ± 0.206
2.012LysAsp: 2.012 ± 0.407
2.334LysGlu: 2.334 ± 0.486
0.885LysPhe: 0.885 ± 0.286
3.22LysGly: 3.22 ± 0.452
0.724LysHis: 0.724 ± 0.251
1.127LysIle: 1.127 ± 0.343
2.173LysLys: 2.173 ± 0.579
3.3LysLeu: 3.3 ± 0.585
0.644LysMet: 0.644 ± 0.189
1.207LysAsn: 1.207 ± 0.385
2.334LysPro: 2.334 ± 0.488
2.415LysGln: 2.415 ± 0.553
3.22LysArg: 3.22 ± 0.455
1.207LysSer: 1.207 ± 0.321
2.334LysThr: 2.334 ± 0.357
3.059LysVal: 3.059 ± 0.579
0.241LysTrp: 0.241 ± 0.148
0.483LysTyr: 0.483 ± 0.206
0.0LysXaa: 0.0 ± 0.0
Leu
12.637LeuAla: 12.637 ± 0.996
0.966LeuCys: 0.966 ± 0.343
4.91LeuAsp: 4.91 ± 0.765
5.473LeuGlu: 5.473 ± 0.704
3.944LeuPhe: 3.944 ± 0.592
5.151LeuGly: 5.151 ± 0.679
1.932LeuHis: 1.932 ± 0.456
2.978LeuIle: 2.978 ± 0.414
4.266LeuLys: 4.266 ± 0.59
6.761LeuLeu: 6.761 ± 0.797
1.851LeuMet: 1.851 ± 0.313
2.656LeuAsn: 2.656 ± 0.331
4.668LeuPro: 4.668 ± 0.599
2.898LeuGln: 2.898 ± 0.471
6.761LeuArg: 6.761 ± 0.668
4.829LeuSer: 4.829 ± 0.756
5.554LeuThr: 5.554 ± 0.817
6.681LeuVal: 6.681 ± 0.797
1.449LeuTrp: 1.449 ± 0.313
2.334LeuTyr: 2.334 ± 0.395
0.0LeuXaa: 0.0 ± 0.0
Met
2.978MetAla: 2.978 ± 0.453
0.161MetCys: 0.161 ± 0.104
0.563MetAsp: 0.563 ± 0.235
1.449MetGlu: 1.449 ± 0.38
0.402MetPhe: 0.402 ± 0.192
1.368MetGly: 1.368 ± 0.338
0.563MetHis: 0.563 ± 0.21
0.483MetIle: 0.483 ± 0.198
1.288MetLys: 1.288 ± 0.408
2.334MetLeu: 2.334 ± 0.375
0.644MetMet: 0.644 ± 0.226
0.885MetAsn: 0.885 ± 0.265
1.127MetPro: 1.127 ± 0.235
0.644MetGln: 0.644 ± 0.239
1.61MetArg: 1.61 ± 0.352
1.046MetSer: 1.046 ± 0.234
1.449MetThr: 1.449 ± 0.401
1.69MetVal: 1.69 ± 0.271
0.241MetTrp: 0.241 ± 0.169
0.644MetTyr: 0.644 ± 0.196
0.0MetXaa: 0.0 ± 0.0
Asn
4.185AsnAla: 4.185 ± 0.621
0.241AsnCys: 0.241 ± 0.123
2.254AsnAsp: 2.254 ± 0.439
1.368AsnGlu: 1.368 ± 0.303
0.805AsnPhe: 0.805 ± 0.229
2.737AsnGly: 2.737 ± 0.364
0.644AsnHis: 0.644 ± 0.2
1.449AsnIle: 1.449 ± 0.328
1.127AsnLys: 1.127 ± 0.274
3.22AsnLeu: 3.22 ± 0.415
0.563AsnMet: 0.563 ± 0.215
1.046AsnAsn: 1.046 ± 0.322
2.978AsnPro: 2.978 ± 0.459
1.207AsnGln: 1.207 ± 0.248
1.851AsnArg: 1.851 ± 0.345
2.254AsnSer: 2.254 ± 0.548
1.851AsnThr: 1.851 ± 0.373
1.771AsnVal: 1.771 ± 0.409
0.322AsnTrp: 0.322 ± 0.152
1.046AsnTyr: 1.046 ± 0.259
0.0AsnXaa: 0.0 ± 0.0
Pro
5.312ProAla: 5.312 ± 0.913
0.563ProCys: 0.563 ± 0.215
3.542ProAsp: 3.542 ± 0.65
3.059ProGlu: 3.059 ± 0.377
1.288ProPhe: 1.288 ± 0.361
4.024ProGly: 4.024 ± 0.624
1.288ProHis: 1.288 ± 0.338
2.415ProIle: 2.415 ± 0.369
1.932ProLys: 1.932 ± 0.413
5.071ProLeu: 5.071 ± 0.677
0.805ProMet: 0.805 ± 0.296
1.529ProAsn: 1.529 ± 0.353
2.656ProPro: 2.656 ± 0.416
1.529ProGln: 1.529 ± 0.257
2.576ProArg: 2.576 ± 0.49
3.139ProSer: 3.139 ± 0.493
2.093ProThr: 2.093 ± 0.357
3.863ProVal: 3.863 ± 0.514
1.207ProTrp: 1.207 ± 0.254
1.207ProTyr: 1.207 ± 0.306
0.0ProXaa: 0.0 ± 0.0
Gln
5.634GlnAla: 5.634 ± 0.551
0.644GlnCys: 0.644 ± 0.261
1.449GlnAsp: 1.449 ± 0.375
1.851GlnGlu: 1.851 ± 0.542
1.771GlnPhe: 1.771 ± 0.353
2.334GlnGly: 2.334 ± 0.559
1.288GlnHis: 1.288 ± 0.419
2.173GlnIle: 2.173 ± 0.341
1.127GlnLys: 1.127 ± 0.351
4.829GlnLeu: 4.829 ± 0.659
0.805GlnMet: 0.805 ± 0.24
1.288GlnAsn: 1.288 ± 0.302
1.69GlnPro: 1.69 ± 0.39
2.817GlnGln: 2.817 ± 0.598
4.105GlnArg: 4.105 ± 0.555
1.529GlnSer: 1.529 ± 0.305
2.173GlnThr: 2.173 ± 0.374
2.817GlnVal: 2.817 ± 0.427
0.724GlnTrp: 0.724 ± 0.234
1.046GlnTyr: 1.046 ± 0.282
0.0GlnXaa: 0.0 ± 0.0
Arg
8.612ArgAla: 8.612 ± 0.892
0.402ArgCys: 0.402 ± 0.164
4.105ArgAsp: 4.105 ± 0.575
4.185ArgGlu: 4.185 ± 0.533
2.415ArgPhe: 2.415 ± 0.367
6.278ArgGly: 6.278 ± 0.728
1.449ArgHis: 1.449 ± 0.313
3.863ArgIle: 3.863 ± 0.688
2.495ArgLys: 2.495 ± 0.379
6.681ArgLeu: 6.681 ± 0.673
1.932ArgMet: 1.932 ± 0.381
2.576ArgAsn: 2.576 ± 0.465
4.185ArgPro: 4.185 ± 0.626
4.105ArgGln: 4.105 ± 0.483
7.003ArgArg: 7.003 ± 1.05
4.105ArgSer: 4.105 ± 0.513
4.346ArgThr: 4.346 ± 0.648
6.359ArgVal: 6.359 ± 0.677
1.046ArgTrp: 1.046 ± 0.3
2.495ArgTyr: 2.495 ± 0.4
0.0ArgXaa: 0.0 ± 0.0
Ser
6.037SerAla: 6.037 ± 0.702
0.322SerCys: 0.322 ± 0.185
2.898SerAsp: 2.898 ± 0.495
2.817SerGlu: 2.817 ± 0.431
2.334SerPhe: 2.334 ± 0.38
4.266SerGly: 4.266 ± 0.676
1.288SerHis: 1.288 ± 0.404
1.851SerIle: 1.851 ± 0.36
1.288SerLys: 1.288 ± 0.369
4.185SerLeu: 4.185 ± 0.513
1.288SerMet: 1.288 ± 0.352
2.334SerAsn: 2.334 ± 0.546
2.495SerPro: 2.495 ± 0.508
1.449SerGln: 1.449 ± 0.296
3.783SerArg: 3.783 ± 0.472
3.22SerSer: 3.22 ± 0.347
2.737SerThr: 2.737 ± 0.53
4.185SerVal: 4.185 ± 0.558
0.966SerTrp: 0.966 ± 0.237
1.046SerTyr: 1.046 ± 0.287
0.0SerXaa: 0.0 ± 0.0
Thr
8.29ThrAla: 8.29 ± 0.791
0.724ThrCys: 0.724 ± 0.22
3.139ThrAsp: 3.139 ± 0.529
2.576ThrGlu: 2.576 ± 0.489
1.932ThrPhe: 1.932 ± 0.448
5.393ThrGly: 5.393 ± 0.645
1.288ThrHis: 1.288 ± 0.311
2.817ThrIle: 2.817 ± 0.521
1.851ThrLys: 1.851 ± 0.41
5.393ThrLeu: 5.393 ± 0.678
1.046ThrMet: 1.046 ± 0.256
1.368ThrAsn: 1.368 ± 0.383
2.254ThrPro: 2.254 ± 0.406
2.173ThrGln: 2.173 ± 0.412
3.3ThrArg: 3.3 ± 0.562
3.059ThrSer: 3.059 ± 0.451
3.3ThrThr: 3.3 ± 0.603
5.232ThrVal: 5.232 ± 0.895
0.724ThrTrp: 0.724 ± 0.251
1.69ThrTyr: 1.69 ± 0.313
0.0ThrXaa: 0.0 ± 0.0
Val
8.371ValAla: 8.371 ± 0.858
0.563ValCys: 0.563 ± 0.229
4.266ValAsp: 4.266 ± 0.594
3.783ValGlu: 3.783 ± 0.528
2.898ValPhe: 2.898 ± 0.563
4.266ValGly: 4.266 ± 0.736
2.093ValHis: 2.093 ± 0.373
2.254ValIle: 2.254 ± 0.323
3.703ValLys: 3.703 ± 0.574
7.244ValLeu: 7.244 ± 0.848
1.368ValMet: 1.368 ± 0.318
2.576ValAsn: 2.576 ± 0.533
4.024ValPro: 4.024 ± 0.621
3.139ValGln: 3.139 ± 0.628
6.681ValArg: 6.681 ± 0.937
3.542ValSer: 3.542 ± 0.392
5.151ValThr: 5.151 ± 0.755
6.439ValVal: 6.439 ± 0.891
1.207ValTrp: 1.207 ± 0.327
1.529ValTyr: 1.529 ± 0.56
0.0ValXaa: 0.0 ± 0.0
Trp
1.851TrpAla: 1.851 ± 0.316
0.161TrpCys: 0.161 ± 0.139
1.046TrpAsp: 1.046 ± 0.23
0.644TrpGlu: 0.644 ± 0.233
0.805TrpPhe: 0.805 ± 0.264
0.563TrpGly: 0.563 ± 0.257
0.322TrpHis: 0.322 ± 0.195
0.483TrpIle: 0.483 ± 0.245
0.644TrpLys: 0.644 ± 0.211
1.771TrpLeu: 1.771 ± 0.304
0.483TrpMet: 0.483 ± 0.146
0.805TrpAsn: 0.805 ± 0.244
0.322TrpPro: 0.322 ± 0.169
1.288TrpGln: 1.288 ± 0.326
1.529TrpArg: 1.529 ± 0.288
1.368TrpSer: 1.368 ± 0.339
0.402TrpThr: 0.402 ± 0.202
1.046TrpVal: 1.046 ± 0.305
0.402TrpTrp: 0.402 ± 0.171
0.563TrpTyr: 0.563 ± 0.23
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.576TyrAla: 2.576 ± 0.536
0.241TyrCys: 0.241 ± 0.126
1.288TyrAsp: 1.288 ± 0.295
1.529TyrGlu: 1.529 ± 0.347
1.288TyrPhe: 1.288 ± 0.43
1.932TyrGly: 1.932 ± 0.414
0.805TyrHis: 0.805 ± 0.253
0.966TyrIle: 0.966 ± 0.261
0.724TyrLys: 0.724 ± 0.179
2.495TyrLeu: 2.495 ± 0.428
0.483TyrMet: 0.483 ± 0.196
0.483TyrAsn: 0.483 ± 0.164
1.529TyrPro: 1.529 ± 0.363
1.771TyrGln: 1.771 ± 0.382
2.656TyrArg: 2.656 ± 0.35
0.644TyrSer: 0.644 ± 0.239
1.529TyrThr: 1.529 ± 0.407
2.173TyrVal: 2.173 ± 0.498
0.644TyrTrp: 0.644 ± 0.266
1.288TyrTyr: 1.288 ± 0.321
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 49 proteins (12425 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski