Amino acid dipeptide frequency for Ralstonia phage DU_RP_I

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
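The calculation described above can be sketched in Python. This is only an illustrative sketch, not the code used by the database: the function names (`dipeptide_permille`, `classify`), the counting of overlapping residue pairs within each protein, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids as one-letter codes (Xaa is left out in this sketch).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over a list of sequences.

    Assumption: overlapping pairs are counted inside each protein,
    and no pair spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_permille, expected=1000.0 / 400):
    """Compare a frequency with the uniform expectation of 2.5 per mille."""
    if freq_permille > expected:
        return "over"   # would be marked in red
    if freq_permille < expected:
        return "under"  # would be marked in blue
    return "expected"


# Hypothetical toy input, not taken from the phage proteome.
toy = ["MAAAK", "MKAA"]
freqs = dipeptide_permille(toy)
print(round(freqs["AA"], 3), classify(freqs["AA"]))
```

With the values from the table, `classify(12.043)` (AlaAla) gives "over" and `classify(0.089)` (CysCys) gives "under".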

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions (for example, 12.043‰ = 0.012043).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.043 ± 1.258
AlaCys: 0.624 ± 0.281
AlaAsp: 5.62 ± 0.762
AlaGlu: 5.531 ± 0.903
AlaPhe: 2.855 ± 0.422
AlaGly: 8.118 ± 0.919
AlaHis: 1.16 ± 0.364
AlaIle: 4.46 ± 0.655
AlaLys: 6.155 ± 0.855
AlaLeu: 9.01 ± 0.985
AlaMet: 2.052 ± 0.567
AlaAsn: 5.442 ± 0.619
AlaPro: 3.033 ± 0.58
AlaGln: 4.103 ± 0.692
AlaArg: 4.817 ± 0.647
AlaSer: 6.334 ± 0.852
AlaThr: 4.996 ± 0.895
AlaVal: 6.601 ± 0.951
AlaTrp: 1.338 ± 0.328
AlaTyr: 2.944 ± 0.511
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.714 ± 0.304
CysCys: 0.089 ± 0.078
CysAsp: 0.357 ± 0.156
CysGlu: 0.624 ± 0.267
CysPhe: 0.446 ± 0.229
CysGly: 0.268 ± 0.155
CysHis: 0.089 ± 0.078
CysIle: 0.535 ± 0.23
CysLys: 0.714 ± 0.326
CysLeu: 0.535 ± 0.278
CysMet: 0.0 ± 0.0
CysAsn: 0.268 ± 0.195
CysPro: 0.624 ± 0.265
CysGln: 0.357 ± 0.212
CysArg: 0.535 ± 0.215
CysSer: 0.535 ± 0.204
CysThr: 0.268 ± 0.168
CysVal: 0.446 ± 0.196
CysTrp: 0.268 ± 0.157
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.174 ± 0.788
AspCys: 0.178 ± 0.127
AspAsp: 3.925 ± 0.618
AspGlu: 3.39 ± 0.544
AspPhe: 2.765 ± 0.505
AspGly: 5.442 ± 0.834
AspHis: 0.892 ± 0.242
AspIle: 3.836 ± 0.667
AspLys: 3.925 ± 0.793
AspLeu: 5.174 ± 0.665
AspMet: 1.606 ± 0.372
AspAsn: 3.39 ± 0.409
AspPro: 2.409 ± 0.469
AspGln: 1.517 ± 0.462
AspArg: 3.657 ± 0.365
AspSer: 2.498 ± 0.622
AspThr: 3.39 ± 0.631
AspVal: 4.103 ± 0.682
AspTrp: 0.803 ± 0.24
AspTyr: 2.409 ± 0.453
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.155 ± 0.839
GluCys: 0.357 ± 0.163
GluAsp: 2.855 ± 0.711
GluGlu: 4.906 ± 1.065
GluPhe: 3.301 ± 0.555
GluGly: 4.46 ± 0.985
GluHis: 1.338 ± 0.307
GluIle: 1.963 ± 0.421
GluLys: 4.906 ± 0.815
GluLeu: 5.798 ± 0.744
GluMet: 1.517 ± 0.39
GluAsn: 3.301 ± 0.464
GluPro: 2.409 ± 0.462
GluGln: 1.873 ± 0.481
GluArg: 4.014 ± 0.692
GluSer: 3.301 ± 0.622
GluThr: 3.033 ± 0.668
GluVal: 3.39 ± 0.551
GluTrp: 1.695 ± 0.43
GluTyr: 1.784 ± 0.333
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.587 ± 0.458
PheCys: 0.178 ± 0.112
PheAsp: 3.033 ± 0.636
PheGlu: 2.23 ± 0.451
PhePhe: 0.803 ± 0.301
PheGly: 3.211 ± 0.631
PheHis: 0.357 ± 0.164
PheIle: 1.784 ± 0.358
PheLys: 1.963 ± 0.359
PheLeu: 2.319 ± 0.448
PheMet: 1.07 ± 0.315
PheAsn: 1.784 ± 0.423
PhePro: 2.052 ± 0.386
PheGln: 1.427 ± 0.317
PheArg: 3.479 ± 0.463
PheSer: 3.122 ± 0.541
PheThr: 2.765 ± 0.465
PheVal: 2.587 ± 0.302
PheTrp: 0.268 ± 0.118
PheTyr: 0.981 ± 0.353
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.334 ± 0.568
GlyCys: 0.714 ± 0.271
GlyAsp: 5.174 ± 0.689
GlyGlu: 4.371 ± 0.942
GlyPhe: 4.014 ± 0.579
GlyGly: 6.066 ± 0.967
GlyHis: 1.517 ± 0.48
GlyIle: 4.639 ± 0.757
GlyLys: 4.906 ± 0.747
GlyLeu: 6.69 ± 0.701
GlyMet: 1.695 ± 0.309
GlyAsn: 4.193 ± 0.627
GlyPro: 0.981 ± 0.343
GlyGln: 3.033 ± 0.413
GlyArg: 4.639 ± 0.641
GlySer: 6.066 ± 0.902
GlyThr: 5.977 ± 1.005
GlyVal: 5.798 ± 0.595
GlyTrp: 1.517 ± 0.441
GlyTyr: 3.479 ± 0.454
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.07 ± 0.339
HisCys: 0.357 ± 0.192
HisAsp: 0.981 ± 0.269
HisGlu: 0.892 ± 0.252
HisPhe: 0.803 ± 0.239
HisGly: 1.517 ± 0.349
HisHis: 0.089 ± 0.087
HisIle: 1.07 ± 0.347
HisLys: 0.535 ± 0.191
HisLeu: 2.498 ± 0.537
HisMet: 0.624 ± 0.195
HisAsn: 0.714 ± 0.246
HisPro: 0.714 ± 0.233
HisGln: 0.357 ± 0.156
HisArg: 1.427 ± 0.404
HisSer: 0.981 ± 0.282
HisThr: 0.624 ± 0.276
HisVal: 1.16 ± 0.339
HisTrp: 0.357 ± 0.118
HisTyr: 0.357 ± 0.173
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.46 ± 0.569
IleCys: 0.535 ± 0.264
IleAsp: 3.657 ± 0.622
IleGlu: 3.211 ± 0.618
IlePhe: 0.357 ± 0.159
IleGly: 4.103 ± 0.526
IleHis: 0.892 ± 0.26
IleIle: 2.587 ± 0.626
IleLys: 3.39 ± 0.513
IleLeu: 3.925 ± 0.662
IleMet: 0.357 ± 0.166
IleAsn: 2.319 ± 0.412
IlePro: 3.122 ± 0.616
IleGln: 2.052 ± 0.392
IleArg: 1.963 ± 0.422
IleSer: 2.855 ± 0.462
IleThr: 3.479 ± 0.77
IleVal: 3.033 ± 0.495
IleTrp: 0.714 ± 0.205
IleTyr: 0.714 ± 0.234
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.136 ± 0.838
LysCys: 0.268 ± 0.156
LysAsp: 4.817 ± 0.706
LysGlu: 2.944 ± 0.458
LysPhe: 1.963 ± 0.418
LysGly: 5.888 ± 0.936
LysHis: 1.338 ± 0.431
LysIle: 2.23 ± 0.458
LysLys: 3.479 ± 0.561
LysLeu: 5.352 ± 0.851
LysMet: 0.981 ± 0.381
LysAsn: 1.963 ± 0.434
LysPro: 2.587 ± 0.588
LysGln: 2.676 ± 0.5
LysArg: 3.925 ± 0.575
LysSer: 3.211 ± 0.448
LysThr: 3.479 ± 0.471
LysVal: 4.817 ± 0.544
LysTrp: 1.606 ± 0.418
LysTyr: 1.873 ± 0.37
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.385 ± 0.925
LeuCys: 0.178 ± 0.111
LeuAsp: 4.728 ± 0.445
LeuGlu: 6.066 ± 0.912
LeuPhe: 2.765 ± 0.456
LeuGly: 6.869 ± 0.699
LeuHis: 1.517 ± 0.41
LeuIle: 3.122 ± 0.582
LeuLys: 6.244 ± 0.823
LeuLeu: 5.888 ± 0.778
LeuMet: 2.676 ± 0.429
LeuAsn: 3.39 ± 0.539
LeuPro: 4.55 ± 1.09
LeuGln: 3.033 ± 0.421
LeuArg: 5.263 ± 1.073
LeuSer: 4.996 ± 0.526
LeuThr: 4.46 ± 0.515
LeuVal: 5.442 ± 0.705
LeuTrp: 0.803 ± 0.333
LeuTyr: 3.122 ± 0.636
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.676 ± 0.454
MetCys: 0.0 ± 0.0
MetAsp: 1.249 ± 0.342
MetGlu: 1.517 ± 0.355
MetPhe: 0.892 ± 0.281
MetGly: 1.784 ± 0.612
MetHis: 0.446 ± 0.216
MetIle: 0.892 ± 0.338
MetLys: 1.16 ± 0.393
MetLeu: 1.517 ± 0.361
MetMet: 0.714 ± 0.345
MetAsn: 0.624 ± 0.264
MetPro: 1.07 ± 0.304
MetGln: 0.981 ± 0.383
MetArg: 1.249 ± 0.332
MetSer: 1.427 ± 0.394
MetThr: 1.695 ± 0.34
MetVal: 1.517 ± 0.414
MetTrp: 0.446 ± 0.182
MetTyr: 0.892 ± 0.267
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.836 ± 0.555
AsnCys: 0.268 ± 0.188
AsnAsp: 2.052 ± 0.446
AsnGlu: 2.855 ± 0.432
AsnPhe: 2.498 ± 0.595
AsnGly: 4.282 ± 0.711
AsnHis: 0.624 ± 0.276
AsnIle: 1.695 ± 0.388
AsnLys: 2.319 ± 0.344
AsnLeu: 4.193 ± 0.569
AsnMet: 0.624 ± 0.222
AsnAsn: 1.963 ± 0.466
AsnPro: 2.765 ± 0.347
AsnGln: 1.695 ± 0.529
AsnArg: 1.873 ± 0.446
AsnSer: 2.498 ± 0.325
AsnThr: 3.033 ± 0.931
AsnVal: 2.765 ± 0.539
AsnTrp: 0.357 ± 0.161
AsnTyr: 0.803 ± 0.231
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.817 ± 0.753
ProCys: 0.268 ± 0.125
ProAsp: 2.676 ± 0.531
ProGlu: 3.657 ± 0.723
ProPhe: 1.784 ± 0.383
ProGly: 3.211 ± 0.495
ProHis: 0.714 ± 0.232
ProIle: 1.16 ± 0.348
ProLys: 2.855 ± 0.546
ProLeu: 3.39 ± 0.579
ProMet: 1.07 ± 0.305
ProAsn: 1.695 ± 0.276
ProPro: 2.765 ± 0.586
ProGln: 1.784 ± 0.404
ProArg: 1.606 ± 0.35
ProSer: 2.676 ± 0.608
ProThr: 3.122 ± 0.69
ProVal: 3.925 ± 0.429
ProTrp: 0.981 ± 0.312
ProTyr: 1.606 ± 0.319
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.639 ± 0.811
GlnCys: 0.268 ± 0.155
GlnAsp: 1.606 ± 0.357
GlnGlu: 2.765 ± 0.588
GlnPhe: 2.409 ± 0.55
GlnGly: 3.211 ± 0.657
GlnHis: 1.07 ± 0.348
GlnIle: 1.873 ± 0.479
GlnLys: 3.033 ± 0.561
GlnLeu: 3.211 ± 0.585
GlnMet: 1.07 ± 0.445
GlnAsn: 1.16 ± 0.341
GlnPro: 1.16 ± 0.345
GlnGln: 2.498 ± 0.502
GlnArg: 2.23 ± 0.4
GlnSer: 1.427 ± 0.402
GlnThr: 1.338 ± 0.374
GlnVal: 2.409 ± 0.446
GlnTrp: 0.892 ± 0.252
GlnTyr: 1.16 ± 0.281
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.352 ± 0.835
ArgCys: 0.714 ± 0.36
ArgAsp: 4.282 ± 0.66
ArgGlu: 3.39 ± 0.526
ArgPhe: 2.23 ± 0.349
ArgGly: 4.46 ± 0.552
ArgHis: 1.16 ± 0.294
ArgIle: 3.568 ± 0.515
ArgLys: 3.479 ± 0.613
ArgLeu: 5.442 ± 0.574
ArgMet: 1.16 ± 0.349
ArgAsn: 1.963 ± 0.392
ArgPro: 2.319 ± 0.518
ArgGln: 1.695 ± 0.502
ArgArg: 4.014 ± 0.674
ArgSer: 1.873 ± 0.515
ArgThr: 2.855 ± 0.591
ArgVal: 3.657 ± 0.641
ArgTrp: 1.07 ± 0.333
ArgTyr: 2.052 ± 0.379
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.352 ± 0.821
SerCys: 0.535 ± 0.216
SerAsp: 3.122 ± 0.604
SerGlu: 3.568 ± 0.807
SerPhe: 2.23 ± 0.447
SerGly: 5.62 ± 0.948
SerHis: 0.714 ± 0.246
SerIle: 2.23 ± 0.466
SerLys: 3.657 ± 0.596
SerLeu: 4.906 ± 0.604
SerMet: 1.427 ± 0.33
SerAsn: 1.695 ± 0.335
SerPro: 2.676 ± 0.364
SerGln: 2.855 ± 0.608
SerArg: 2.409 ± 0.555
SerSer: 3.211 ± 0.688
SerThr: 3.657 ± 0.894
SerVal: 4.55 ± 0.614
SerTrp: 0.624 ± 0.233
SerTyr: 1.606 ± 0.448
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.442 ± 0.928
ThrCys: 0.446 ± 0.23
ThrAsp: 4.193 ± 0.474
ThrGlu: 3.033 ± 0.479
ThrPhe: 2.141 ± 0.434
ThrGly: 5.263 ± 0.725
ThrHis: 1.16 ± 0.368
ThrIle: 3.657 ± 0.633
ThrLys: 3.657 ± 0.646
ThrLeu: 4.996 ± 0.612
ThrMet: 1.338 ± 0.275
ThrAsn: 2.141 ± 0.565
ThrPro: 4.371 ± 0.857
ThrGln: 2.052 ± 0.444
ThrArg: 2.587 ± 0.45
ThrSer: 2.944 ± 0.467
ThrThr: 4.103 ± 0.678
ThrVal: 5.174 ± 0.928
ThrTrp: 0.624 ± 0.223
ThrTyr: 1.873 ± 0.644
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.601 ± 0.916
ValCys: 1.16 ± 0.394
ValAsp: 3.122 ± 0.583
ValGlu: 5.174 ± 0.565
ValPhe: 2.676 ± 0.612
ValGly: 4.46 ± 0.685
ValHis: 1.338 ± 0.356
ValIle: 3.925 ± 0.701
ValLys: 3.568 ± 0.607
ValLeu: 4.906 ± 0.642
ValMet: 1.427 ± 0.31
ValAsn: 3.033 ± 0.532
ValPro: 4.103 ± 0.56
ValGln: 2.765 ± 0.512
ValArg: 4.282 ± 0.703
ValSer: 4.55 ± 0.85
ValThr: 4.996 ± 0.904
ValVal: 5.62 ± 0.912
ValTrp: 0.624 ± 0.242
ValTyr: 1.873 ± 0.51
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.784 ± 0.412
TrpCys: 0.268 ± 0.156
TrpAsp: 1.07 ± 0.311
TrpGlu: 0.535 ± 0.155
TrpPhe: 0.357 ± 0.17
TrpGly: 0.981 ± 0.267
TrpHis: 0.446 ± 0.188
TrpIle: 0.714 ± 0.271
TrpLys: 0.714 ± 0.267
TrpLeu: 1.07 ± 0.332
TrpMet: 0.892 ± 0.241
TrpAsn: 0.803 ± 0.278
TrpPro: 0.446 ± 0.183
TrpGln: 0.714 ± 0.253
TrpArg: 1.07 ± 0.272
TrpSer: 0.535 ± 0.159
TrpThr: 1.695 ± 0.367
TrpVal: 1.16 ± 0.249
TrpTrp: 0.357 ± 0.159
TrpTyr: 0.178 ± 0.105
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.676 ± 0.569
TyrCys: 0.268 ± 0.16
TyrAsp: 1.963 ± 0.424
TyrGlu: 1.695 ± 0.503
TyrPhe: 0.714 ± 0.197
TyrGly: 2.587 ± 0.353
TyrHis: 0.268 ± 0.142
TyrIle: 1.784 ± 0.4
TyrLys: 1.873 ± 0.369
TyrLeu: 2.676 ± 0.454
TyrMet: 0.357 ± 0.177
TyrAsn: 1.16 ± 0.371
TyrPro: 1.606 ± 0.357
TyrGln: 1.963 ± 0.456
TyrArg: 1.784 ± 0.439
TyrSer: 1.606 ± 0.275
TyrThr: 2.23 ± 0.623
TyrVal: 2.052 ± 0.329
TyrTrp: 0.446 ± 0.242
TyrTyr: 0.357 ± 0.172
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 39 proteins (11211 amino acids)

Note: The error was estimated by bootstrapping (100 resamplings) at the protein level.
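A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the helper names (`dipeptide_permille_single`, `bootstrap_error`), the use of the standard deviation of the resampled estimates as the reported error, and the toy sequences are all hypothetical.

```python
import random
import statistics


def dipeptide_permille_single(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide, counting overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread.

    Assumption: the error is the sample standard deviation of the
    n_rounds resampled frequency estimates.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille_single(resample, dipeptide))
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not the 39 proteins of the phage.
toy = ["MAAK", "MKKK", "AAAA"]
print(dipeptide_permille_single(toy, "AA"), bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome.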

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski