Amino acid dipepetide frequency for Rhodovulum phage RS1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
16.232AlaAla: 16.232 ± 2.244
0.812AlaCys: 0.812 ± 0.204
6.493AlaAsp: 6.493 ± 0.694
7.629AlaGlu: 7.629 ± 0.801
3.815AlaPhe: 3.815 ± 0.646
9.171AlaGly: 9.171 ± 0.967
1.055AlaHis: 1.055 ± 0.298
5.763AlaIle: 5.763 ± 0.572
6.006AlaLys: 6.006 ± 0.862
11.85AlaLeu: 11.85 ± 0.901
2.922AlaMet: 2.922 ± 0.521
3.246AlaAsn: 3.246 ± 0.786
5.6AlaPro: 5.6 ± 0.582
4.707AlaGln: 4.707 ± 1.005
9.171AlaArg: 9.171 ± 0.742
7.792AlaSer: 7.792 ± 1.056
7.467AlaThr: 7.467 ± 0.697
7.71AlaVal: 7.71 ± 0.709
2.354AlaTrp: 2.354 ± 0.349
1.217AlaTyr: 1.217 ± 0.339
0.0AlaXaa: 0.0 ± 0.0
Cys
0.73CysAla: 0.73 ± 0.246
0.325CysCys: 0.325 ± 0.164
0.243CysAsp: 0.243 ± 0.163
0.568CysGlu: 0.568 ± 0.198
0.162CysPhe: 0.162 ± 0.111
0.568CysGly: 0.568 ± 0.21
0.162CysHis: 0.162 ± 0.122
0.325CysIle: 0.325 ± 0.143
0.162CysLys: 0.162 ± 0.13
0.487CysLeu: 0.487 ± 0.208
0.162CysMet: 0.162 ± 0.105
0.243CysAsn: 0.243 ± 0.163
0.812CysPro: 0.812 ± 0.308
0.162CysGln: 0.162 ± 0.128
0.568CysArg: 0.568 ± 0.257
0.568CysSer: 0.568 ± 0.215
0.325CysThr: 0.325 ± 0.174
0.243CysVal: 0.243 ± 0.123
0.081CysTrp: 0.081 ± 0.08
0.243CysTyr: 0.243 ± 0.133
0.0CysXaa: 0.0 ± 0.0
Asp
9.009AspAla: 9.009 ± 0.732
0.406AspCys: 0.406 ± 0.195
3.571AspAsp: 3.571 ± 0.63
4.302AspGlu: 4.302 ± 0.546
2.191AspPhe: 2.191 ± 0.395
5.681AspGly: 5.681 ± 0.828
1.704AspHis: 1.704 ± 0.381
3.246AspIle: 3.246 ± 0.473
1.704AspLys: 1.704 ± 0.411
7.142AspLeu: 7.142 ± 0.753
1.542AspMet: 1.542 ± 0.245
1.542AspAsn: 1.542 ± 0.366
4.139AspPro: 4.139 ± 0.633
3.165AspGln: 3.165 ± 0.407
3.652AspArg: 3.652 ± 0.553
1.623AspSer: 1.623 ± 0.308
2.435AspThr: 2.435 ± 0.494
2.435AspVal: 2.435 ± 0.378
1.542AspTrp: 1.542 ± 0.404
1.623AspTyr: 1.623 ± 0.447
0.0AspXaa: 0.0 ± 0.0
Glu
8.684GluAla: 8.684 ± 0.872
0.568GluCys: 0.568 ± 0.25
3.328GluAsp: 3.328 ± 0.586
4.058GluGlu: 4.058 ± 0.511
2.678GluPhe: 2.678 ± 0.608
3.328GluGly: 3.328 ± 0.465
1.623GluHis: 1.623 ± 0.314
5.357GluIle: 5.357 ± 0.646
3.246GluLys: 3.246 ± 0.605
7.467GluLeu: 7.467 ± 0.718
2.029GluMet: 2.029 ± 0.44
2.191GluAsn: 2.191 ± 0.432
2.354GluPro: 2.354 ± 0.525
3.49GluGln: 3.49 ± 0.773
4.545GluArg: 4.545 ± 0.874
2.191GluSer: 2.191 ± 0.397
4.464GluThr: 4.464 ± 0.558
3.896GluVal: 3.896 ± 0.501
1.055GluTrp: 1.055 ± 0.319
1.136GluTyr: 1.136 ± 0.267
0.0GluXaa: 0.0 ± 0.0
Phe
3.246PheAla: 3.246 ± 0.425
0.325PheCys: 0.325 ± 0.142
2.678PheAsp: 2.678 ± 0.423
2.191PheGlu: 2.191 ± 0.49
0.487PhePhe: 0.487 ± 0.209
3.165PheGly: 3.165 ± 0.445
0.325PheHis: 0.325 ± 0.142
0.73PheIle: 0.73 ± 0.214
1.542PheLys: 1.542 ± 0.356
1.461PheLeu: 1.461 ± 0.318
1.217PheMet: 1.217 ± 0.294
1.055PheAsn: 1.055 ± 0.226
0.649PhePro: 0.649 ± 0.246
1.055PheGln: 1.055 ± 0.351
2.029PheArg: 2.029 ± 0.411
2.516PheSer: 2.516 ± 0.475
1.704PheThr: 1.704 ± 0.52
1.786PheVal: 1.786 ± 0.356
0.243PheTrp: 0.243 ± 0.137
0.325PheTyr: 0.325 ± 0.172
0.0PheXaa: 0.0 ± 0.0
Gly
10.632GlyAla: 10.632 ± 1.045
0.406GlyCys: 0.406 ± 0.218
4.464GlyAsp: 4.464 ± 0.609
3.977GlyGlu: 3.977 ± 0.533
2.354GlyPhe: 2.354 ± 0.422
5.925GlyGly: 5.925 ± 0.922
1.948GlyHis: 1.948 ± 0.466
4.22GlyIle: 4.22 ± 0.684
3.084GlyLys: 3.084 ± 0.409
7.954GlyLeu: 7.954 ± 0.887
2.191GlyMet: 2.191 ± 0.398
2.11GlyAsn: 2.11 ± 0.391
3.165GlyPro: 3.165 ± 0.392
2.922GlyGln: 2.922 ± 0.705
5.844GlyArg: 5.844 ± 0.803
5.113GlySer: 5.113 ± 0.711
4.951GlyThr: 4.951 ± 0.72
5.357GlyVal: 5.357 ± 0.73
1.461GlyTrp: 1.461 ± 0.352
1.542GlyTyr: 1.542 ± 0.295
0.0GlyXaa: 0.0 ± 0.0
His
1.136HisAla: 1.136 ± 0.293
0.162HisCys: 0.162 ± 0.107
1.461HisAsp: 1.461 ± 0.361
0.974HisGlu: 0.974 ± 0.258
0.812HisPhe: 0.812 ± 0.211
1.704HisGly: 1.704 ± 0.349
0.406HisHis: 0.406 ± 0.168
0.649HisIle: 0.649 ± 0.233
0.162HisLys: 0.162 ± 0.089
1.786HisLeu: 1.786 ± 0.398
0.649HisMet: 0.649 ± 0.267
0.568HisAsn: 0.568 ± 0.219
0.649HisPro: 0.649 ± 0.242
0.649HisGln: 0.649 ± 0.203
1.704HisArg: 1.704 ± 0.369
1.299HisSer: 1.299 ± 0.401
0.487HisThr: 0.487 ± 0.25
1.136HisVal: 1.136 ± 0.291
0.325HisTrp: 0.325 ± 0.186
0.487HisTyr: 0.487 ± 0.165
0.0HisXaa: 0.0 ± 0.0
Ile
6.006IleAla: 6.006 ± 0.669
0.649IleCys: 0.649 ± 0.216
5.113IleAsp: 5.113 ± 0.69
5.438IleGlu: 5.438 ± 0.647
0.812IlePhe: 0.812 ± 0.242
4.626IleGly: 4.626 ± 0.534
0.649IleHis: 0.649 ± 0.257
1.623IleIle: 1.623 ± 0.328
1.461IleLys: 1.461 ± 0.353
3.815IleLeu: 3.815 ± 0.455
0.325IleMet: 0.325 ± 0.155
1.055IleAsn: 1.055 ± 0.297
2.678IlePro: 2.678 ± 0.47
1.461IleGln: 1.461 ± 0.33
3.571IleArg: 3.571 ± 0.511
3.815IleSer: 3.815 ± 0.492
3.003IleThr: 3.003 ± 0.799
3.328IleVal: 3.328 ± 0.702
1.055IleTrp: 1.055 ± 0.214
1.38IleTyr: 1.38 ± 0.407
0.0IleXaa: 0.0 ± 0.0
Lys
6.006LysAla: 6.006 ± 0.688
0.243LysCys: 0.243 ± 0.154
1.299LysAsp: 1.299 ± 0.314
2.678LysGlu: 2.678 ± 0.507
0.974LysPhe: 0.974 ± 0.238
3.977LysGly: 3.977 ± 0.536
0.73LysHis: 0.73 ± 0.245
2.435LysIle: 2.435 ± 0.55
1.136LysLys: 1.136 ± 0.217
4.139LysLeu: 4.139 ± 0.614
0.649LysMet: 0.649 ± 0.236
0.568LysAsn: 0.568 ± 0.179
2.597LysPro: 2.597 ± 0.613
1.542LysGln: 1.542 ± 0.409
3.246LysArg: 3.246 ± 0.552
2.435LysSer: 2.435 ± 0.334
3.328LysThr: 3.328 ± 0.611
2.354LysVal: 2.354 ± 0.544
0.812LysTrp: 0.812 ± 0.24
0.568LysTyr: 0.568 ± 0.154
0.0LysXaa: 0.0 ± 0.0
Leu
10.47LeuAla: 10.47 ± 1.013
0.812LeuCys: 0.812 ± 0.339
6.249LeuAsp: 6.249 ± 0.877
6.087LeuGlu: 6.087 ± 0.827
2.191LeuPhe: 2.191 ± 0.409
7.386LeuGly: 7.386 ± 0.923
1.299LeuHis: 1.299 ± 0.331
4.626LeuIle: 4.626 ± 0.623
3.733LeuLys: 3.733 ± 0.478
6.899LeuLeu: 6.899 ± 0.812
2.435LeuMet: 2.435 ± 0.494
2.841LeuAsn: 2.841 ± 0.47
4.87LeuPro: 4.87 ± 0.625
3.328LeuGln: 3.328 ± 0.529
7.061LeuArg: 7.061 ± 0.73
6.655LeuSer: 6.655 ± 0.7
5.844LeuThr: 5.844 ± 0.674
5.519LeuVal: 5.519 ± 0.543
1.299LeuTrp: 1.299 ± 0.449
1.704LeuTyr: 1.704 ± 0.414
0.0LeuXaa: 0.0 ± 0.0
Met
2.841MetAla: 2.841 ± 0.422
0.081MetCys: 0.081 ± 0.075
1.136MetAsp: 1.136 ± 0.29
1.623MetGlu: 1.623 ± 0.423
0.568MetPhe: 0.568 ± 0.214
1.867MetGly: 1.867 ± 0.576
0.73MetHis: 0.73 ± 0.242
1.217MetIle: 1.217 ± 0.271
1.299MetLys: 1.299 ± 0.36
2.435MetLeu: 2.435 ± 0.469
0.974MetMet: 0.974 ± 0.359
0.487MetAsn: 0.487 ± 0.213
1.623MetPro: 1.623 ± 0.35
1.217MetGln: 1.217 ± 0.313
2.191MetArg: 2.191 ± 0.441
2.191MetSer: 2.191 ± 0.377
2.191MetThr: 2.191 ± 0.385
0.73MetVal: 0.73 ± 0.278
0.487MetTrp: 0.487 ± 0.225
0.325MetTyr: 0.325 ± 0.188
0.0MetXaa: 0.0 ± 0.0
Asn
3.165AsnAla: 3.165 ± 0.76
0.162AsnCys: 0.162 ± 0.155
2.191AsnAsp: 2.191 ± 0.439
0.974AsnGlu: 0.974 ± 0.216
0.73AsnPhe: 0.73 ± 0.225
2.354AsnGly: 2.354 ± 0.417
0.406AsnHis: 0.406 ± 0.158
1.299AsnIle: 1.299 ± 0.267
0.893AsnLys: 0.893 ± 0.248
2.435AsnLeu: 2.435 ± 0.584
0.406AsnMet: 0.406 ± 0.172
0.649AsnAsn: 0.649 ± 0.232
1.867AsnPro: 1.867 ± 0.463
0.649AsnGln: 0.649 ± 0.373
1.623AsnArg: 1.623 ± 0.385
2.029AsnSer: 2.029 ± 0.348
1.623AsnThr: 1.623 ± 0.356
1.542AsnVal: 1.542 ± 0.331
0.568AsnTrp: 0.568 ± 0.223
0.406AsnTyr: 0.406 ± 0.173
0.0AsnXaa: 0.0 ± 0.0
Pro
5.357ProAla: 5.357 ± 0.732
0.081ProCys: 0.081 ± 0.075
3.003ProAsp: 3.003 ± 0.543
3.165ProGlu: 3.165 ± 0.546
1.623ProPhe: 1.623 ± 0.342
4.139ProGly: 4.139 ± 0.585
1.136ProHis: 1.136 ± 0.338
1.867ProIle: 1.867 ± 0.353
2.273ProLys: 2.273 ± 0.526
4.22ProLeu: 4.22 ± 0.546
1.299ProMet: 1.299 ± 0.344
0.893ProAsn: 0.893 ± 0.31
2.029ProPro: 2.029 ± 0.391
1.704ProGln: 1.704 ± 0.336
3.977ProArg: 3.977 ± 0.584
3.571ProSer: 3.571 ± 0.686
1.948ProThr: 1.948 ± 0.388
4.058ProVal: 4.058 ± 0.638
1.217ProTrp: 1.217 ± 0.461
1.055ProTyr: 1.055 ± 0.278
0.0ProXaa: 0.0 ± 0.0
Gln
5.194GlnAla: 5.194 ± 0.789
0.081GlnCys: 0.081 ± 0.092
1.299GlnAsp: 1.299 ± 0.315
2.516GlnGlu: 2.516 ± 0.413
0.974GlnPhe: 0.974 ± 0.227
2.354GlnGly: 2.354 ± 0.58
0.73GlnHis: 0.73 ± 0.213
3.003GlnIle: 3.003 ± 0.496
1.867GlnLys: 1.867 ± 0.415
4.139GlnLeu: 4.139 ± 0.599
1.38GlnMet: 1.38 ± 0.347
0.812GlnAsn: 0.812 ± 0.253
1.948GlnPro: 1.948 ± 0.472
1.786GlnGln: 1.786 ± 0.676
2.354GlnArg: 2.354 ± 0.383
1.38GlnSer: 1.38 ± 0.441
2.922GlnThr: 2.922 ± 0.606
2.597GlnVal: 2.597 ± 0.396
0.568GlnTrp: 0.568 ± 0.218
0.812GlnTyr: 0.812 ± 0.313
0.0GlnXaa: 0.0 ± 0.0
Arg
7.467ArgAla: 7.467 ± 0.725
0.325ArgCys: 0.325 ± 0.241
5.438ArgAsp: 5.438 ± 0.686
5.032ArgGlu: 5.032 ± 0.578
2.678ArgPhe: 2.678 ± 0.417
4.789ArgGly: 4.789 ± 0.592
1.542ArgHis: 1.542 ± 0.343
3.165ArgIle: 3.165 ± 0.542
4.383ArgLys: 4.383 ± 1.022
6.899ArgLeu: 6.899 ± 0.883
1.704ArgMet: 1.704 ± 0.572
2.11ArgAsn: 2.11 ± 0.485
3.571ArgPro: 3.571 ± 0.581
2.597ArgGln: 2.597 ± 0.485
6.087ArgArg: 6.087 ± 0.995
4.951ArgSer: 4.951 ± 0.514
2.354ArgThr: 2.354 ± 0.443
5.438ArgVal: 5.438 ± 0.84
1.217ArgTrp: 1.217 ± 0.209
1.623ArgTyr: 1.623 ± 0.394
0.0ArgXaa: 0.0 ± 0.0
Ser
6.736SerAla: 6.736 ± 1.112
0.568SerCys: 0.568 ± 0.238
5.113SerAsp: 5.113 ± 0.507
5.276SerGlu: 5.276 ± 0.545
2.191SerPhe: 2.191 ± 0.473
6.736SerGly: 6.736 ± 0.858
0.568SerHis: 0.568 ± 0.199
3.896SerIle: 3.896 ± 0.639
2.354SerLys: 2.354 ± 0.442
4.383SerLeu: 4.383 ± 0.545
1.461SerMet: 1.461 ± 0.315
1.867SerAsn: 1.867 ± 0.401
2.516SerPro: 2.516 ± 0.466
2.191SerGln: 2.191 ± 0.445
4.464SerArg: 4.464 ± 0.532
4.626SerSer: 4.626 ± 0.703
3.49SerThr: 3.49 ± 0.455
2.516SerVal: 2.516 ± 0.444
0.568SerTrp: 0.568 ± 0.259
1.055SerTyr: 1.055 ± 0.319
0.0SerXaa: 0.0 ± 0.0
Thr
6.087ThrAla: 6.087 ± 0.774
0.406ThrCys: 0.406 ± 0.186
4.707ThrAsp: 4.707 ± 0.832
4.545ThrGlu: 4.545 ± 0.677
1.461ThrPhe: 1.461 ± 0.384
4.951ThrGly: 4.951 ± 0.504
1.217ThrHis: 1.217 ± 0.277
3.165ThrIle: 3.165 ± 0.554
2.76ThrLys: 2.76 ± 0.544
4.545ThrLeu: 4.545 ± 0.432
1.704ThrMet: 1.704 ± 0.31
1.217ThrAsn: 1.217 ± 0.282
2.597ThrPro: 2.597 ± 0.521
2.191ThrGln: 2.191 ± 0.352
2.76ThrArg: 2.76 ± 0.405
3.084ThrSer: 3.084 ± 0.456
3.733ThrThr: 3.733 ± 0.548
4.464ThrVal: 4.464 ± 0.77
1.136ThrTrp: 1.136 ± 0.327
1.055ThrTyr: 1.055 ± 0.256
0.0ThrXaa: 0.0 ± 0.0
Val
7.873ValAla: 7.873 ± 0.67
0.406ValCys: 0.406 ± 0.194
2.841ValAsp: 2.841 ± 0.49
5.194ValGlu: 5.194 ± 0.685
1.623ValPhe: 1.623 ± 0.443
3.084ValGly: 3.084 ± 0.691
0.568ValHis: 0.568 ± 0.203
3.409ValIle: 3.409 ± 0.563
2.922ValLys: 2.922 ± 0.501
5.357ValLeu: 5.357 ± 0.665
1.786ValMet: 1.786 ± 0.476
1.136ValAsn: 1.136 ± 0.321
3.165ValPro: 3.165 ± 0.55
2.435ValGln: 2.435 ± 0.587
4.951ValArg: 4.951 ± 0.706
4.545ValSer: 4.545 ± 0.703
3.733ValThr: 3.733 ± 0.569
4.707ValVal: 4.707 ± 0.796
1.217ValTrp: 1.217 ± 0.263
1.136ValTyr: 1.136 ± 0.296
0.0ValXaa: 0.0 ± 0.0
Trp
1.704TrpAla: 1.704 ± 0.352
0.081TrpCys: 0.081 ± 0.072
1.055TrpAsp: 1.055 ± 0.301
0.73TrpGlu: 0.73 ± 0.192
0.243TrpPhe: 0.243 ± 0.151
1.786TrpGly: 1.786 ± 0.352
0.406TrpHis: 0.406 ± 0.167
0.974TrpIle: 0.974 ± 0.251
0.568TrpLys: 0.568 ± 0.207
1.38TrpLeu: 1.38 ± 0.307
0.73TrpMet: 0.73 ± 0.275
0.974TrpAsn: 0.974 ± 0.294
1.136TrpPro: 1.136 ± 0.302
0.568TrpGln: 0.568 ± 0.223
1.623TrpArg: 1.623 ± 0.276
0.974TrpSer: 0.974 ± 0.262
1.136TrpThr: 1.136 ± 0.324
1.136TrpVal: 1.136 ± 0.325
0.162TrpTrp: 0.162 ± 0.121
0.162TrpTyr: 0.162 ± 0.114
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.435TyrAla: 2.435 ± 0.465
0.243TyrCys: 0.243 ± 0.147
0.974TyrAsp: 0.974 ± 0.275
0.73TyrGlu: 0.73 ± 0.264
0.162TyrPhe: 0.162 ± 0.103
2.029TyrGly: 2.029 ± 0.487
0.0TyrHis: 0.0 ± 0.0
0.893TyrIle: 0.893 ± 0.308
0.162TyrLys: 0.162 ± 0.11
2.597TyrLeu: 2.597 ± 0.474
0.568TyrMet: 0.568 ± 0.258
0.406TyrAsn: 0.406 ± 0.135
0.649TyrPro: 0.649 ± 0.269
0.893TyrGln: 0.893 ± 0.257
1.948TyrArg: 1.948 ± 0.471
1.217TyrSer: 1.217 ± 0.374
0.649TyrThr: 0.649 ± 0.242
1.136TyrVal: 1.136 ± 0.311
0.162TyrTrp: 0.162 ± 0.116
0.325TyrTyr: 0.325 ± 0.122
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 56 proteins (12322 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski