Amino acid dipepetide frequency for Rhizobium phage RR1-A

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.064AlaAla: 14.064 ± 1.105
0.966AlaCys: 0.966 ± 0.216
7.545AlaAsp: 7.545 ± 0.598
8.571AlaGlu: 8.571 ± 0.714
4.105AlaPhe: 4.105 ± 0.456
8.994AlaGly: 8.994 ± 0.739
1.932AlaHis: 1.932 ± 0.371
6.036AlaIle: 6.036 ± 0.706
4.406AlaLys: 4.406 ± 0.657
8.632AlaLeu: 8.632 ± 0.862
3.923AlaMet: 3.923 ± 0.518
3.803AlaAsn: 3.803 ± 0.511
5.01AlaPro: 5.01 ± 0.465
3.38AlaGln: 3.38 ± 0.45
6.217AlaArg: 6.217 ± 0.595
6.036AlaSer: 6.036 ± 0.618
5.191AlaThr: 5.191 ± 0.516
7.183AlaVal: 7.183 ± 0.606
1.871AlaTrp: 1.871 ± 0.286
2.052AlaTyr: 2.052 ± 0.316
0.0AlaXaa: 0.0 ± 0.0
Cys
1.207CysAla: 1.207 ± 0.326
0.121CysCys: 0.121 ± 0.079
0.483CysAsp: 0.483 ± 0.162
0.905CysGlu: 0.905 ± 0.225
0.181CysPhe: 0.181 ± 0.101
1.569CysGly: 1.569 ± 0.397
0.241CysHis: 0.241 ± 0.109
0.543CysIle: 0.543 ± 0.182
0.604CysLys: 0.604 ± 0.263
0.543CysLeu: 0.543 ± 0.174
0.241CysMet: 0.241 ± 0.124
0.423CysAsn: 0.423 ± 0.135
0.483CysPro: 0.483 ± 0.142
0.362CysGln: 0.362 ± 0.131
1.63CysArg: 1.63 ± 0.32
0.845CysSer: 0.845 ± 0.297
0.724CysThr: 0.724 ± 0.225
0.543CysVal: 0.543 ± 0.169
0.06CysTrp: 0.06 ± 0.059
0.121CysTyr: 0.121 ± 0.088
0.0CysXaa: 0.0 ± 0.0
Asp
7.304AspAla: 7.304 ± 0.601
0.966AspCys: 0.966 ± 0.249
3.863AspAsp: 3.863 ± 0.552
3.984AspGlu: 3.984 ± 0.4
2.958AspPhe: 2.958 ± 0.415
5.855AspGly: 5.855 ± 0.618
1.569AspHis: 1.569 ± 0.354
3.984AspIle: 3.984 ± 0.543
2.535AspLys: 2.535 ± 0.484
5.372AspLeu: 5.372 ± 0.57
1.086AspMet: 1.086 ± 0.239
1.086AspAsn: 1.086 ± 0.283
3.199AspPro: 3.199 ± 0.359
1.75AspGln: 1.75 ± 0.335
4.105AspArg: 4.105 ± 0.506
2.837AspSer: 2.837 ± 0.309
3.561AspThr: 3.561 ± 0.369
3.682AspVal: 3.682 ± 0.426
1.328AspTrp: 1.328 ± 0.352
2.052AspTyr: 2.052 ± 0.307
0.0AspXaa: 0.0 ± 0.0
Glu
7.605GluAla: 7.605 ± 0.699
1.147GluCys: 1.147 ± 0.277
3.501GluAsp: 3.501 ± 0.511
4.225GluGlu: 4.225 ± 0.559
2.233GluPhe: 2.233 ± 0.378
3.803GluGly: 3.803 ± 0.403
1.328GluHis: 1.328 ± 0.344
3.984GluIle: 3.984 ± 0.45
3.078GluLys: 3.078 ± 0.436
5.312GluLeu: 5.312 ± 0.699
1.932GluMet: 1.932 ± 0.297
1.992GluAsn: 1.992 ± 0.346
2.958GluPro: 2.958 ± 0.514
1.932GluGln: 1.932 ± 0.411
5.734GluArg: 5.734 ± 0.478
2.897GluSer: 2.897 ± 0.436
3.863GluThr: 3.863 ± 0.523
3.622GluVal: 3.622 ± 0.452
1.207GluTrp: 1.207 ± 0.342
1.75GluTyr: 1.75 ± 0.327
0.0GluXaa: 0.0 ± 0.0
Phe
3.682PheAla: 3.682 ± 0.46
0.362PheCys: 0.362 ± 0.16
2.475PheAsp: 2.475 ± 0.358
2.414PheGlu: 2.414 ± 0.371
1.328PhePhe: 1.328 ± 0.308
3.32PheGly: 3.32 ± 0.403
0.543PheHis: 0.543 ± 0.176
1.63PheIle: 1.63 ± 0.413
1.268PheLys: 1.268 ± 0.297
2.656PheLeu: 2.656 ± 0.482
1.147PheMet: 1.147 ± 0.249
1.509PheAsn: 1.509 ± 0.279
1.388PhePro: 1.388 ± 0.266
1.268PheGln: 1.268 ± 0.236
2.716PheArg: 2.716 ± 0.431
2.354PheSer: 2.354 ± 0.369
2.052PheThr: 2.052 ± 0.333
2.294PheVal: 2.294 ± 0.333
0.785PheTrp: 0.785 ± 0.195
0.724PheTyr: 0.724 ± 0.223
0.0PheXaa: 0.0 ± 0.0
Gly
6.76GlyAla: 6.76 ± 0.656
1.147GlyCys: 1.147 ± 0.299
4.225GlyAsp: 4.225 ± 0.611
5.131GlyGlu: 5.131 ± 0.597
2.897GlyPhe: 2.897 ± 0.483
7.183GlyGly: 7.183 ± 0.746
1.569GlyHis: 1.569 ± 0.323
3.682GlyIle: 3.682 ± 0.421
4.95GlyLys: 4.95 ± 0.559
6.096GlyLeu: 6.096 ± 0.566
2.354GlyMet: 2.354 ± 0.396
2.958GlyAsn: 2.958 ± 0.455
2.958GlyPro: 2.958 ± 0.422
2.897GlyGln: 2.897 ± 0.445
5.915GlyArg: 5.915 ± 0.569
5.07GlySer: 5.07 ± 0.466
4.648GlyThr: 4.648 ± 0.374
5.674GlyVal: 5.674 ± 0.526
1.63GlyTrp: 1.63 ± 0.274
2.354GlyTyr: 2.354 ± 0.459
0.0GlyXaa: 0.0 ± 0.0
His
1.69HisAla: 1.69 ± 0.28
0.302HisCys: 0.302 ± 0.11
1.207HisAsp: 1.207 ± 0.277
1.509HisGlu: 1.509 ± 0.308
0.905HisPhe: 0.905 ± 0.236
2.354HisGly: 2.354 ± 0.487
0.604HisHis: 0.604 ± 0.156
1.086HisIle: 1.086 ± 0.28
1.388HisLys: 1.388 ± 0.269
1.811HisLeu: 1.811 ± 0.248
0.423HisMet: 0.423 ± 0.131
0.543HisAsn: 0.543 ± 0.162
1.388HisPro: 1.388 ± 0.272
1.026HisGln: 1.026 ± 0.253
1.328HisArg: 1.328 ± 0.295
0.905HisSer: 0.905 ± 0.202
0.966HisThr: 0.966 ± 0.265
1.268HisVal: 1.268 ± 0.292
0.241HisTrp: 0.241 ± 0.132
0.423HisTyr: 0.423 ± 0.19
0.0HisXaa: 0.0 ± 0.0
Ile
6.459IleAla: 6.459 ± 0.595
0.724IleCys: 0.724 ± 0.181
3.803IleAsp: 3.803 ± 0.435
3.863IleGlu: 3.863 ± 0.46
1.932IlePhe: 1.932 ± 0.386
3.803IleGly: 3.803 ± 0.459
1.147IleHis: 1.147 ± 0.299
2.414IleIle: 2.414 ± 0.36
1.63IleLys: 1.63 ± 0.269
3.501IleLeu: 3.501 ± 0.497
0.845IleMet: 0.845 ± 0.183
1.871IleAsn: 1.871 ± 0.343
3.199IlePro: 3.199 ± 0.45
1.268IleGln: 1.268 ± 0.284
4.527IleArg: 4.527 ± 0.481
3.501IleSer: 3.501 ± 0.459
3.018IleThr: 3.018 ± 0.358
4.225IleVal: 4.225 ± 0.458
0.241IleTrp: 0.241 ± 0.148
1.569IleTyr: 1.569 ± 0.238
0.0IleXaa: 0.0 ± 0.0
Lys
4.829LysAla: 4.829 ± 0.77
0.181LysCys: 0.181 ± 0.099
2.958LysAsp: 2.958 ± 0.555
1.871LysGlu: 1.871 ± 0.259
1.569LysPhe: 1.569 ± 0.299
3.078LysGly: 3.078 ± 0.496
0.845LysHis: 0.845 ± 0.262
3.078LysIle: 3.078 ± 0.394
2.113LysLys: 2.113 ± 0.335
4.105LysLeu: 4.105 ± 0.483
0.664LysMet: 0.664 ± 0.173
1.449LysAsn: 1.449 ± 0.317
2.414LysPro: 2.414 ± 0.31
1.147LysGln: 1.147 ± 0.224
3.441LysArg: 3.441 ± 0.467
2.837LysSer: 2.837 ± 0.33
2.897LysThr: 2.897 ± 0.398
2.113LysVal: 2.113 ± 0.331
0.724LysTrp: 0.724 ± 0.17
1.086LysTyr: 1.086 ± 0.222
0.0LysXaa: 0.0 ± 0.0
Leu
10.02LeuAla: 10.02 ± 0.787
0.543LeuCys: 0.543 ± 0.205
4.889LeuAsp: 4.889 ± 0.459
4.587LeuGlu: 4.587 ± 0.533
2.414LeuPhe: 2.414 ± 0.322
5.553LeuGly: 5.553 ± 0.494
1.569LeuHis: 1.569 ± 0.278
3.742LeuIle: 3.742 ± 0.421
3.32LeuLys: 3.32 ± 0.424
5.734LeuLeu: 5.734 ± 0.582
2.233LeuMet: 2.233 ± 0.36
2.113LeuAsn: 2.113 ± 0.388
5.251LeuPro: 5.251 ± 0.595
3.742LeuGln: 3.742 ± 0.501
5.312LeuArg: 5.312 ± 0.486
6.64LeuSer: 6.64 ± 0.664
4.044LeuThr: 4.044 ± 0.516
5.493LeuVal: 5.493 ± 0.537
1.147LeuTrp: 1.147 ± 0.301
1.509LeuTyr: 1.509 ± 0.254
0.0LeuXaa: 0.0 ± 0.0
Met
2.958MetAla: 2.958 ± 0.405
0.241MetCys: 0.241 ± 0.124
1.449MetAsp: 1.449 ± 0.281
1.449MetGlu: 1.449 ± 0.341
0.845MetPhe: 0.845 ± 0.234
1.388MetGly: 1.388 ± 0.281
0.604MetHis: 0.604 ± 0.165
1.328MetIle: 1.328 ± 0.257
1.086MetLys: 1.086 ± 0.214
2.535MetLeu: 2.535 ± 0.378
0.724MetMet: 0.724 ± 0.209
0.785MetAsn: 0.785 ± 0.175
1.811MetPro: 1.811 ± 0.267
1.207MetGln: 1.207 ± 0.225
2.052MetArg: 2.052 ± 0.329
1.811MetSer: 1.811 ± 0.359
2.354MetThr: 2.354 ± 0.305
1.569MetVal: 1.569 ± 0.374
0.06MetTrp: 0.06 ± 0.062
0.241MetTyr: 0.241 ± 0.102
0.0MetXaa: 0.0 ± 0.0
Asn
3.682AsnAla: 3.682 ± 0.348
0.362AsnCys: 0.362 ± 0.163
1.449AsnAsp: 1.449 ± 0.269
1.328AsnGlu: 1.328 ± 0.292
0.785AsnPhe: 0.785 ± 0.226
3.199AsnGly: 3.199 ± 0.422
0.604AsnHis: 0.604 ± 0.177
1.509AsnIle: 1.509 ± 0.295
0.966AsnLys: 0.966 ± 0.276
2.414AsnLeu: 2.414 ± 0.405
0.785AsnMet: 0.785 ± 0.163
1.328AsnAsn: 1.328 ± 0.26
2.354AsnPro: 2.354 ± 0.417
0.966AsnGln: 0.966 ± 0.228
3.139AsnArg: 3.139 ± 0.386
1.509AsnSer: 1.509 ± 0.277
1.026AsnThr: 1.026 ± 0.215
1.811AsnVal: 1.811 ± 0.342
0.423AsnTrp: 0.423 ± 0.149
0.604AsnTyr: 0.604 ± 0.199
0.0AsnXaa: 0.0 ± 0.0
Pro
4.889ProAla: 4.889 ± 0.585
0.483ProCys: 0.483 ± 0.177
4.467ProAsp: 4.467 ± 0.53
4.105ProGlu: 4.105 ± 0.534
2.294ProPhe: 2.294 ± 0.425
4.346ProGly: 4.346 ± 0.521
1.449ProHis: 1.449 ± 0.298
1.992ProIle: 1.992 ± 0.333
1.992ProLys: 1.992 ± 0.39
3.622ProLeu: 3.622 ± 0.585
1.328ProMet: 1.328 ± 0.268
1.63ProAsn: 1.63 ± 0.263
2.837ProPro: 2.837 ± 0.425
1.388ProGln: 1.388 ± 0.309
3.018ProArg: 3.018 ± 0.423
3.863ProSer: 3.863 ± 0.546
2.777ProThr: 2.777 ± 0.471
3.501ProVal: 3.501 ± 0.494
0.543ProTrp: 0.543 ± 0.178
0.845ProTyr: 0.845 ± 0.234
0.0ProXaa: 0.0 ± 0.0
Gln
4.165GlnAla: 4.165 ± 0.404
0.543GlnCys: 0.543 ± 0.174
1.569GlnAsp: 1.569 ± 0.273
1.69GlnGlu: 1.69 ± 0.246
1.026GlnPhe: 1.026 ± 0.28
2.173GlnGly: 2.173 ± 0.294
0.664GlnHis: 0.664 ± 0.227
2.596GlnIle: 2.596 ± 0.422
1.992GlnLys: 1.992 ± 0.318
2.354GlnLeu: 2.354 ± 0.357
0.724GlnMet: 0.724 ± 0.203
1.147GlnAsn: 1.147 ± 0.263
1.811GlnPro: 1.811 ± 0.438
2.113GlnGln: 2.113 ± 0.304
3.259GlnArg: 3.259 ± 0.408
1.992GlnSer: 1.992 ± 0.293
1.932GlnThr: 1.932 ± 0.333
1.811GlnVal: 1.811 ± 0.304
0.724GlnTrp: 0.724 ± 0.204
0.845GlnTyr: 0.845 ± 0.29
0.0GlnXaa: 0.0 ± 0.0
Arg
5.976ArgAla: 5.976 ± 0.656
0.845ArgCys: 0.845 ± 0.229
4.587ArgAsp: 4.587 ± 0.514
4.829ArgGlu: 4.829 ± 0.531
3.199ArgPhe: 3.199 ± 0.384
5.251ArgGly: 5.251 ± 0.486
2.173ArgHis: 2.173 ± 0.336
4.95ArgIle: 4.95 ± 0.531
3.38ArgLys: 3.38 ± 0.455
7.907ArgLeu: 7.907 ± 0.654
2.294ArgMet: 2.294 ± 0.392
1.871ArgAsn: 1.871 ± 0.34
3.622ArgPro: 3.622 ± 0.487
2.958ArgGln: 2.958 ± 0.462
5.674ArgArg: 5.674 ± 0.685
3.803ArgSer: 3.803 ± 0.389
2.596ArgThr: 2.596 ± 0.318
4.708ArgVal: 4.708 ± 0.498
1.026ArgTrp: 1.026 ± 0.235
1.871ArgTyr: 1.871 ± 0.24
0.0ArgXaa: 0.0 ± 0.0
Ser
7.002SerAla: 7.002 ± 0.744
0.604SerCys: 0.604 ± 0.167
4.346SerAsp: 4.346 ± 0.491
3.682SerGlu: 3.682 ± 0.39
1.811SerPhe: 1.811 ± 0.37
5.855SerGly: 5.855 ± 0.691
0.905SerHis: 0.905 ± 0.213
3.199SerIle: 3.199 ± 0.392
2.716SerLys: 2.716 ± 0.361
4.346SerLeu: 4.346 ± 0.584
1.268SerMet: 1.268 ± 0.244
1.69SerAsn: 1.69 ± 0.269
3.139SerPro: 3.139 ± 0.381
1.509SerGln: 1.509 ± 0.321
4.406SerArg: 4.406 ± 0.393
3.984SerSer: 3.984 ± 0.6
4.165SerThr: 4.165 ± 0.574
5.01SerVal: 5.01 ± 0.455
1.026SerTrp: 1.026 ± 0.209
1.268SerTyr: 1.268 ± 0.272
0.0SerXaa: 0.0 ± 0.0
Thr
6.459ThrAla: 6.459 ± 0.721
0.241ThrCys: 0.241 ± 0.106
3.923ThrAsp: 3.923 ± 0.403
3.078ThrGlu: 3.078 ± 0.44
1.871ThrPhe: 1.871 ± 0.339
5.493ThrGly: 5.493 ± 0.515
1.449ThrHis: 1.449 ± 0.291
2.897ThrIle: 2.897 ± 0.409
2.233ThrLys: 2.233 ± 0.403
4.527ThrLeu: 4.527 ± 0.602
1.388ThrMet: 1.388 ± 0.295
1.509ThrAsn: 1.509 ± 0.354
2.535ThrPro: 2.535 ± 0.309
1.69ThrGln: 1.69 ± 0.305
2.777ThrArg: 2.777 ± 0.359
3.139ThrSer: 3.139 ± 0.415
2.414ThrThr: 2.414 ± 0.358
4.406ThrVal: 4.406 ± 0.531
0.604ThrTrp: 0.604 ± 0.197
1.388ThrTyr: 1.388 ± 0.258
0.0ThrXaa: 0.0 ± 0.0
Val
7.364ValAla: 7.364 ± 0.722
1.268ValCys: 1.268 ± 0.253
4.044ValAsp: 4.044 ± 0.503
4.467ValGlu: 4.467 ± 0.554
1.992ValPhe: 1.992 ± 0.295
3.863ValGly: 3.863 ± 0.414
1.388ValHis: 1.388 ± 0.272
3.078ValIle: 3.078 ± 0.465
2.294ValLys: 2.294 ± 0.319
4.406ValLeu: 4.406 ± 0.452
2.294ValMet: 2.294 ± 0.29
1.871ValAsn: 1.871 ± 0.29
2.897ValPro: 2.897 ± 0.375
2.354ValGln: 2.354 ± 0.394
5.01ValArg: 5.01 ± 0.571
5.312ValSer: 5.312 ± 0.593
4.467ValThr: 4.467 ± 0.446
4.406ValVal: 4.406 ± 0.54
1.026ValTrp: 1.026 ± 0.259
1.69ValTyr: 1.69 ± 0.334
0.0ValXaa: 0.0 ± 0.0
Trp
1.268TrpAla: 1.268 ± 0.286
0.241TrpCys: 0.241 ± 0.098
0.785TrpAsp: 0.785 ± 0.172
0.483TrpGlu: 0.483 ± 0.162
0.905TrpPhe: 0.905 ± 0.222
0.785TrpGly: 0.785 ± 0.204
0.483TrpHis: 0.483 ± 0.176
0.604TrpIle: 0.604 ± 0.319
0.905TrpLys: 0.905 ± 0.232
1.75TrpLeu: 1.75 ± 0.347
0.483TrpMet: 0.483 ± 0.166
0.423TrpAsn: 0.423 ± 0.179
0.966TrpPro: 0.966 ± 0.238
1.026TrpGln: 1.026 ± 0.284
1.328TrpArg: 1.328 ± 0.234
0.966TrpSer: 0.966 ± 0.23
0.604TrpThr: 0.604 ± 0.174
0.785TrpVal: 0.785 ± 0.203
0.181TrpTrp: 0.181 ± 0.108
0.423TrpTyr: 0.423 ± 0.164
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.535TyrAla: 2.535 ± 0.408
0.543TyrCys: 0.543 ± 0.187
1.509TyrAsp: 1.509 ± 0.264
1.932TyrGlu: 1.932 ± 0.3
0.664TyrPhe: 0.664 ± 0.195
2.233TyrGly: 2.233 ± 0.289
0.302TyrHis: 0.302 ± 0.132
1.207TyrIle: 1.207 ± 0.261
0.543TyrLys: 0.543 ± 0.154
2.294TyrLeu: 2.294 ± 0.423
0.302TyrMet: 0.302 ± 0.115
0.241TyrAsn: 0.241 ± 0.103
1.147TyrPro: 1.147 ± 0.256
1.147TyrGln: 1.147 ± 0.226
1.932TyrArg: 1.932 ± 0.307
1.63TyrSer: 1.63 ± 0.352
0.724TyrThr: 0.724 ± 0.199
1.388TyrVal: 1.388 ± 0.3
0.543TyrTrp: 0.543 ± 0.17
0.483TyrTyr: 0.483 ± 0.165
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (16568 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski