Amino acid dipepetide frequency for Xanthomonas phage RiverRider

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.686AlaAla: 10.686 ± 1.107
0.689AlaCys: 0.689 ± 0.225
5.774AlaAsp: 5.774 ± 0.548
5.946AlaGlu: 5.946 ± 0.663
2.758AlaPhe: 2.758 ± 0.347
6.851AlaGly: 6.851 ± 0.706
1.25AlaHis: 1.25 ± 0.238
4.869AlaIle: 4.869 ± 0.471
4.998AlaLys: 4.998 ± 0.526
8.488AlaLeu: 8.488 ± 1.12
3.059AlaMet: 3.059 ± 0.374
4.223AlaAsn: 4.223 ± 0.471
4.266AlaPro: 4.266 ± 0.704
5.041AlaGln: 5.041 ± 0.669
3.663AlaArg: 3.663 ± 0.387
4.912AlaSer: 4.912 ± 0.473
5.558AlaThr: 5.558 ± 0.534
6.549AlaVal: 6.549 ± 0.685
1.293AlaTrp: 1.293 ± 0.266
2.715AlaTyr: 2.715 ± 0.277
0.0AlaXaa: 0.0 ± 0.0
Cys
0.345CysAla: 0.345 ± 0.121
0.129CysCys: 0.129 ± 0.09
0.474CysAsp: 0.474 ± 0.171
0.259CysGlu: 0.259 ± 0.111
0.345CysPhe: 0.345 ± 0.127
0.776CysGly: 0.776 ± 0.25
0.172CysHis: 0.172 ± 0.082
0.517CysIle: 0.517 ± 0.17
0.259CysLys: 0.259 ± 0.106
0.56CysLeu: 0.56 ± 0.179
0.086CysMet: 0.086 ± 0.069
0.603CysAsn: 0.603 ± 0.238
0.172CysPro: 0.172 ± 0.094
0.302CysGln: 0.302 ± 0.144
0.388CysArg: 0.388 ± 0.166
0.689CysSer: 0.689 ± 0.207
0.474CysThr: 0.474 ± 0.143
0.388CysVal: 0.388 ± 0.141
0.129CysTrp: 0.129 ± 0.08
0.172CysTyr: 0.172 ± 0.103
0.0CysXaa: 0.0 ± 0.0
Asp
5.989AspAla: 5.989 ± 0.513
0.517AspCys: 0.517 ± 0.185
3.663AspAsp: 3.663 ± 0.379
3.878AspGlu: 3.878 ± 0.542
2.284AspPhe: 2.284 ± 0.298
4.352AspGly: 4.352 ± 0.402
1.68AspHis: 1.68 ± 0.287
3.275AspIle: 3.275 ± 0.489
2.413AspLys: 2.413 ± 0.286
5.731AspLeu: 5.731 ± 0.592
1.465AspMet: 1.465 ± 0.309
2.241AspAsn: 2.241 ± 0.312
3.059AspPro: 3.059 ± 0.321
3.663AspGln: 3.663 ± 0.412
3.102AspArg: 3.102 ± 0.402
2.671AspSer: 2.671 ± 0.323
3.533AspThr: 3.533 ± 0.481
4.266AspVal: 4.266 ± 0.315
0.776AspTrp: 0.776 ± 0.145
2.37AspTyr: 2.37 ± 0.289
0.0AspXaa: 0.0 ± 0.0
Glu
5.774GluAla: 5.774 ± 0.701
0.345GluCys: 0.345 ± 0.126
3.533GluAsp: 3.533 ± 0.362
4.654GluGlu: 4.654 ± 0.577
2.758GluPhe: 2.758 ± 0.311
3.318GluGly: 3.318 ± 0.326
1.206GluHis: 1.206 ± 0.256
4.438GluIle: 4.438 ± 0.678
3.059GluLys: 3.059 ± 0.333
4.998GluLeu: 4.998 ± 0.491
1.379GluMet: 1.379 ± 0.223
2.542GluAsn: 2.542 ± 0.297
1.508GluPro: 1.508 ± 0.261
3.49GluGln: 3.49 ± 0.425
3.49GluArg: 3.49 ± 0.393
3.921GluSer: 3.921 ± 0.297
3.663GluThr: 3.663 ± 0.383
4.395GluVal: 4.395 ± 0.507
0.689GluTrp: 0.689 ± 0.21
1.982GluTyr: 1.982 ± 0.328
0.0GluXaa: 0.0 ± 0.0
Phe
2.241PheAla: 2.241 ± 0.416
0.302PheCys: 0.302 ± 0.144
2.284PheAsp: 2.284 ± 0.311
1.81PheGlu: 1.81 ± 0.336
1.077PhePhe: 1.077 ± 0.224
3.102PheGly: 3.102 ± 0.283
0.646PheHis: 0.646 ± 0.184
2.111PheIle: 2.111 ± 0.326
2.154PheLys: 2.154 ± 0.278
2.456PheLeu: 2.456 ± 0.384
1.293PheMet: 1.293 ± 0.273
2.413PheAsn: 2.413 ± 0.358
1.163PhePro: 1.163 ± 0.202
1.853PheGln: 1.853 ± 0.291
1.81PheArg: 1.81 ± 0.276
2.284PheSer: 2.284 ± 0.304
2.241PheThr: 2.241 ± 0.273
1.939PheVal: 1.939 ± 0.328
0.388PheTrp: 0.388 ± 0.118
1.379PheTyr: 1.379 ± 0.248
0.0PheXaa: 0.0 ± 0.0
Gly
6.119GlyAla: 6.119 ± 0.684
0.646GlyCys: 0.646 ± 0.209
3.792GlyAsp: 3.792 ± 0.661
3.835GlyGlu: 3.835 ± 0.371
2.542GlyPhe: 2.542 ± 0.334
4.007GlyGly: 4.007 ± 0.614
0.776GlyHis: 0.776 ± 0.166
4.309GlyIle: 4.309 ± 0.338
4.223GlyLys: 4.223 ± 0.501
5.472GlyLeu: 5.472 ± 0.528
1.982GlyMet: 1.982 ± 0.301
3.878GlyAsn: 3.878 ± 0.522
1.982GlyPro: 1.982 ± 0.267
3.361GlyGln: 3.361 ± 0.359
3.189GlyArg: 3.189 ± 0.615
4.912GlySer: 4.912 ± 0.513
4.869GlyThr: 4.869 ± 0.683
4.74GlyVal: 4.74 ± 0.547
0.948GlyTrp: 0.948 ± 0.252
2.844GlyTyr: 2.844 ± 0.279
0.0GlyXaa: 0.0 ± 0.0
His
1.81HisAla: 1.81 ± 0.379
0.215HisCys: 0.215 ± 0.116
1.034HisAsp: 1.034 ± 0.235
1.077HisGlu: 1.077 ± 0.206
0.819HisPhe: 0.819 ± 0.189
1.336HisGly: 1.336 ± 0.305
0.345HisHis: 0.345 ± 0.183
0.603HisIle: 0.603 ± 0.156
0.862HisLys: 0.862 ± 0.201
1.982HisLeu: 1.982 ± 0.371
0.474HisMet: 0.474 ± 0.165
0.819HisAsn: 0.819 ± 0.191
0.474HisPro: 0.474 ± 0.163
0.819HisGln: 0.819 ± 0.212
1.25HisArg: 1.25 ± 0.262
1.163HisSer: 1.163 ± 0.175
0.819HisThr: 0.819 ± 0.197
0.689HisVal: 0.689 ± 0.173
0.388HisTrp: 0.388 ± 0.15
0.862HisTyr: 0.862 ± 0.183
0.0HisXaa: 0.0 ± 0.0
Ile
4.74IleAla: 4.74 ± 0.493
0.345IleCys: 0.345 ± 0.143
4.137IleAsp: 4.137 ± 0.638
4.223IleGlu: 4.223 ± 0.382
1.206IlePhe: 1.206 ± 0.235
3.619IleGly: 3.619 ± 0.469
1.077IleHis: 1.077 ± 0.226
3.189IleIle: 3.189 ± 0.498
3.619IleLys: 3.619 ± 0.492
3.663IleLeu: 3.663 ± 0.499
1.293IleMet: 1.293 ± 0.229
3.835IleAsn: 3.835 ± 0.438
3.102IlePro: 3.102 ± 0.5
2.198IleGln: 2.198 ± 0.351
3.533IleArg: 3.533 ± 0.414
3.275IleSer: 3.275 ± 0.348
4.223IleThr: 4.223 ± 0.424
3.145IleVal: 3.145 ± 0.484
0.862IleTrp: 0.862 ± 0.187
1.982IleTyr: 1.982 ± 0.236
0.0IleXaa: 0.0 ± 0.0
Lys
5.558LysAla: 5.558 ± 0.454
0.172LysCys: 0.172 ± 0.097
2.801LysAsp: 2.801 ± 0.329
3.361LysGlu: 3.361 ± 0.565
1.379LysPhe: 1.379 ± 0.224
3.318LysGly: 3.318 ± 0.577
1.25LysHis: 1.25 ± 0.306
2.973LysIle: 2.973 ± 0.31
4.352LysLys: 4.352 ± 0.56
5.343LysLeu: 5.343 ± 0.683
1.896LysMet: 1.896 ± 0.253
2.456LysAsn: 2.456 ± 0.317
2.111LysPro: 2.111 ± 0.32
2.327LysGln: 2.327 ± 0.406
2.844LysArg: 2.844 ± 0.371
2.887LysSer: 2.887 ± 0.345
3.706LysThr: 3.706 ± 0.349
3.404LysVal: 3.404 ± 0.392
0.733LysTrp: 0.733 ± 0.157
2.068LysTyr: 2.068 ± 0.367
0.0LysXaa: 0.0 ± 0.0
Leu
7.411LeuAla: 7.411 ± 0.615
0.776LeuCys: 0.776 ± 0.192
5.903LeuAsp: 5.903 ± 0.489
5.041LeuGlu: 5.041 ± 0.41
3.059LeuPhe: 3.059 ± 0.384
5.084LeuGly: 5.084 ± 0.42
1.293LeuHis: 1.293 ± 0.265
5.128LeuIle: 5.128 ± 0.495
4.869LeuLys: 4.869 ± 0.519
6.248LeuLeu: 6.248 ± 0.706
2.241LeuMet: 2.241 ± 0.312
4.998LeuAsn: 4.998 ± 0.474
3.835LeuPro: 3.835 ± 0.318
3.49LeuGln: 3.49 ± 0.365
4.783LeuArg: 4.783 ± 0.557
5.3LeuSer: 5.3 ± 0.482
5.645LeuThr: 5.645 ± 0.475
5.257LeuVal: 5.257 ± 0.528
0.862LeuTrp: 0.862 ± 0.188
2.327LeuTyr: 2.327 ± 0.323
0.0LeuXaa: 0.0 ± 0.0
Met
3.275MetAla: 3.275 ± 0.472
0.259MetCys: 0.259 ± 0.116
1.68MetAsp: 1.68 ± 0.312
1.896MetGlu: 1.896 ± 0.347
0.991MetPhe: 0.991 ± 0.185
1.422MetGly: 1.422 ± 0.221
0.172MetHis: 0.172 ± 0.114
1.767MetIle: 1.767 ± 0.321
1.594MetLys: 1.594 ± 0.31
2.284MetLeu: 2.284 ± 0.294
0.431MetMet: 0.431 ± 0.133
1.379MetAsn: 1.379 ± 0.235
1.12MetPro: 1.12 ± 0.194
1.767MetGln: 1.767 ± 0.327
1.465MetArg: 1.465 ± 0.266
2.025MetSer: 2.025 ± 0.264
1.724MetThr: 1.724 ± 0.352
1.336MetVal: 1.336 ± 0.219
0.345MetTrp: 0.345 ± 0.123
0.733MetTyr: 0.733 ± 0.162
0.0MetXaa: 0.0 ± 0.0
Asn
5.084AsnAla: 5.084 ± 0.674
0.388AsnCys: 0.388 ± 0.143
2.973AsnAsp: 2.973 ± 0.312
3.145AsnGlu: 3.145 ± 0.387
1.939AsnPhe: 1.939 ± 0.283
3.921AsnGly: 3.921 ± 0.415
1.12AsnHis: 1.12 ± 0.249
2.025AsnIle: 2.025 ± 0.273
2.93AsnLys: 2.93 ± 0.379
4.524AsnLeu: 4.524 ± 0.407
1.422AsnMet: 1.422 ± 0.183
2.758AsnAsn: 2.758 ± 0.331
2.93AsnPro: 2.93 ± 0.402
2.542AsnGln: 2.542 ± 0.391
2.758AsnArg: 2.758 ± 0.415
3.663AsnSer: 3.663 ± 0.485
2.844AsnThr: 2.844 ± 0.347
3.145AsnVal: 3.145 ± 0.445
0.603AsnTrp: 0.603 ± 0.192
1.465AsnTyr: 1.465 ± 0.203
0.0AsnXaa: 0.0 ± 0.0
Pro
4.18ProAla: 4.18 ± 0.445
0.302ProCys: 0.302 ± 0.127
2.844ProAsp: 2.844 ± 0.375
2.887ProGlu: 2.887 ± 0.388
1.68ProPhe: 1.68 ± 0.322
3.102ProGly: 3.102 ± 0.493
0.689ProHis: 0.689 ± 0.171
2.37ProIle: 2.37 ± 0.313
2.241ProLys: 2.241 ± 0.339
3.318ProLeu: 3.318 ± 0.45
1.12ProMet: 1.12 ± 0.179
2.284ProAsn: 2.284 ± 0.34
0.948ProPro: 0.948 ± 0.278
1.594ProGln: 1.594 ± 0.292
1.336ProArg: 1.336 ± 0.296
3.059ProSer: 3.059 ± 0.564
2.456ProThr: 2.456 ± 0.308
3.016ProVal: 3.016 ± 0.41
0.776ProTrp: 0.776 ± 0.239
1.293ProTyr: 1.293 ± 0.271
0.0ProXaa: 0.0 ± 0.0
Gln
5.386GlnAla: 5.386 ± 0.682
0.302GlnCys: 0.302 ± 0.126
2.198GlnAsp: 2.198 ± 0.342
2.456GlnGlu: 2.456 ± 0.397
1.594GlnPhe: 1.594 ± 0.284
2.758GlnGly: 2.758 ± 0.415
1.293GlnHis: 1.293 ± 0.3
2.456GlnIle: 2.456 ± 0.308
2.413GlnLys: 2.413 ± 0.332
4.697GlnLeu: 4.697 ± 0.577
1.336GlnMet: 1.336 ± 0.199
2.198GlnAsn: 2.198 ± 0.384
1.724GlnPro: 1.724 ± 0.292
2.758GlnGln: 2.758 ± 0.388
3.059GlnArg: 3.059 ± 0.414
2.887GlnSer: 2.887 ± 0.465
3.145GlnThr: 3.145 ± 0.332
3.447GlnVal: 3.447 ± 0.329
0.819GlnTrp: 0.819 ± 0.17
1.508GlnTyr: 1.508 ± 0.325
0.0GlnXaa: 0.0 ± 0.0
Arg
4.438ArgAla: 4.438 ± 0.573
0.517ArgCys: 0.517 ± 0.171
2.887ArgAsp: 2.887 ± 0.429
3.145ArgGlu: 3.145 ± 0.412
2.025ArgPhe: 2.025 ± 0.368
3.49ArgGly: 3.49 ± 0.381
0.862ArgHis: 0.862 ± 0.208
3.016ArgIle: 3.016 ± 0.445
2.154ArgLys: 2.154 ± 0.237
4.697ArgLeu: 4.697 ± 0.501
1.422ArgMet: 1.422 ± 0.26
2.542ArgAsn: 2.542 ± 0.323
2.37ArgPro: 2.37 ± 0.384
2.37ArgGln: 2.37 ± 0.349
2.671ArgArg: 2.671 ± 0.382
4.137ArgSer: 4.137 ± 0.522
3.361ArgThr: 3.361 ± 0.447
3.533ArgVal: 3.533 ± 0.373
0.689ArgTrp: 0.689 ± 0.196
1.293ArgTyr: 1.293 ± 0.22
0.0ArgXaa: 0.0 ± 0.0
Ser
4.74SerAla: 4.74 ± 0.505
0.345SerCys: 0.345 ± 0.115
3.576SerAsp: 3.576 ± 0.354
3.576SerGlu: 3.576 ± 0.31
2.585SerPhe: 2.585 ± 0.348
4.654SerGly: 4.654 ± 0.426
0.905SerHis: 0.905 ± 0.216
3.404SerIle: 3.404 ± 0.366
3.145SerLys: 3.145 ± 0.373
5.731SerLeu: 5.731 ± 0.587
2.37SerMet: 2.37 ± 0.284
2.887SerAsn: 2.887 ± 0.344
2.413SerPro: 2.413 ± 0.267
2.671SerGln: 2.671 ± 0.354
3.016SerArg: 3.016 ± 0.377
3.533SerSer: 3.533 ± 0.357
4.352SerThr: 4.352 ± 0.615
4.266SerVal: 4.266 ± 0.525
0.991SerTrp: 0.991 ± 0.21
2.671SerTyr: 2.671 ± 0.377
0.0SerXaa: 0.0 ± 0.0
Thr
5.602ThrAla: 5.602 ± 0.54
0.172ThrCys: 0.172 ± 0.094
4.309ThrAsp: 4.309 ± 0.49
3.706ThrGlu: 3.706 ± 0.365
2.241ThrPhe: 2.241 ± 0.315
5.386ThrGly: 5.386 ± 0.633
1.163ThrHis: 1.163 ± 0.213
3.878ThrIle: 3.878 ± 0.467
3.447ThrLys: 3.447 ± 0.469
4.137ThrLeu: 4.137 ± 0.38
1.551ThrMet: 1.551 ± 0.346
3.404ThrAsn: 3.404 ± 0.469
3.706ThrPro: 3.706 ± 0.492
2.844ThrGln: 2.844 ± 0.347
2.758ThrArg: 2.758 ± 0.339
3.404ThrSer: 3.404 ± 0.364
4.524ThrThr: 4.524 ± 0.579
4.74ThrVal: 4.74 ± 0.337
0.905ThrTrp: 0.905 ± 0.187
2.241ThrTyr: 2.241 ± 0.422
0.0ThrXaa: 0.0 ± 0.0
Val
6.075ValAla: 6.075 ± 0.854
0.474ValCys: 0.474 ± 0.218
4.266ValAsp: 4.266 ± 0.456
4.395ValGlu: 4.395 ± 0.388
2.154ValPhe: 2.154 ± 0.241
4.567ValGly: 4.567 ± 0.606
0.948ValHis: 0.948 ± 0.221
4.223ValIle: 4.223 ± 0.457
3.318ValLys: 3.318 ± 0.423
5.343ValLeu: 5.343 ± 0.536
1.81ValMet: 1.81 ± 0.352
4.266ValAsn: 4.266 ± 0.362
3.059ValPro: 3.059 ± 0.447
2.499ValGln: 2.499 ± 0.277
3.749ValArg: 3.749 ± 0.428
4.18ValSer: 4.18 ± 0.3
4.481ValThr: 4.481 ± 0.435
4.869ValVal: 4.869 ± 0.51
0.431ValTrp: 0.431 ± 0.135
1.68ValTyr: 1.68 ± 0.253
0.0ValXaa: 0.0 ± 0.0
Trp
1.25TrpAla: 1.25 ± 0.199
0.086TrpCys: 0.086 ± 0.063
1.077TrpAsp: 1.077 ± 0.235
0.56TrpGlu: 0.56 ± 0.154
0.733TrpPhe: 0.733 ± 0.168
1.034TrpGly: 1.034 ± 0.223
0.431TrpHis: 0.431 ± 0.207
0.776TrpIle: 0.776 ± 0.215
1.163TrpLys: 1.163 ± 0.231
1.25TrpLeu: 1.25 ± 0.234
0.086TrpMet: 0.086 ± 0.068
0.646TrpAsn: 0.646 ± 0.182
0.215TrpPro: 0.215 ± 0.095
0.646TrpGln: 0.646 ± 0.137
0.603TrpArg: 0.603 ± 0.171
0.862TrpSer: 0.862 ± 0.185
0.517TrpThr: 0.517 ± 0.185
0.689TrpVal: 0.689 ± 0.158
0.043TrpTrp: 0.043 ± 0.043
0.474TrpTyr: 0.474 ± 0.166
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.887TyrAla: 2.887 ± 0.266
0.259TyrCys: 0.259 ± 0.13
1.724TyrAsp: 1.724 ± 0.222
1.336TyrGlu: 1.336 ± 0.278
0.776TyrPhe: 0.776 ± 0.223
2.327TyrGly: 2.327 ± 0.334
0.56TyrHis: 0.56 ± 0.173
1.939TyrIle: 1.939 ± 0.313
1.724TyrLys: 1.724 ± 0.275
2.628TyrLeu: 2.628 ± 0.29
0.905TyrMet: 0.905 ± 0.201
1.81TyrAsn: 1.81 ± 0.194
1.336TyrPro: 1.336 ± 0.286
2.198TyrGln: 2.198 ± 0.299
2.154TyrArg: 2.154 ± 0.395
2.068TyrSer: 2.068 ± 0.316
1.982TyrThr: 1.982 ± 0.381
2.973TyrVal: 2.973 ± 0.318
0.474TyrTrp: 0.474 ± 0.143
0.819TyrTyr: 0.819 ± 0.219
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 89 proteins (23209 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski