Amino acid dipepetide frequency for Xanthomonas phage Bosa

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.373AlaAla: 12.373 ± 1.2
0.777AlaCys: 0.777 ± 0.179
6.523AlaAsp: 6.523 ± 0.668
6.627AlaGlu: 6.627 ± 0.768
3.624AlaPhe: 3.624 ± 0.459
7.662AlaGly: 7.662 ± 0.633
2.537AlaHis: 2.537 ± 0.366
4.608AlaIle: 4.608 ± 0.566
6.057AlaLys: 6.057 ± 0.884
8.697AlaLeu: 8.697 ± 0.793
2.589AlaMet: 2.589 ± 0.397
2.951AlaAsn: 2.951 ± 0.352
4.815AlaPro: 4.815 ± 0.415
3.417AlaGln: 3.417 ± 0.559
5.954AlaArg: 5.954 ± 0.536
6.627AlaSer: 6.627 ± 0.635
5.747AlaThr: 5.747 ± 0.588
6.937AlaVal: 6.937 ± 0.474
1.916AlaTrp: 1.916 ± 0.269
3.158AlaTyr: 3.158 ± 0.425
0.0AlaXaa: 0.0 ± 0.0
Cys
0.569CysAla: 0.569 ± 0.165
0.311CysCys: 0.311 ± 0.104
0.518CysAsp: 0.518 ± 0.178
0.259CysGlu: 0.259 ± 0.127
0.518CysPhe: 0.518 ± 0.152
0.725CysGly: 0.725 ± 0.224
0.259CysHis: 0.259 ± 0.096
0.311CysIle: 0.311 ± 0.112
0.207CysLys: 0.207 ± 0.092
1.035CysLeu: 1.035 ± 0.249
0.362CysMet: 0.362 ± 0.141
0.466CysAsn: 0.466 ± 0.198
0.518CysPro: 0.518 ± 0.174
0.311CysGln: 0.311 ± 0.141
0.725CysArg: 0.725 ± 0.181
0.414CysSer: 0.414 ± 0.177
0.362CysThr: 0.362 ± 0.128
0.621CysVal: 0.621 ± 0.197
0.155CysTrp: 0.155 ± 0.084
0.207CysTyr: 0.207 ± 0.087
0.0CysXaa: 0.0 ± 0.0
Asp
7.144AspAla: 7.144 ± 0.756
0.518AspCys: 0.518 ± 0.176
4.4AspAsp: 4.4 ± 0.57
4.297AspGlu: 4.297 ± 0.372
2.847AspPhe: 2.847 ± 0.488
7.455AspGly: 7.455 ± 0.772
0.673AspHis: 0.673 ± 0.184
1.501AspIle: 1.501 ± 0.27
3.262AspLys: 3.262 ± 0.406
6.109AspLeu: 6.109 ± 0.482
1.657AspMet: 1.657 ± 0.322
1.45AspAsn: 1.45 ± 0.218
4.452AspPro: 4.452 ± 0.462
2.174AspGln: 2.174 ± 0.311
3.624AspArg: 3.624 ± 0.568
3.52AspSer: 3.52 ± 0.471
2.589AspThr: 2.589 ± 0.455
4.659AspVal: 4.659 ± 0.428
0.932AspTrp: 0.932 ± 0.239
1.657AspTyr: 1.657 ± 0.352
0.0AspXaa: 0.0 ± 0.0
Glu
7.507GluAla: 7.507 ± 0.841
0.621GluCys: 0.621 ± 0.174
3.572GluAsp: 3.572 ± 0.468
4.245GluGlu: 4.245 ± 0.557
2.278GluPhe: 2.278 ± 0.401
5.229GluGly: 5.229 ± 0.589
1.501GluHis: 1.501 ± 0.323
2.226GluIle: 2.226 ± 0.416
3.003GluLys: 3.003 ± 0.463
7.403GluLeu: 7.403 ± 0.666
1.76GluMet: 1.76 ± 0.296
2.174GluAsn: 2.174 ± 0.324
2.278GluPro: 2.278 ± 0.416
3.417GluGln: 3.417 ± 0.434
3.883GluArg: 3.883 ± 0.605
3.365GluSer: 3.365 ± 0.388
3.21GluThr: 3.21 ± 0.46
4.349GluVal: 4.349 ± 0.434
1.346GluTrp: 1.346 ± 0.206
1.553GluTyr: 1.553 ± 0.27
0.0GluXaa: 0.0 ± 0.0
Phe
3.003PheAla: 3.003 ± 0.437
0.362PheCys: 0.362 ± 0.144
3.054PheAsp: 3.054 ± 0.459
2.796PheGlu: 2.796 ± 0.429
1.035PhePhe: 1.035 ± 0.241
2.589PheGly: 2.589 ± 0.345
0.518PheHis: 0.518 ± 0.199
1.501PheIle: 1.501 ± 0.285
1.087PheLys: 1.087 ± 0.197
2.744PheLeu: 2.744 ± 0.361
0.88PheMet: 0.88 ± 0.222
1.294PheAsn: 1.294 ± 0.247
2.019PhePro: 2.019 ± 0.303
1.501PheGln: 1.501 ± 0.302
2.899PheArg: 2.899 ± 0.389
2.071PheSer: 2.071 ± 0.306
2.019PheThr: 2.019 ± 0.353
1.76PheVal: 1.76 ± 0.282
0.518PheTrp: 0.518 ± 0.144
0.828PheTyr: 0.828 ± 0.212
0.0PheXaa: 0.0 ± 0.0
Gly
6.161GlyAla: 6.161 ± 0.72
1.035GlyCys: 1.035 ± 0.259
5.902GlyAsp: 5.902 ± 0.637
5.695GlyGlu: 5.695 ± 0.598
3.158GlyPhe: 3.158 ± 0.534
6.937GlyGly: 6.937 ± 0.584
1.294GlyHis: 1.294 ± 0.234
3.417GlyIle: 3.417 ± 0.418
4.659GlyLys: 4.659 ± 0.607
7.196GlyLeu: 7.196 ± 0.434
2.226GlyMet: 2.226 ± 0.451
2.796GlyAsn: 2.796 ± 0.336
2.899GlyPro: 2.899 ± 0.424
3.158GlyGln: 3.158 ± 0.354
5.85GlyArg: 5.85 ± 0.705
4.556GlySer: 4.556 ± 0.693
5.591GlyThr: 5.591 ± 0.739
5.798GlyVal: 5.798 ± 0.502
1.864GlyTrp: 1.864 ± 0.326
2.589GlyTyr: 2.589 ± 0.437
0.0GlyXaa: 0.0 ± 0.0
His
2.019HisAla: 2.019 ± 0.323
0.207HisCys: 0.207 ± 0.095
0.88HisAsp: 0.88 ± 0.204
1.294HisGlu: 1.294 ± 0.236
0.362HisPhe: 0.362 ± 0.12
1.45HisGly: 1.45 ± 0.23
0.362HisHis: 0.362 ± 0.145
0.984HisIle: 0.984 ± 0.24
0.828HisLys: 0.828 ± 0.184
1.657HisLeu: 1.657 ± 0.316
0.569HisMet: 0.569 ± 0.173
0.414HisAsn: 0.414 ± 0.166
1.398HisPro: 1.398 ± 0.261
1.087HisGln: 1.087 ± 0.218
1.087HisArg: 1.087 ± 0.214
1.087HisSer: 1.087 ± 0.201
0.828HisThr: 0.828 ± 0.218
1.346HisVal: 1.346 ± 0.2
0.518HisTrp: 0.518 ± 0.147
0.466HisTyr: 0.466 ± 0.16
0.0HisXaa: 0.0 ± 0.0
Ile
3.417IleAla: 3.417 ± 0.451
0.207IleCys: 0.207 ± 0.139
3.313IleAsp: 3.313 ± 0.336
2.744IleGlu: 2.744 ± 0.441
1.605IlePhe: 1.605 ± 0.246
3.054IleGly: 3.054 ± 0.368
0.621IleHis: 0.621 ± 0.19
1.294IleIle: 1.294 ± 0.265
2.537IleLys: 2.537 ± 0.326
2.123IleLeu: 2.123 ± 0.404
0.828IleMet: 0.828 ± 0.2
1.191IleAsn: 1.191 ± 0.243
2.019IlePro: 2.019 ± 0.289
1.967IleGln: 1.967 ± 0.314
2.796IleArg: 2.796 ± 0.402
1.553IleSer: 1.553 ± 0.3
2.537IleThr: 2.537 ± 0.518
2.64IleVal: 2.64 ± 0.443
0.414IleTrp: 0.414 ± 0.123
0.88IleTyr: 0.88 ± 0.251
0.0IleXaa: 0.0 ± 0.0
Lys
7.041LysAla: 7.041 ± 1.075
0.311LysCys: 0.311 ± 0.129
2.744LysAsp: 2.744 ± 0.441
2.692LysGlu: 2.692 ± 0.45
1.501LysPhe: 1.501 ± 0.273
4.038LysGly: 4.038 ± 0.554
0.414LysHis: 0.414 ± 0.157
1.242LysIle: 1.242 ± 0.238
2.744LysLys: 2.744 ± 0.499
4.452LysLeu: 4.452 ± 0.489
1.657LysMet: 1.657 ± 0.285
1.242LysAsn: 1.242 ± 0.217
3.624LysPro: 3.624 ± 0.438
2.33LysGln: 2.33 ± 0.412
2.847LysArg: 2.847 ± 0.563
2.847LysSer: 2.847 ± 0.409
2.796LysThr: 2.796 ± 0.414
3.054LysVal: 3.054 ± 0.387
0.518LysTrp: 0.518 ± 0.113
1.294LysTyr: 1.294 ± 0.284
0.0LysXaa: 0.0 ± 0.0
Leu
9.992LeuAla: 9.992 ± 0.763
0.518LeuCys: 0.518 ± 0.158
5.747LeuAsp: 5.747 ± 0.497
5.332LeuGlu: 5.332 ± 0.65
2.537LeuPhe: 2.537 ± 0.37
6.782LeuGly: 6.782 ± 0.549
1.398LeuHis: 1.398 ± 0.264
2.847LeuIle: 2.847 ± 0.377
4.815LeuLys: 4.815 ± 0.557
6.782LeuLeu: 6.782 ± 0.563
2.174LeuMet: 2.174 ± 0.359
3.262LeuAsn: 3.262 ± 0.346
4.918LeuPro: 4.918 ± 0.512
3.262LeuGln: 3.262 ± 0.505
5.798LeuArg: 5.798 ± 0.519
5.074LeuSer: 5.074 ± 0.561
5.022LeuThr: 5.022 ± 0.438
6.161LeuVal: 6.161 ± 0.598
1.553LeuTrp: 1.553 ± 0.263
2.433LeuTyr: 2.433 ± 0.4
0.0LeuXaa: 0.0 ± 0.0
Met
3.676MetAla: 3.676 ± 0.352
0.104MetCys: 0.104 ± 0.065
1.657MetAsp: 1.657 ± 0.251
1.501MetGlu: 1.501 ± 0.304
0.569MetPhe: 0.569 ± 0.192
1.812MetGly: 1.812 ± 0.363
0.673MetHis: 0.673 ± 0.186
0.828MetIle: 0.828 ± 0.196
1.035MetLys: 1.035 ± 0.236
1.812MetLeu: 1.812 ± 0.255
0.828MetMet: 0.828 ± 0.197
0.828MetAsn: 0.828 ± 0.207
1.501MetPro: 1.501 ± 0.321
0.777MetGln: 0.777 ± 0.205
1.864MetArg: 1.864 ± 0.328
2.381MetSer: 2.381 ± 0.391
1.708MetThr: 1.708 ± 0.3
1.657MetVal: 1.657 ± 0.299
0.311MetTrp: 0.311 ± 0.111
0.414MetTyr: 0.414 ± 0.127
0.0MetXaa: 0.0 ± 0.0
Asn
2.847AsnAla: 2.847 ± 0.43
0.362AsnCys: 0.362 ± 0.135
1.087AsnAsp: 1.087 ± 0.248
1.864AsnGlu: 1.864 ± 0.275
0.984AsnPhe: 0.984 ± 0.204
3.469AsnGly: 3.469 ± 0.418
0.518AsnHis: 0.518 ± 0.128
1.346AsnIle: 1.346 ± 0.274
1.45AsnLys: 1.45 ± 0.268
2.589AsnLeu: 2.589 ± 0.354
0.932AsnMet: 0.932 ± 0.196
1.398AsnAsn: 1.398 ± 0.243
1.864AsnPro: 1.864 ± 0.395
1.501AsnGln: 1.501 ± 0.295
1.501AsnArg: 1.501 ± 0.258
1.605AsnSer: 1.605 ± 0.284
1.916AsnThr: 1.916 ± 0.386
2.433AsnVal: 2.433 ± 0.27
1.191AsnTrp: 1.191 ± 0.2
0.725AsnTyr: 0.725 ± 0.196
0.0AsnXaa: 0.0 ± 0.0
Pro
4.815ProAla: 4.815 ± 0.486
0.466ProCys: 0.466 ± 0.16
4.09ProAsp: 4.09 ± 0.416
4.09ProGlu: 4.09 ± 0.519
1.657ProPhe: 1.657 ± 0.267
5.229ProGly: 5.229 ± 0.597
1.242ProHis: 1.242 ± 0.235
2.019ProIle: 2.019 ± 0.3
2.485ProLys: 2.485 ± 0.392
3.003ProLeu: 3.003 ± 0.37
1.139ProMet: 1.139 ± 0.295
2.071ProAsn: 2.071 ± 0.282
2.951ProPro: 2.951 ± 0.539
1.657ProGln: 1.657 ± 0.324
2.64ProArg: 2.64 ± 0.415
3.262ProSer: 3.262 ± 0.437
3.624ProThr: 3.624 ± 0.429
3.676ProVal: 3.676 ± 0.407
0.518ProTrp: 0.518 ± 0.187
1.76ProTyr: 1.76 ± 0.291
0.0ProXaa: 0.0 ± 0.0
Gln
5.539GlnAla: 5.539 ± 0.546
0.052GlnCys: 0.052 ± 0.047
2.174GlnAsp: 2.174 ± 0.345
1.501GlnGlu: 1.501 ± 0.347
1.45GlnPhe: 1.45 ± 0.286
3.21GlnGly: 3.21 ± 0.37
0.777GlnHis: 0.777 ± 0.185
1.242GlnIle: 1.242 ± 0.226
1.76GlnLys: 1.76 ± 0.348
4.038GlnLeu: 4.038 ± 0.413
1.398GlnMet: 1.398 ± 0.22
0.777GlnAsn: 0.777 ± 0.25
2.123GlnPro: 2.123 ± 0.31
1.864GlnGln: 1.864 ± 0.337
2.899GlnArg: 2.899 ± 0.442
1.967GlnSer: 1.967 ± 0.382
2.381GlnThr: 2.381 ± 0.284
3.21GlnVal: 3.21 ± 0.386
1.035GlnTrp: 1.035 ± 0.208
1.398GlnTyr: 1.398 ± 0.312
0.0GlnXaa: 0.0 ± 0.0
Arg
7.093ArgAla: 7.093 ± 0.714
0.621ArgCys: 0.621 ± 0.174
4.193ArgAsp: 4.193 ± 0.476
4.815ArgGlu: 4.815 ± 0.578
1.864ArgPhe: 1.864 ± 0.368
4.711ArgGly: 4.711 ± 0.481
1.657ArgHis: 1.657 ± 0.323
2.485ArgIle: 2.485 ± 0.35
2.589ArgLys: 2.589 ± 0.366
5.85ArgLeu: 5.85 ± 0.57
1.501ArgMet: 1.501 ± 0.251
1.45ArgAsn: 1.45 ± 0.223
2.796ArgPro: 2.796 ± 0.428
2.847ArgGln: 2.847 ± 0.328
5.074ArgArg: 5.074 ± 0.439
3.676ArgSer: 3.676 ± 0.545
3.52ArgThr: 3.52 ± 0.451
3.727ArgVal: 3.727 ± 0.427
1.242ArgTrp: 1.242 ± 0.251
1.76ArgTyr: 1.76 ± 0.343
0.0ArgXaa: 0.0 ± 0.0
Ser
4.245SerAla: 4.245 ± 0.664
0.621SerCys: 0.621 ± 0.23
3.106SerAsp: 3.106 ± 0.482
3.365SerGlu: 3.365 ± 0.398
2.019SerPhe: 2.019 ± 0.331
5.695SerGly: 5.695 ± 0.612
1.139SerHis: 1.139 ± 0.269
2.64SerIle: 2.64 ± 0.411
2.744SerLys: 2.744 ± 0.435
5.643SerLeu: 5.643 ± 0.537
1.346SerMet: 1.346 ± 0.247
2.174SerAsn: 2.174 ± 0.302
3.262SerPro: 3.262 ± 0.482
2.381SerGln: 2.381 ± 0.303
3.365SerArg: 3.365 ± 0.4
4.245SerSer: 4.245 ± 0.718
3.572SerThr: 3.572 ± 0.461
4.763SerVal: 4.763 ± 0.459
1.191SerTrp: 1.191 ± 0.259
2.123SerTyr: 2.123 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
5.643ThrAla: 5.643 ± 0.503
0.207ThrCys: 0.207 ± 0.105
3.52ThrAsp: 3.52 ± 0.442
3.469ThrGlu: 3.469 ± 0.458
2.123ThrPhe: 2.123 ± 0.307
5.022ThrGly: 5.022 ± 0.622
0.828ThrHis: 0.828 ± 0.198
2.951ThrIle: 2.951 ± 0.45
2.692ThrLys: 2.692 ± 0.372
4.711ThrLeu: 4.711 ± 0.539
1.657ThrMet: 1.657 ± 0.31
2.33ThrAsn: 2.33 ± 0.346
3.365ThrPro: 3.365 ± 0.343
2.174ThrGln: 2.174 ± 0.34
2.226ThrArg: 2.226 ± 0.306
4.245ThrSer: 4.245 ± 0.583
3.262ThrThr: 3.262 ± 0.494
4.349ThrVal: 4.349 ± 0.504
1.346ThrTrp: 1.346 ± 0.264
1.553ThrTyr: 1.553 ± 0.293
0.0ThrXaa: 0.0 ± 0.0
Val
6.212ValAla: 6.212 ± 0.543
0.777ValCys: 0.777 ± 0.224
5.074ValAsp: 5.074 ± 0.53
4.763ValGlu: 4.763 ± 0.503
2.692ValPhe: 2.692 ± 0.364
3.935ValGly: 3.935 ± 0.543
1.346ValHis: 1.346 ± 0.262
2.589ValIle: 2.589 ± 0.385
3.262ValLys: 3.262 ± 0.411
6.523ValLeu: 6.523 ± 0.554
1.657ValMet: 1.657 ± 0.29
1.864ValAsn: 1.864 ± 0.253
3.676ValPro: 3.676 ± 0.386
3.106ValGln: 3.106 ± 0.327
5.022ValArg: 5.022 ± 0.433
4.09ValSer: 4.09 ± 0.405
4.815ValThr: 4.815 ± 0.532
5.332ValVal: 5.332 ± 0.491
1.242ValTrp: 1.242 ± 0.291
1.657ValTyr: 1.657 ± 0.254
0.0ValXaa: 0.0 ± 0.0
Trp
1.553TrpAla: 1.553 ± 0.291
0.207TrpCys: 0.207 ± 0.093
1.294TrpAsp: 1.294 ± 0.303
1.553TrpGlu: 1.553 ± 0.256
1.087TrpPhe: 1.087 ± 0.23
0.984TrpGly: 0.984 ± 0.182
0.362TrpHis: 0.362 ± 0.115
0.828TrpIle: 0.828 ± 0.215
0.88TrpLys: 0.88 ± 0.203
1.76TrpLeu: 1.76 ± 0.313
0.207TrpMet: 0.207 ± 0.095
0.828TrpAsn: 0.828 ± 0.21
0.569TrpPro: 0.569 ± 0.156
0.673TrpGln: 0.673 ± 0.202
1.501TrpArg: 1.501 ± 0.351
1.191TrpSer: 1.191 ± 0.243
0.932TrpThr: 0.932 ± 0.219
0.88TrpVal: 0.88 ± 0.223
0.311TrpTrp: 0.311 ± 0.108
1.087TrpTyr: 1.087 ± 0.226
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.744TyrAla: 2.744 ± 0.354
0.569TyrCys: 0.569 ± 0.187
2.071TyrAsp: 2.071 ± 0.329
2.019TyrGlu: 2.019 ± 0.248
0.569TyrPhe: 0.569 ± 0.191
2.796TyrGly: 2.796 ± 0.416
0.725TyrHis: 0.725 ± 0.188
0.88TyrIle: 0.88 ± 0.198
1.294TyrLys: 1.294 ± 0.275
2.278TyrLeu: 2.278 ± 0.343
0.518TyrMet: 0.518 ± 0.138
0.673TyrAsn: 0.673 ± 0.183
1.346TyrPro: 1.346 ± 0.272
1.191TyrGln: 1.191 ± 0.228
1.916TyrArg: 1.916 ± 0.368
1.864TyrSer: 1.864 ± 0.331
1.242TyrThr: 1.242 ± 0.266
2.278TyrVal: 2.278 ± 0.396
0.569TyrTrp: 0.569 ± 0.218
0.725TyrTyr: 0.725 ± 0.176
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (19317 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski