Amino acid dipepetide frequency for Xanthomonas phage Tenjo

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
22.338AlaAla: 22.338 ± 2.241
2.01AlaCys: 2.01 ± 0.494
9.308AlaAsp: 9.308 ± 0.887
8.935AlaGlu: 8.935 ± 1.495
3.574AlaPhe: 3.574 ± 0.579
10.797AlaGly: 10.797 ± 0.987
1.638AlaHis: 1.638 ± 0.326
6.404AlaIle: 6.404 ± 0.815
6.106AlaLys: 6.106 ± 0.768
10.127AlaLeu: 10.127 ± 1.184
4.17AlaMet: 4.17 ± 0.575
3.797AlaAsn: 3.797 ± 0.561
6.031AlaPro: 6.031 ± 0.684
6.776AlaGln: 6.776 ± 1.081
9.68AlaArg: 9.68 ± 1.296
6.106AlaSer: 6.106 ± 0.578
7.744AlaThr: 7.744 ± 0.621
8.042AlaVal: 8.042 ± 0.833
1.564AlaTrp: 1.564 ± 0.375
2.755AlaTyr: 2.755 ± 0.395
0.0AlaXaa: 0.0 ± 0.0
Cys
0.745CysAla: 0.745 ± 0.325
0.149CysCys: 0.149 ± 0.115
0.819CysAsp: 0.819 ± 0.307
0.745CysGlu: 0.745 ± 0.265
0.074CysPhe: 0.074 ± 0.078
0.596CysGly: 0.596 ± 0.275
0.223CysHis: 0.223 ± 0.132
0.447CysIle: 0.447 ± 0.249
0.074CysLys: 0.074 ± 0.082
0.521CysLeu: 0.521 ± 0.214
0.149CysMet: 0.149 ± 0.115
0.298CysAsn: 0.298 ± 0.15
0.372CysPro: 0.372 ± 0.172
0.074CysGln: 0.074 ± 0.07
0.521CysArg: 0.521 ± 0.243
0.447CysSer: 0.447 ± 0.212
0.298CysThr: 0.298 ± 0.146
0.596CysVal: 0.596 ± 0.199
0.223CysTrp: 0.223 ± 0.189
0.372CysTyr: 0.372 ± 0.169
0.0CysXaa: 0.0 ± 0.0
Asp
11.02AspAla: 11.02 ± 1.097
0.521AspCys: 0.521 ± 0.239
4.244AspAsp: 4.244 ± 0.534
4.468AspGlu: 4.468 ± 0.68
1.713AspPhe: 1.713 ± 0.315
6.329AspGly: 6.329 ± 0.709
0.819AspHis: 0.819 ± 0.225
2.606AspIle: 2.606 ± 0.428
1.862AspLys: 1.862 ± 0.379
4.691AspLeu: 4.691 ± 0.654
1.266AspMet: 1.266 ± 0.319
1.564AspAsn: 1.564 ± 0.313
3.797AspPro: 3.797 ± 0.514
3.574AspGln: 3.574 ± 0.468
4.319AspArg: 4.319 ± 0.851
2.978AspSer: 2.978 ± 0.377
3.5AspThr: 3.5 ± 0.393
4.17AspVal: 4.17 ± 0.583
1.34AspTrp: 1.34 ± 0.307
1.415AspTyr: 1.415 ± 0.363
0.0AspXaa: 0.0 ± 0.0
Glu
8.488GluAla: 8.488 ± 1.254
0.521GluCys: 0.521 ± 0.227
3.425GluAsp: 3.425 ± 0.514
2.606GluGlu: 2.606 ± 0.482
1.415GluPhe: 1.415 ± 0.271
5.212GluGly: 5.212 ± 0.499
0.745GluHis: 0.745 ± 0.246
2.681GluIle: 2.681 ± 0.418
2.755GluLys: 2.755 ± 0.534
4.319GluLeu: 4.319 ± 0.653
1.191GluMet: 1.191 ± 0.325
1.117GluAsn: 1.117 ± 0.27
2.308GluPro: 2.308 ± 0.512
4.393GluGln: 4.393 ± 0.644
5.063GluArg: 5.063 ± 0.8
3.574GluSer: 3.574 ± 0.423
2.755GluThr: 2.755 ± 0.383
4.468GluVal: 4.468 ± 0.459
0.819GluTrp: 0.819 ± 0.216
1.564GluTyr: 1.564 ± 0.287
0.0GluXaa: 0.0 ± 0.0
Phe
3.127PheAla: 3.127 ± 0.492
0.149PheCys: 0.149 ± 0.105
2.383PheAsp: 2.383 ± 0.391
2.01PheGlu: 2.01 ± 0.417
0.745PhePhe: 0.745 ± 0.226
2.159PheGly: 2.159 ± 0.381
0.596PheHis: 0.596 ± 0.217
1.266PheIle: 1.266 ± 0.373
1.117PheLys: 1.117 ± 0.33
2.457PheLeu: 2.457 ± 0.506
0.819PheMet: 0.819 ± 0.251
1.564PheAsn: 1.564 ± 0.336
0.67PhePro: 0.67 ± 0.268
1.415PheGln: 1.415 ± 0.229
2.085PheArg: 2.085 ± 0.349
1.713PheSer: 1.713 ± 0.465
1.862PheThr: 1.862 ± 0.336
1.713PheVal: 1.713 ± 0.444
0.223PheTrp: 0.223 ± 0.15
1.042PheTyr: 1.042 ± 0.282
0.0PheXaa: 0.0 ± 0.0
Gly
8.34GlyAla: 8.34 ± 0.903
0.968GlyCys: 0.968 ± 0.335
5.51GlyAsp: 5.51 ± 0.646
5.361GlyGlu: 5.361 ± 0.647
3.202GlyPhe: 3.202 ± 0.378
7.669GlyGly: 7.669 ± 0.936
1.787GlyHis: 1.787 ± 0.344
3.425GlyIle: 3.425 ± 0.599
4.989GlyLys: 4.989 ± 0.634
5.733GlyLeu: 5.733 ± 0.551
2.978GlyMet: 2.978 ± 0.424
2.755GlyAsn: 2.755 ± 0.477
2.532GlyPro: 2.532 ± 0.463
3.351GlyGln: 3.351 ± 0.516
5.436GlyArg: 5.436 ± 0.778
4.17GlySer: 4.17 ± 0.641
4.989GlyThr: 4.989 ± 0.866
7.148GlyVal: 7.148 ± 0.69
1.415GlyTrp: 1.415 ± 0.377
1.862GlyTyr: 1.862 ± 0.33
0.0GlyXaa: 0.0 ± 0.0
His
1.862HisAla: 1.862 ± 0.387
0.447HisCys: 0.447 ± 0.202
0.968HisAsp: 0.968 ± 0.26
0.596HisGlu: 0.596 ± 0.288
0.447HisPhe: 0.447 ± 0.229
1.638HisGly: 1.638 ± 0.435
0.298HisHis: 0.298 ± 0.161
0.372HisIle: 0.372 ± 0.142
0.372HisLys: 0.372 ± 0.165
1.489HisLeu: 1.489 ± 0.349
0.223HisMet: 0.223 ± 0.131
0.521HisAsn: 0.521 ± 0.202
0.596HisPro: 0.596 ± 0.237
0.745HisGln: 0.745 ± 0.238
0.819HisArg: 0.819 ± 0.289
0.521HisSer: 0.521 ± 0.272
1.042HisThr: 1.042 ± 0.274
1.117HisVal: 1.117 ± 0.26
0.149HisTrp: 0.149 ± 0.113
0.372HisTyr: 0.372 ± 0.138
0.0HisXaa: 0.0 ± 0.0
Ile
5.957IleAla: 5.957 ± 0.862
0.074IleCys: 0.074 ± 0.07
2.978IleAsp: 2.978 ± 0.409
3.276IleGlu: 3.276 ± 0.465
0.596IlePhe: 0.596 ± 0.17
3.5IleGly: 3.5 ± 0.608
0.596IleHis: 0.596 ± 0.162
1.266IleIle: 1.266 ± 0.312
1.638IleLys: 1.638 ± 0.275
2.755IleLeu: 2.755 ± 0.515
0.745IleMet: 0.745 ± 0.265
0.894IleAsn: 0.894 ± 0.252
2.234IlePro: 2.234 ± 0.365
2.159IleGln: 2.159 ± 0.408
3.053IleArg: 3.053 ± 0.414
1.936IleSer: 1.936 ± 0.432
3.723IleThr: 3.723 ± 0.594
2.904IleVal: 2.904 ± 0.522
0.447IleTrp: 0.447 ± 0.158
0.819IleTyr: 0.819 ± 0.214
0.0IleXaa: 0.0 ± 0.0
Lys
5.882LysAla: 5.882 ± 0.83
0.372LysCys: 0.372 ± 0.182
2.755LysAsp: 2.755 ± 0.484
1.862LysGlu: 1.862 ± 0.44
1.34LysPhe: 1.34 ± 0.295
2.532LysGly: 2.532 ± 0.432
0.745LysHis: 0.745 ± 0.236
1.489LysIle: 1.489 ± 0.308
1.713LysLys: 1.713 ± 0.451
4.095LysLeu: 4.095 ± 0.536
0.819LysMet: 0.819 ± 0.241
0.596LysAsn: 0.596 ± 0.194
2.085LysPro: 2.085 ± 0.433
1.638LysGln: 1.638 ± 0.417
2.457LysArg: 2.457 ± 0.432
0.894LysSer: 0.894 ± 0.221
3.127LysThr: 3.127 ± 0.552
2.755LysVal: 2.755 ± 0.452
0.819LysTrp: 0.819 ± 0.252
1.415LysTyr: 1.415 ± 0.284
0.0LysXaa: 0.0 ± 0.0
Leu
11.392LeuAla: 11.392 ± 1.119
0.298LeuCys: 0.298 ± 0.161
5.436LeuAsp: 5.436 ± 0.831
3.872LeuGlu: 3.872 ± 0.449
2.234LeuPhe: 2.234 ± 0.42
6.925LeuGly: 6.925 ± 0.708
0.968LeuHis: 0.968 ± 0.306
2.308LeuIle: 2.308 ± 0.353
1.564LeuLys: 1.564 ± 0.383
6.701LeuLeu: 6.701 ± 0.875
0.745LeuMet: 0.745 ± 0.306
1.862LeuAsn: 1.862 ± 0.455
5.51LeuPro: 5.51 ± 0.542
3.872LeuGln: 3.872 ± 0.505
7.074LeuArg: 7.074 ± 0.801
3.723LeuSer: 3.723 ± 0.421
3.723LeuThr: 3.723 ± 0.509
6.18LeuVal: 6.18 ± 0.928
1.415LeuTrp: 1.415 ± 0.322
2.308LeuTyr: 2.308 ± 0.511
0.0LeuXaa: 0.0 ± 0.0
Met
3.797MetAla: 3.797 ± 0.454
0.298MetCys: 0.298 ± 0.174
1.862MetAsp: 1.862 ± 0.362
1.191MetGlu: 1.191 ± 0.291
0.447MetPhe: 0.447 ± 0.211
2.01MetGly: 2.01 ± 0.419
0.223MetHis: 0.223 ± 0.129
1.191MetIle: 1.191 ± 0.291
0.894MetLys: 0.894 ± 0.342
1.489MetLeu: 1.489 ± 0.233
0.372MetMet: 0.372 ± 0.154
1.489MetAsn: 1.489 ± 0.346
1.415MetPro: 1.415 ± 0.257
0.894MetGln: 0.894 ± 0.286
1.787MetArg: 1.787 ± 0.32
1.191MetSer: 1.191 ± 0.234
1.713MetThr: 1.713 ± 0.389
0.894MetVal: 0.894 ± 0.286
0.149MetTrp: 0.149 ± 0.113
0.298MetTyr: 0.298 ± 0.133
0.0MetXaa: 0.0 ± 0.0
Asn
3.649AsnAla: 3.649 ± 0.495
0.149AsnCys: 0.149 ± 0.112
1.266AsnAsp: 1.266 ± 0.294
0.968AsnGlu: 0.968 ± 0.201
0.819AsnPhe: 0.819 ± 0.205
3.425AsnGly: 3.425 ± 0.441
0.372AsnHis: 0.372 ± 0.142
1.564AsnIle: 1.564 ± 0.345
1.117AsnLys: 1.117 ± 0.274
3.053AsnLeu: 3.053 ± 0.384
0.745AsnMet: 0.745 ± 0.252
1.042AsnAsn: 1.042 ± 0.247
2.755AsnPro: 2.755 ± 0.593
1.564AsnGln: 1.564 ± 0.282
1.713AsnArg: 1.713 ± 0.338
1.862AsnSer: 1.862 ± 0.414
1.936AsnThr: 1.936 ± 0.359
2.308AsnVal: 2.308 ± 0.417
0.372AsnTrp: 0.372 ± 0.123
0.521AsnTyr: 0.521 ± 0.204
0.0AsnXaa: 0.0 ± 0.0
Pro
6.701ProAla: 6.701 ± 0.609
0.298ProCys: 0.298 ± 0.161
3.649ProAsp: 3.649 ± 0.499
2.681ProGlu: 2.681 ± 0.386
1.638ProPhe: 1.638 ± 0.504
3.872ProGly: 3.872 ± 0.575
0.745ProHis: 0.745 ± 0.295
1.489ProIle: 1.489 ± 0.319
2.234ProLys: 2.234 ± 0.662
3.127ProLeu: 3.127 ± 0.455
1.415ProMet: 1.415 ± 0.285
1.936ProAsn: 1.936 ± 0.342
2.904ProPro: 2.904 ± 0.623
2.085ProGln: 2.085 ± 0.418
2.457ProArg: 2.457 ± 0.452
2.606ProSer: 2.606 ± 0.417
3.202ProThr: 3.202 ± 0.514
3.202ProVal: 3.202 ± 0.593
1.34ProTrp: 1.34 ± 0.409
1.787ProTyr: 1.787 ± 0.439
0.0ProXaa: 0.0 ± 0.0
Gln
8.861GlnAla: 8.861 ± 1.315
0.149GlnCys: 0.149 ± 0.098
2.159GlnAsp: 2.159 ± 0.418
2.904GlnGlu: 2.904 ± 0.491
1.787GlnPhe: 1.787 ± 0.277
3.351GlnGly: 3.351 ± 0.604
0.894GlnHis: 0.894 ± 0.254
1.787GlnIle: 1.787 ± 0.443
1.415GlnLys: 1.415 ± 0.274
4.244GlnLeu: 4.244 ± 0.577
0.894GlnMet: 0.894 ± 0.233
1.191GlnAsn: 1.191 ± 0.335
1.936GlnPro: 1.936 ± 0.364
4.617GlnGln: 4.617 ± 0.735
4.393GlnArg: 4.393 ± 0.745
1.862GlnSer: 1.862 ± 0.47
1.713GlnThr: 1.713 ± 0.25
4.244GlnVal: 4.244 ± 0.484
0.894GlnTrp: 0.894 ± 0.24
0.67GlnTyr: 0.67 ± 0.171
0.0GlnXaa: 0.0 ± 0.0
Arg
8.861ArgAla: 8.861 ± 1.526
0.447ArgCys: 0.447 ± 0.18
4.989ArgAsp: 4.989 ± 0.629
5.733ArgGlu: 5.733 ± 0.933
2.01ArgPhe: 2.01 ± 0.374
4.542ArgGly: 4.542 ± 0.57
0.894ArgHis: 0.894 ± 0.234
3.425ArgIle: 3.425 ± 0.53
3.127ArgLys: 3.127 ± 0.529
5.212ArgLeu: 5.212 ± 0.589
1.191ArgMet: 1.191 ± 0.306
2.532ArgAsn: 2.532 ± 0.44
2.457ArgPro: 2.457 ± 0.501
4.021ArgGln: 4.021 ± 0.765
6.031ArgArg: 6.031 ± 0.924
3.351ArgSer: 3.351 ± 0.46
3.797ArgThr: 3.797 ± 0.529
4.84ArgVal: 4.84 ± 0.493
1.191ArgTrp: 1.191 ± 0.39
1.415ArgTyr: 1.415 ± 0.339
0.0ArgXaa: 0.0 ± 0.0
Ser
5.212SerAla: 5.212 ± 0.674
0.074SerCys: 0.074 ± 0.063
4.319SerAsp: 4.319 ± 0.522
3.127SerGlu: 3.127 ± 0.437
1.415SerPhe: 1.415 ± 0.321
5.138SerGly: 5.138 ± 0.647
0.596SerHis: 0.596 ± 0.224
2.978SerIle: 2.978 ± 0.387
2.457SerLys: 2.457 ± 0.418
3.351SerLeu: 3.351 ± 0.501
0.819SerMet: 0.819 ± 0.315
1.489SerAsn: 1.489 ± 0.336
2.383SerPro: 2.383 ± 0.367
1.415SerGln: 1.415 ± 0.3
2.755SerArg: 2.755 ± 0.395
2.383SerSer: 2.383 ± 0.43
3.649SerThr: 3.649 ± 0.682
2.978SerVal: 2.978 ± 0.408
0.596SerTrp: 0.596 ± 0.191
1.191SerTyr: 1.191 ± 0.353
0.0SerXaa: 0.0 ± 0.0
Thr
8.414ThrAla: 8.414 ± 0.733
0.074ThrCys: 0.074 ± 0.074
3.872ThrAsp: 3.872 ± 0.46
2.681ThrGlu: 2.681 ± 0.452
2.234ThrPhe: 2.234 ± 0.426
5.585ThrGly: 5.585 ± 0.596
1.117ThrHis: 1.117 ± 0.322
3.202ThrIle: 3.202 ± 0.372
1.936ThrLys: 1.936 ± 0.436
5.436ThrLeu: 5.436 ± 0.48
1.415ThrMet: 1.415 ± 0.31
2.383ThrAsn: 2.383 ± 0.512
2.829ThrPro: 2.829 ± 0.485
2.308ThrGln: 2.308 ± 0.468
2.904ThrArg: 2.904 ± 0.496
2.904ThrSer: 2.904 ± 0.45
3.797ThrThr: 3.797 ± 0.61
4.17ThrVal: 4.17 ± 0.528
0.67ThrTrp: 0.67 ± 0.188
1.489ThrTyr: 1.489 ± 0.37
0.0ThrXaa: 0.0 ± 0.0
Val
9.084ValAla: 9.084 ± 0.87
0.447ValCys: 0.447 ± 0.183
3.946ValAsp: 3.946 ± 0.541
4.319ValGlu: 4.319 ± 0.405
1.787ValPhe: 1.787 ± 0.397
4.765ValGly: 4.765 ± 0.541
0.67ValHis: 0.67 ± 0.185
2.606ValIle: 2.606 ± 0.501
2.755ValLys: 2.755 ± 0.449
4.542ValLeu: 4.542 ± 0.522
2.159ValMet: 2.159 ± 0.368
2.829ValAsn: 2.829 ± 0.585
4.17ValPro: 4.17 ± 0.515
3.5ValGln: 3.5 ± 0.421
4.914ValArg: 4.914 ± 0.528
4.617ValSer: 4.617 ± 0.522
4.468ValThr: 4.468 ± 0.509
5.585ValVal: 5.585 ± 0.717
1.787ValTrp: 1.787 ± 0.332
1.415ValTyr: 1.415 ± 0.34
0.0ValXaa: 0.0 ± 0.0
Trp
1.862TrpAla: 1.862 ± 0.358
0.149TrpCys: 0.149 ± 0.114
0.596TrpAsp: 0.596 ± 0.23
0.67TrpGlu: 0.67 ± 0.236
0.819TrpPhe: 0.819 ± 0.242
1.564TrpGly: 1.564 ± 0.43
0.67TrpHis: 0.67 ± 0.257
0.67TrpIle: 0.67 ± 0.2
0.447TrpLys: 0.447 ± 0.174
2.383TrpLeu: 2.383 ± 0.508
0.521TrpMet: 0.521 ± 0.233
0.596TrpAsn: 0.596 ± 0.26
0.67TrpPro: 0.67 ± 0.219
0.372TrpGln: 0.372 ± 0.142
1.191TrpArg: 1.191 ± 0.338
0.596TrpSer: 0.596 ± 0.182
0.745TrpThr: 0.745 ± 0.252
0.968TrpVal: 0.968 ± 0.316
0.223TrpTrp: 0.223 ± 0.135
0.596TrpTyr: 0.596 ± 0.171
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.308TyrAla: 2.308 ± 0.492
0.149TyrCys: 0.149 ± 0.119
1.713TyrAsp: 1.713 ± 0.41
1.415TyrGlu: 1.415 ± 0.273
0.819TyrPhe: 0.819 ± 0.287
2.01TyrGly: 2.01 ± 0.371
0.0TyrHis: 0.0 ± 0.0
0.372TyrIle: 0.372 ± 0.168
0.894TyrLys: 0.894 ± 0.268
2.383TyrLeu: 2.383 ± 0.318
0.894TyrMet: 0.894 ± 0.212
0.745TyrAsn: 0.745 ± 0.275
1.713TyrPro: 1.713 ± 0.44
1.191TyrGln: 1.191 ± 0.272
1.415TyrArg: 1.415 ± 0.377
0.894TyrSer: 0.894 ± 0.245
1.638TyrThr: 1.638 ± 0.454
2.085TyrVal: 2.085 ± 0.336
0.67TyrTrp: 0.67 ± 0.192
0.447TyrTyr: 0.447 ± 0.157
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13431 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski