Amino acid dipeptide frequency for Azobacteroides phage ProJPt-Bp1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
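As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping residue pairs in a list of protein sequences and reports each pair's share in per mille. It assumes one-letter amino acid codes and that pairs are never counted across protein boundaries; the function name and the toy sequences are hypothetical, and the exact counting procedure used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each sequence only.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along the sequence.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
toy = ["MKKLA", "MKA"]
freqs = dipeptide_per_mille(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f}")
```

By construction, the frequencies of all observed pairs sum to 1000 per mille.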

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
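The comparison against the random expectation can be sketched as follows. The example assumes that the threshold is the uniform expectation of 1/400 mentioned in the text (2.5 per mille); the function name `classify` is hypothetical, and the four observed values are copied from the table on this page. The last step converts a per mille value to a plain fraction.

```python
# Uniform expectation for 400 dipeptides, expressed in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# A few values taken from the table (per mille).
observed = {"GlyGly": 8.37, "AlaAla": 6.213, "CysCys": 0.097, "TrpTrp": 0.161}

labels = {pair: classify(value) for pair, value in observed.items()}
# Per mille to fraction: multiply by 1e-3.
fractions = {pair: value * 1e-3 for pair, value in observed.items()}

for pair in observed:
    print(pair, labels[pair], fractions[pair])
```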

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.213 ± 0.826
AlaCys: 0.322 ± 0.141
AlaAsp: 4.861 ± 0.373
AlaGlu: 3.638 ± 0.265
AlaPhe: 2.897 ± 0.315
AlaGly: 5.473 ± 0.505
AlaHis: 1.835 ± 0.278
AlaIle: 3.831 ± 0.343
AlaLys: 4.346 ± 0.473
AlaLeu: 5.086 ± 0.469
AlaMet: 2.768 ± 0.387
AlaAsn: 4.088 ± 0.33
AlaPro: 3.605 ± 0.542
AlaGln: 3.509 ± 0.576
AlaArg: 3.155 ± 0.326
AlaSer: 4.088 ± 0.277
AlaThr: 4.121 ± 0.394
AlaVal: 3.863 ± 0.422
AlaTrp: 0.901 ± 0.158
AlaTyr: 3.477 ± 0.349
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.322 ± 0.109
CysCys: 0.097 ± 0.06
CysAsp: 0.258 ± 0.112
CysGlu: 0.161 ± 0.078
CysPhe: 0.418 ± 0.169
CysGly: 0.29 ± 0.096
CysHis: 0.064 ± 0.052
CysIle: 0.354 ± 0.119
CysLys: 0.386 ± 0.127
CysLeu: 0.483 ± 0.147
CysMet: 0.322 ± 0.104
CysAsn: 0.258 ± 0.087
CysPro: 0.29 ± 0.098
CysGln: 0.161 ± 0.076
CysArg: 0.193 ± 0.096
CysSer: 0.354 ± 0.132
CysThr: 0.418 ± 0.138
CysVal: 0.74 ± 0.212
CysTrp: 0.097 ± 0.052
CysTyr: 0.258 ± 0.095
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.054 ± 0.405
AspCys: 0.29 ± 0.137
AspAsp: 5.247 ± 0.855
AspGlu: 4.829 ± 0.444
AspPhe: 2.672 ± 0.305
AspGly: 4.797 ± 0.503
AspHis: 1.062 ± 0.248
AspIle: 4.346 ± 0.304
AspLys: 5.376 ± 0.649
AspLeu: 4.797 ± 0.46
AspMet: 2.125 ± 0.229
AspAsn: 4.024 ± 0.391
AspPro: 3.799 ± 0.401
AspGln: 2.414 ± 0.335
AspArg: 3.573 ± 0.397
AspSer: 3.219 ± 0.434
AspThr: 4.121 ± 0.27
AspVal: 3.702 ± 0.317
AspTrp: 0.805 ± 0.198
AspTyr: 3.67 ± 0.349
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.314 ± 0.41
GluCys: 0.451 ± 0.155
GluAsp: 4.088 ± 0.338
GluGlu: 4.732 ± 0.639
GluPhe: 2.253 ± 0.247
GluGly: 3.895 ± 0.489
GluHis: 1.352 ± 0.208
GluIle: 3.605 ± 0.434
GluLys: 4.249 ± 0.381
GluLeu: 4.668 ± 0.451
GluMet: 2.06 ± 0.273
GluAsn: 2.768 ± 0.301
GluPro: 2.736 ± 0.298
GluGln: 2.833 ± 0.391
GluArg: 3.67 ± 0.429
GluSer: 2.994 ± 0.298
GluThr: 2.447 ± 0.335
GluVal: 3.927 ± 0.485
GluTrp: 1.288 ± 0.188
GluTyr: 2.704 ± 0.355
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.962 ± 0.273
PheCys: 0.515 ± 0.135
PheAsp: 2.511 ± 0.324
PheGlu: 1.996 ± 0.199
PhePhe: 1.481 ± 0.182
PheGly: 2.768 ± 0.291
PheHis: 0.773 ± 0.152
PheIle: 2.318 ± 0.423
PheLys: 2.479 ± 0.304
PheLeu: 2.189 ± 0.345
PheMet: 1.449 ± 0.211
PheAsn: 2.511 ± 0.319
PhePro: 1.803 ± 0.232
PheGln: 0.998 ± 0.17
PheArg: 1.706 ± 0.293
PheSer: 2.736 ± 0.348
PheThr: 2.157 ± 0.206
PheVal: 2.672 ± 0.321
PheTrp: 0.483 ± 0.13
PheTyr: 1.771 ± 0.232
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.279 ± 0.7
GlyCys: 0.612 ± 0.187
GlyAsp: 4.925 ± 0.326
GlyGlu: 3.96 ± 0.379
GlyPhe: 2.768 ± 0.318
GlyGly: 8.37 ± 1.271
GlyHis: 1.159 ± 0.222
GlyIle: 4.314 ± 0.441
GlyLys: 4.925 ± 0.55
GlyLeu: 6.664 ± 1.367
GlyMet: 2.189 ± 0.243
GlyAsn: 3.38 ± 0.352
GlyPro: 2.221 ± 0.332
GlyGln: 3.219 ± 0.505
GlyArg: 3.477 ± 0.418
GlySer: 4.249 ± 0.444
GlyThr: 5.118 ± 0.64
GlyVal: 4.088 ± 0.365
GlyTrp: 0.966 ± 0.19
GlyTyr: 3.863 ± 0.439
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.095 ± 0.147
HisCys: 0.064 ± 0.051
HisAsp: 1.642 ± 0.194
HisGlu: 0.901 ± 0.22
HisPhe: 0.901 ± 0.162
HisGly: 1.964 ± 0.245
HisHis: 0.547 ± 0.123
HisIle: 1.32 ± 0.262
HisLys: 1.223 ± 0.221
HisLeu: 1.384 ± 0.217
HisMet: 0.676 ± 0.129
HisAsn: 0.998 ± 0.163
HisPro: 1.513 ± 0.25
HisGln: 0.515 ± 0.148
HisArg: 1.255 ± 0.242
HisSer: 1.416 ± 0.306
HisThr: 0.901 ± 0.169
HisVal: 0.998 ± 0.185
HisTrp: 0.193 ± 0.08
HisTyr: 0.515 ± 0.127
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.895 ± 0.451
IleCys: 0.322 ± 0.089
IleAsp: 5.505 ± 0.519
IleGlu: 3.702 ± 0.433
IlePhe: 1.674 ± 0.29
IleGly: 3.734 ± 0.348
IleHis: 1.32 ± 0.223
IleIle: 3.251 ± 0.467
IleLys: 4.539 ± 0.475
IleLeu: 2.511 ± 0.343
IleMet: 1.835 ± 0.22
IleAsn: 3.734 ± 0.388
IlePro: 3.605 ± 0.465
IleGln: 2.253 ± 0.229
IleArg: 2.543 ± 0.309
IleSer: 3.445 ± 0.324
IleThr: 3.96 ± 0.541
IleVal: 3.155 ± 0.291
IleTrp: 0.483 ± 0.162
IleTyr: 2.189 ± 0.26
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.378 ± 0.515
LysCys: 0.483 ± 0.114
LysAsp: 4.539 ± 0.499
LysGlu: 4.507 ± 0.64
LysPhe: 2.447 ± 0.329
LysGly: 4.249 ± 0.439
LysHis: 1.513 ± 0.282
LysIle: 2.994 ± 0.269
LysLys: 4.249 ± 0.472
LysLeu: 4.99 ± 0.744
LysMet: 2.189 ± 0.283
LysAsn: 3.702 ± 0.333
LysPro: 2.511 ± 0.389
LysGln: 2.511 ± 0.324
LysArg: 2.704 ± 0.333
LysSer: 3.734 ± 0.283
LysThr: 2.962 ± 0.285
LysVal: 3.477 ± 0.422
LysTrp: 1.255 ± 0.204
LysTyr: 2.543 ± 0.277
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.794 ± 0.676
LeuCys: 0.515 ± 0.139
LeuAsp: 4.764 ± 0.323
LeuGlu: 4.217 ± 0.531
LeuPhe: 2.189 ± 0.356
LeuGly: 5.44 ± 0.534
LeuHis: 1.352 ± 0.234
LeuIle: 3.895 ± 0.421
LeuLys: 4.217 ± 0.493
LeuLeu: 5.247 ± 0.445
LeuMet: 1.513 ± 0.244
LeuAsn: 4.056 ± 0.427
LeuPro: 3.573 ± 0.474
LeuGln: 2.575 ± 0.235
LeuArg: 3.573 ± 0.477
LeuSer: 4.829 ± 0.474
LeuThr: 4.636 ± 0.468
LeuVal: 3.348 ± 0.492
LeuTrp: 0.579 ± 0.127
LeuTyr: 2.897 ± 0.373
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.833 ± 0.262
MetCys: 0.225 ± 0.098
MetAsp: 2.06 ± 0.283
MetGlu: 1.771 ± 0.246
MetPhe: 1.159 ± 0.247
MetGly: 2.092 ± 0.257
MetHis: 0.612 ± 0.153
MetIle: 1.738 ± 0.247
MetLys: 1.996 ± 0.347
MetLeu: 1.803 ± 0.273
MetMet: 1.03 ± 0.219
MetAsn: 1.481 ± 0.234
MetPro: 2.221 ± 0.271
MetGln: 1.835 ± 0.386
MetArg: 1.449 ± 0.181
MetSer: 1.706 ± 0.244
MetThr: 2.125 ± 0.356
MetVal: 1.416 ± 0.22
MetTrp: 0.483 ± 0.122
MetTyr: 1.255 ± 0.228
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.217 ± 0.366
AsnCys: 0.29 ± 0.138
AsnAsp: 3.863 ± 0.412
AsnGlu: 3.316 ± 0.272
AsnPhe: 1.835 ± 0.278
AsnGly: 3.09 ± 0.393
AsnHis: 0.966 ± 0.203
AsnIle: 3.863 ± 0.283
AsnLys: 3.477 ± 0.361
AsnLeu: 3.412 ± 0.379
AsnMet: 1.867 ± 0.193
AsnAsn: 2.929 ± 0.346
AsnPro: 4.668 ± 0.518
AsnGln: 2.64 ± 0.382
AsnArg: 2.479 ± 0.413
AsnSer: 2.672 ± 0.277
AsnThr: 3.863 ± 0.396
AsnVal: 2.768 ± 0.273
AsnTrp: 0.676 ± 0.135
AsnTyr: 2.318 ± 0.243
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.927 ± 0.476
ProCys: 0.161 ± 0.096
ProAsp: 3.766 ± 0.449
ProGlu: 3.895 ± 0.435
ProPhe: 2.157 ± 0.291
ProGly: 3.863 ± 0.583
ProHis: 1.416 ± 0.246
ProIle: 3.638 ± 0.399
ProLys: 2.35 ± 0.282
ProLeu: 3.477 ± 0.372
ProMet: 1.416 ± 0.24
ProAsn: 2.64 ± 0.397
ProPro: 3.573 ± 0.7
ProGln: 2.704 ± 0.458
ProArg: 2.479 ± 0.474
ProSer: 3.412 ± 0.479
ProThr: 3.541 ± 0.441
ProVal: 3.155 ± 0.297
ProTrp: 0.483 ± 0.134
ProTyr: 2.479 ± 0.232
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.445 ± 0.572
GlnCys: 0.0 ± 0.0
GlnAsp: 2.382 ± 0.249
GlnGlu: 2.736 ± 0.345
GlnPhe: 1.384 ± 0.227
GlnGly: 3.799 ± 0.501
GlnHis: 0.901 ± 0.162
GlnIle: 2.125 ± 0.269
GlnLys: 1.931 ± 0.319
GlnLeu: 2.479 ± 0.25
GlnMet: 1.352 ± 0.299
GlnAsn: 2.35 ± 0.297
GlnPro: 2.35 ± 0.427
GlnGln: 3.573 ± 0.979
GlnArg: 2.253 ± 0.364
GlnSer: 1.996 ± 0.37
GlnThr: 2.318 ± 0.324
GlnVal: 2.092 ± 0.272
GlnTrp: 0.708 ± 0.122
GlnTyr: 1.931 ± 0.324
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.026 ± 0.338
ArgCys: 0.29 ± 0.104
ArgAsp: 3.219 ± 0.318
ArgGlu: 3.123 ± 0.41
ArgPhe: 2.35 ± 0.344
ArgGly: 2.865 ± 0.325
ArgHis: 0.773 ± 0.157
ArgIle: 2.736 ± 0.252
ArgLys: 2.447 ± 0.269
ArgLeu: 3.992 ± 0.368
ArgMet: 1.545 ± 0.224
ArgAsn: 3.348 ± 0.438
ArgPro: 2.736 ± 0.327
ArgGln: 1.964 ± 0.269
ArgArg: 2.962 ± 0.441
ArgSer: 2.994 ± 0.335
ArgThr: 3.316 ± 0.355
ArgVal: 2.704 ± 0.352
ArgTrp: 0.644 ± 0.134
ArgTyr: 2.447 ± 0.341
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.927 ± 0.397
SerCys: 0.258 ± 0.106
SerAsp: 4.153 ± 0.446
SerGlu: 3.38 ± 0.407
SerPhe: 2.736 ± 0.34
SerGly: 5.408 ± 0.848
SerHis: 0.966 ± 0.243
SerIle: 3.541 ± 0.337
SerLys: 3.123 ± 0.291
SerLeu: 4.314 ± 0.257
SerMet: 1.964 ± 0.22
SerAsn: 2.994 ± 0.242
SerPro: 2.704 ± 0.405
SerGln: 1.996 ± 0.248
SerArg: 3.058 ± 0.428
SerSer: 4.732 ± 0.653
SerThr: 4.217 ± 0.685
SerVal: 3.251 ± 0.358
SerTrp: 0.901 ± 0.175
SerTyr: 2.189 ± 0.242
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.314 ± 0.443
ThrCys: 0.225 ± 0.081
ThrAsp: 3.799 ± 0.458
ThrGlu: 3.799 ± 0.349
ThrPhe: 2.736 ± 0.387
ThrGly: 5.344 ± 0.566
ThrHis: 1.159 ± 0.192
ThrIle: 3.284 ± 0.478
ThrLys: 2.865 ± 0.362
ThrLeu: 4.346 ± 0.536
ThrMet: 1.771 ± 0.175
ThrAsn: 3.541 ± 0.382
ThrPro: 4.281 ± 0.444
ThrGln: 1.996 ± 0.323
ThrArg: 2.801 ± 0.34
ThrSer: 4.153 ± 0.551
ThrThr: 3.863 ± 0.644
ThrVal: 3.412 ± 0.448
ThrTrp: 0.966 ± 0.156
ThrTyr: 2.929 ± 0.387
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.38 ± 0.419
ValCys: 0.29 ± 0.107
ValAsp: 3.799 ± 0.356
ValGlu: 3.155 ± 0.443
ValPhe: 2.253 ± 0.258
ValGly: 4.185 ± 0.514
ValHis: 1.481 ± 0.243
ValIle: 2.608 ± 0.385
ValLys: 4.346 ± 0.616
ValLeu: 3.541 ± 0.46
ValMet: 1.416 ± 0.231
ValAsn: 3.284 ± 0.337
ValPro: 3.734 ± 0.362
ValGln: 2.189 ± 0.25
ValArg: 2.801 ± 0.216
ValSer: 3.316 ± 0.353
ValThr: 3.251 ± 0.454
ValVal: 3.187 ± 0.483
ValTrp: 0.74 ± 0.172
ValTyr: 2.962 ± 0.487
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.74 ± 0.102
TrpCys: 0.064 ± 0.047
TrpAsp: 1.062 ± 0.228
TrpGlu: 0.869 ± 0.185
TrpPhe: 0.547 ± 0.125
TrpGly: 0.966 ± 0.166
TrpHis: 0.193 ± 0.067
TrpIle: 0.869 ± 0.193
TrpLys: 0.805 ± 0.14
TrpLeu: 1.095 ± 0.208
TrpMet: 0.354 ± 0.098
TrpAsn: 0.74 ± 0.149
TrpPro: 0.451 ± 0.145
TrpGln: 0.676 ± 0.121
TrpArg: 0.612 ± 0.15
TrpSer: 0.934 ± 0.182
TrpThr: 0.773 ± 0.172
TrpVal: 0.837 ± 0.19
TrpTrp: 0.161 ± 0.07
TrpTyr: 0.612 ± 0.157
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.962 ± 0.335
TyrCys: 0.451 ± 0.172
TyrAsp: 3.477 ± 0.309
TyrGlu: 2.35 ± 0.252
TyrPhe: 1.416 ± 0.222
TyrGly: 3.155 ± 0.426
TyrHis: 0.579 ± 0.127
TyrIle: 3.026 ± 0.347
TyrLys: 2.608 ± 0.287
TyrLeu: 2.736 ± 0.311
TyrMet: 1.416 ± 0.2
TyrAsn: 2.382 ± 0.375
TyrPro: 2.157 ± 0.312
TyrGln: 1.577 ± 0.205
TyrArg: 2.736 ± 0.266
TyrSer: 2.833 ± 0.282
TyrThr: 3.477 ± 0.594
TyrVal: 3.123 ± 0.374
TyrTrp: 0.547 ± 0.138
TyrTyr: 1.899 ± 0.252
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 52 proteins (31065 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
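A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is only an illustration under stated assumptions: the page does not say which spread statistic is used, so the standard deviation is assumed here, and the function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (per mille) over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: the reported error is the bootstrap standard deviation.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes).
proteins = ["MKKLA", "MKA", "GGGA", "MKLLG"]
value = pair_per_mille(proteins, "MK")
error = bootstrap_error(proteins, "MK")
print(f"MK: {value:.3f} ± {error:.3f}")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.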

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski