Amino acid dipepetide frequency for Mycobacterium phage TDanisky

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.693AlaAla: 13.693 ± 1.851
1.049AlaCys: 1.049 ± 0.283
7.564AlaAsp: 7.564 ± 0.642
7.288AlaGlu: 7.288 ± 0.715
2.816AlaPhe: 2.816 ± 0.48
9.939AlaGly: 9.939 ± 1.377
2.209AlaHis: 2.209 ± 0.364
4.362AlaIle: 4.362 ± 0.606
4.086AlaLys: 4.086 ± 0.421
7.288AlaLeu: 7.288 ± 0.763
2.54AlaMet: 2.54 ± 0.443
2.209AlaAsn: 2.209 ± 0.469
4.583AlaPro: 4.583 ± 0.557
3.368AlaGln: 3.368 ± 0.437
7.123AlaArg: 7.123 ± 0.782
5.798AlaSer: 5.798 ± 0.705
6.129AlaThr: 6.129 ± 0.513
6.791AlaVal: 6.791 ± 0.557
2.485AlaTrp: 2.485 ± 0.404
2.374AlaTyr: 2.374 ± 0.37
0.0AlaXaa: 0.0 ± 0.0
Cys
1.215CysAla: 1.215 ± 0.279
0.055CysCys: 0.055 ± 0.052
1.16CysAsp: 1.16 ± 0.264
0.828CysGlu: 0.828 ± 0.287
0.166CysPhe: 0.166 ± 0.09
1.712CysGly: 1.712 ± 0.375
0.166CysHis: 0.166 ± 0.114
0.166CysIle: 0.166 ± 0.103
0.607CysLys: 0.607 ± 0.178
0.607CysLeu: 0.607 ± 0.22
0.11CysMet: 0.11 ± 0.078
0.607CysAsn: 0.607 ± 0.21
1.325CysPro: 1.325 ± 0.356
0.331CysGln: 0.331 ± 0.143
0.773CysArg: 0.773 ± 0.267
0.773CysSer: 0.773 ± 0.207
0.442CysThr: 0.442 ± 0.161
0.663CysVal: 0.663 ± 0.186
0.276CysTrp: 0.276 ± 0.129
0.221CysTyr: 0.221 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
6.35AspAla: 6.35 ± 0.694
0.883AspCys: 0.883 ± 0.214
4.583AspAsp: 4.583 ± 0.671
3.589AspGlu: 3.589 ± 0.488
1.546AspPhe: 1.546 ± 0.238
5.963AspGly: 5.963 ± 0.54
1.546AspHis: 1.546 ± 0.302
2.374AspIle: 2.374 ± 0.298
1.877AspLys: 1.877 ± 0.31
5.742AspLeu: 5.742 ± 0.51
1.104AspMet: 1.104 ± 0.263
2.264AspAsn: 2.264 ± 0.449
4.969AspPro: 4.969 ± 0.591
2.153AspGln: 2.153 ± 0.308
5.908AspArg: 5.908 ± 0.77
3.534AspSer: 3.534 ± 0.474
4.086AspThr: 4.086 ± 0.507
4.472AspVal: 4.472 ± 0.562
1.38AspTrp: 1.38 ± 0.284
2.209AspTyr: 2.209 ± 0.366
0.0AspXaa: 0.0 ± 0.0
Glu
6.46GluAla: 6.46 ± 0.683
1.104GluCys: 1.104 ± 0.394
3.258GluAsp: 3.258 ± 0.381
2.816GluGlu: 2.816 ± 0.543
2.153GluPhe: 2.153 ± 0.388
4.086GluGly: 4.086 ± 0.432
1.436GluHis: 1.436 ± 0.375
2.209GluIle: 2.209 ± 0.317
2.043GluLys: 2.043 ± 0.351
5.632GluLeu: 5.632 ± 0.579
1.767GluMet: 1.767 ± 0.311
2.098GluAsn: 2.098 ± 0.325
2.926GluPro: 2.926 ± 0.463
2.871GluGln: 2.871 ± 0.342
4.914GluArg: 4.914 ± 0.592
2.816GluSer: 2.816 ± 0.482
3.81GluThr: 3.81 ± 0.574
3.865GluVal: 3.865 ± 0.488
1.436GluTrp: 1.436 ± 0.27
1.933GluTyr: 1.933 ± 0.367
0.0GluXaa: 0.0 ± 0.0
Phe
3.147PheAla: 3.147 ± 0.46
0.221PheCys: 0.221 ± 0.114
2.374PheAsp: 2.374 ± 0.513
1.491PheGlu: 1.491 ± 0.325
0.828PhePhe: 0.828 ± 0.266
2.761PheGly: 2.761 ± 0.602
0.387PheHis: 0.387 ± 0.149
1.601PheIle: 1.601 ± 0.316
0.994PheLys: 0.994 ± 0.237
1.712PheLeu: 1.712 ± 0.274
0.497PheMet: 0.497 ± 0.165
1.16PheAsn: 1.16 ± 0.294
1.822PhePro: 1.822 ± 0.299
1.27PheGln: 1.27 ± 0.354
1.601PheArg: 1.601 ± 0.284
1.601PheSer: 1.601 ± 0.34
2.706PheThr: 2.706 ± 0.354
2.043PheVal: 2.043 ± 0.331
0.552PheTrp: 0.552 ± 0.164
1.049PheTyr: 1.049 ± 0.276
0.0PheXaa: 0.0 ± 0.0
Gly
8.448GlyAla: 8.448 ± 1.249
1.104GlyCys: 1.104 ± 0.276
6.129GlyAsp: 6.129 ± 0.699
4.362GlyGlu: 4.362 ± 0.597
2.982GlyPhe: 2.982 ± 0.371
10.933GlyGly: 10.933 ± 2.787
1.767GlyHis: 1.767 ± 0.304
3.92GlyIle: 3.92 ± 0.619
2.374GlyLys: 2.374 ± 0.364
5.632GlyLeu: 5.632 ± 0.638
2.485GlyMet: 2.485 ± 0.48
2.871GlyAsn: 2.871 ± 0.415
4.196GlyPro: 4.196 ± 0.545
2.374GlyGln: 2.374 ± 0.549
5.411GlyArg: 5.411 ± 0.607
6.515GlySer: 6.515 ± 1.084
6.46GlyThr: 6.46 ± 0.819
5.632GlyVal: 5.632 ± 0.651
2.485GlyTrp: 2.485 ± 0.387
2.374GlyTyr: 2.374 ± 0.452
0.0GlyXaa: 0.0 ± 0.0
His
1.767HisAla: 1.767 ± 0.327
0.276HisCys: 0.276 ± 0.17
1.215HisAsp: 1.215 ± 0.254
1.104HisGlu: 1.104 ± 0.24
0.552HisPhe: 0.552 ± 0.158
1.436HisGly: 1.436 ± 0.322
0.994HisHis: 0.994 ± 0.31
1.491HisIle: 1.491 ± 0.31
0.828HisLys: 0.828 ± 0.233
1.491HisLeu: 1.491 ± 0.286
0.497HisMet: 0.497 ± 0.146
0.994HisAsn: 0.994 ± 0.218
1.601HisPro: 1.601 ± 0.345
0.773HisGln: 0.773 ± 0.263
1.933HisArg: 1.933 ± 0.401
1.049HisSer: 1.049 ± 0.233
1.601HisThr: 1.601 ± 0.416
1.16HisVal: 1.16 ± 0.314
0.442HisTrp: 0.442 ± 0.158
0.663HisTyr: 0.663 ± 0.155
0.0HisXaa: 0.0 ± 0.0
Ile
4.969IleAla: 4.969 ± 0.492
0.442IleCys: 0.442 ± 0.193
3.479IleAsp: 3.479 ± 0.432
3.258IleGlu: 3.258 ± 0.372
0.773IlePhe: 0.773 ± 0.188
3.644IleGly: 3.644 ± 0.484
1.491IleHis: 1.491 ± 0.338
1.546IleIle: 1.546 ± 0.338
1.104IleLys: 1.104 ± 0.255
2.153IleLeu: 2.153 ± 0.395
0.331IleMet: 0.331 ± 0.144
1.712IleAsn: 1.712 ± 0.262
3.147IlePro: 3.147 ± 0.387
1.491IleGln: 1.491 ± 0.282
2.761IleArg: 2.761 ± 0.438
2.485IleSer: 2.485 ± 0.538
3.755IleThr: 3.755 ± 0.473
3.092IleVal: 3.092 ± 0.398
0.773IleTrp: 0.773 ± 0.18
0.718IleTyr: 0.718 ± 0.193
0.0IleXaa: 0.0 ± 0.0
Lys
3.699LysAla: 3.699 ± 0.564
0.497LysCys: 0.497 ± 0.187
1.436LysAsp: 1.436 ± 0.293
1.215LysGlu: 1.215 ± 0.235
1.601LysPhe: 1.601 ± 0.228
2.429LysGly: 2.429 ± 0.376
0.883LysHis: 0.883 ± 0.269
1.049LysIle: 1.049 ± 0.231
1.325LysLys: 1.325 ± 0.331
2.706LysLeu: 2.706 ± 0.483
0.939LysMet: 0.939 ± 0.212
1.049LysAsn: 1.049 ± 0.225
2.429LysPro: 2.429 ± 0.427
1.822LysGln: 1.822 ± 0.232
2.319LysArg: 2.319 ± 0.34
2.209LysSer: 2.209 ± 0.32
1.877LysThr: 1.877 ± 0.356
2.595LysVal: 2.595 ± 0.342
0.939LysTrp: 0.939 ± 0.321
0.883LysTyr: 0.883 ± 0.245
0.0LysXaa: 0.0 ± 0.0
Leu
7.896LeuAla: 7.896 ± 0.79
0.663LeuCys: 0.663 ± 0.201
4.362LeuAsp: 4.362 ± 0.612
4.307LeuGlu: 4.307 ± 0.557
2.043LeuPhe: 2.043 ± 0.288
4.804LeuGly: 4.804 ± 0.565
0.828LeuHis: 0.828 ± 0.236
2.761LeuIle: 2.761 ± 0.336
2.264LeuLys: 2.264 ± 0.337
5.025LeuLeu: 5.025 ± 0.564
1.822LeuMet: 1.822 ± 0.338
2.374LeuAsn: 2.374 ± 0.367
5.356LeuPro: 5.356 ± 0.708
2.429LeuGln: 2.429 ± 0.377
5.466LeuArg: 5.466 ± 0.714
4.748LeuSer: 4.748 ± 0.472
5.522LeuThr: 5.522 ± 0.551
4.748LeuVal: 4.748 ± 0.44
1.215LeuTrp: 1.215 ± 0.257
2.264LeuTyr: 2.264 ± 0.353
0.0LeuXaa: 0.0 ± 0.0
Met
1.988MetAla: 1.988 ± 0.349
0.331MetCys: 0.331 ± 0.221
1.325MetAsp: 1.325 ± 0.241
0.883MetGlu: 0.883 ± 0.189
0.663MetPhe: 0.663 ± 0.192
1.767MetGly: 1.767 ± 0.268
0.166MetHis: 0.166 ± 0.106
0.883MetIle: 0.883 ± 0.231
0.718MetLys: 0.718 ± 0.194
1.877MetLeu: 1.877 ± 0.293
0.663MetMet: 0.663 ± 0.242
1.104MetAsn: 1.104 ± 0.288
1.491MetPro: 1.491 ± 0.307
0.607MetGln: 0.607 ± 0.175
1.712MetArg: 1.712 ± 0.285
2.926MetSer: 2.926 ± 0.393
1.933MetThr: 1.933 ± 0.303
1.436MetVal: 1.436 ± 0.371
0.442MetTrp: 0.442 ± 0.159
0.387MetTyr: 0.387 ± 0.156
0.0MetXaa: 0.0 ± 0.0
Asn
3.423AsnAla: 3.423 ± 0.421
0.387AsnCys: 0.387 ± 0.159
1.822AsnAsp: 1.822 ± 0.266
1.767AsnGlu: 1.767 ± 0.38
0.883AsnPhe: 0.883 ± 0.246
4.196AsnGly: 4.196 ± 0.598
1.049AsnHis: 1.049 ± 0.261
1.601AsnIle: 1.601 ± 0.428
1.16AsnLys: 1.16 ± 0.27
2.374AsnLeu: 2.374 ± 0.344
0.552AsnMet: 0.552 ± 0.138
1.933AsnAsn: 1.933 ± 0.371
3.092AsnPro: 3.092 ± 0.376
1.049AsnGln: 1.049 ± 0.314
1.933AsnArg: 1.933 ± 0.302
1.38AsnSer: 1.38 ± 0.304
2.264AsnThr: 2.264 ± 0.345
2.043AsnVal: 2.043 ± 0.353
0.607AsnTrp: 0.607 ± 0.164
0.883AsnTyr: 0.883 ± 0.181
0.0AsnXaa: 0.0 ± 0.0
Pro
5.798ProAla: 5.798 ± 0.65
0.607ProCys: 0.607 ± 0.199
4.528ProAsp: 4.528 ± 0.566
4.804ProGlu: 4.804 ± 0.46
1.822ProPhe: 1.822 ± 0.326
6.515ProGly: 6.515 ± 0.72
1.601ProHis: 1.601 ± 0.325
2.043ProIle: 2.043 ± 0.308
2.429ProLys: 2.429 ± 0.44
4.583ProLeu: 4.583 ± 0.573
1.491ProMet: 1.491 ± 0.359
2.485ProAsn: 2.485 ± 0.325
4.031ProPro: 4.031 ± 0.651
2.264ProGln: 2.264 ± 0.452
3.423ProArg: 3.423 ± 0.518
3.202ProSer: 3.202 ± 0.392
3.147ProThr: 3.147 ± 0.456
4.252ProVal: 4.252 ± 0.561
1.325ProTrp: 1.325 ± 0.293
1.38ProTyr: 1.38 ± 0.244
0.0ProXaa: 0.0 ± 0.0
Gln
4.252GlnAla: 4.252 ± 0.528
0.221GlnCys: 0.221 ± 0.102
1.712GlnAsp: 1.712 ± 0.269
1.712GlnGlu: 1.712 ± 0.329
1.104GlnPhe: 1.104 ± 0.226
2.65GlnGly: 2.65 ± 0.489
0.828GlnHis: 0.828 ± 0.267
1.656GlnIle: 1.656 ± 0.271
1.215GlnLys: 1.215 ± 0.244
2.706GlnLeu: 2.706 ± 0.436
0.663GlnMet: 0.663 ± 0.202
1.104GlnAsn: 1.104 ± 0.257
3.037GlnPro: 3.037 ± 0.45
1.104GlnGln: 1.104 ± 0.344
2.319GlnArg: 2.319 ± 0.342
2.429GlnSer: 2.429 ± 0.382
1.877GlnThr: 1.877 ± 0.37
2.264GlnVal: 2.264 ± 0.343
0.663GlnTrp: 0.663 ± 0.18
0.883GlnTyr: 0.883 ± 0.276
0.0GlnXaa: 0.0 ± 0.0
Arg
7.178ArgAla: 7.178 ± 0.775
1.436ArgCys: 1.436 ± 0.395
4.417ArgAsp: 4.417 ± 0.547
5.19ArgGlu: 5.19 ± 0.733
2.098ArgPhe: 2.098 ± 0.36
3.865ArgGly: 3.865 ± 0.505
1.38ArgHis: 1.38 ± 0.294
3.865ArgIle: 3.865 ± 0.531
2.485ArgLys: 2.485 ± 0.374
4.859ArgLeu: 4.859 ± 0.514
2.374ArgMet: 2.374 ± 0.34
2.264ArgAsn: 2.264 ± 0.352
4.031ArgPro: 4.031 ± 0.495
2.043ArgGln: 2.043 ± 0.387
5.742ArgArg: 5.742 ± 0.752
3.865ArgSer: 3.865 ± 0.486
3.644ArgThr: 3.644 ± 0.579
5.301ArgVal: 5.301 ± 0.629
1.988ArgTrp: 1.988 ± 0.356
2.264ArgTyr: 2.264 ± 0.306
0.0ArgXaa: 0.0 ± 0.0
Ser
5.687SerAla: 5.687 ± 0.776
0.442SerCys: 0.442 ± 0.196
4.748SerAsp: 4.748 ± 0.546
2.706SerGlu: 2.706 ± 0.411
1.988SerPhe: 1.988 ± 0.327
6.681SerGly: 6.681 ± 1.249
1.491SerHis: 1.491 ± 0.308
2.54SerIle: 2.54 ± 0.429
2.374SerLys: 2.374 ± 0.417
3.644SerLeu: 3.644 ± 0.455
1.38SerMet: 1.38 ± 0.27
2.209SerAsn: 2.209 ± 0.451
3.423SerPro: 3.423 ± 0.334
1.546SerGln: 1.546 ± 0.245
3.368SerArg: 3.368 ± 0.412
4.086SerSer: 4.086 ± 0.815
3.534SerThr: 3.534 ± 0.474
5.08SerVal: 5.08 ± 0.653
1.27SerTrp: 1.27 ± 0.271
1.491SerTyr: 1.491 ± 0.285
0.0SerXaa: 0.0 ± 0.0
Thr
6.405ThrAla: 6.405 ± 0.61
0.607ThrCys: 0.607 ± 0.223
3.865ThrAsp: 3.865 ± 0.634
4.141ThrGlu: 4.141 ± 0.497
1.988ThrPhe: 1.988 ± 0.329
6.239ThrGly: 6.239 ± 0.556
1.436ThrHis: 1.436 ± 0.276
3.534ThrIle: 3.534 ± 0.421
2.153ThrLys: 2.153 ± 0.295
4.086ThrLeu: 4.086 ± 0.473
1.436ThrMet: 1.436 ± 0.256
2.429ThrAsn: 2.429 ± 0.347
4.086ThrPro: 4.086 ± 0.436
2.319ThrGln: 2.319 ± 0.338
4.362ThrArg: 4.362 ± 0.571
3.534ThrSer: 3.534 ± 0.548
4.804ThrThr: 4.804 ± 0.665
5.963ThrVal: 5.963 ± 0.603
1.27ThrTrp: 1.27 ± 0.299
1.215ThrTyr: 1.215 ± 0.252
0.0ThrXaa: 0.0 ± 0.0
Val
6.902ValAla: 6.902 ± 0.596
1.27ValCys: 1.27 ± 0.262
5.135ValAsp: 5.135 ± 0.545
4.693ValGlu: 4.693 ± 0.533
2.319ValPhe: 2.319 ± 0.416
5.522ValGly: 5.522 ± 0.659
1.325ValHis: 1.325 ± 0.247
3.202ValIle: 3.202 ± 0.44
2.209ValLys: 2.209 ± 0.415
5.19ValLeu: 5.19 ± 0.646
1.325ValMet: 1.325 ± 0.255
2.264ValAsn: 2.264 ± 0.378
3.755ValPro: 3.755 ± 0.428
2.926ValGln: 2.926 ± 0.405
4.638ValArg: 4.638 ± 0.593
4.528ValSer: 4.528 ± 0.558
4.914ValThr: 4.914 ± 0.561
5.908ValVal: 5.908 ± 0.785
1.822ValTrp: 1.822 ± 0.402
1.436ValTyr: 1.436 ± 0.29
0.0ValXaa: 0.0 ± 0.0
Trp
1.877TrpAla: 1.877 ± 0.295
0.331TrpCys: 0.331 ± 0.133
1.325TrpAsp: 1.325 ± 0.278
1.325TrpGlu: 1.325 ± 0.306
0.883TrpPhe: 0.883 ± 0.241
1.16TrpGly: 1.16 ± 0.272
0.497TrpHis: 0.497 ± 0.176
1.16TrpIle: 1.16 ± 0.249
0.828TrpLys: 0.828 ± 0.165
1.712TrpLeu: 1.712 ± 0.347
1.16TrpMet: 1.16 ± 0.292
0.442TrpAsn: 0.442 ± 0.259
1.049TrpPro: 1.049 ± 0.253
0.939TrpGln: 0.939 ± 0.27
2.429TrpArg: 2.429 ± 0.416
1.215TrpSer: 1.215 ± 0.287
1.491TrpThr: 1.491 ± 0.281
1.601TrpVal: 1.601 ± 0.41
1.049TrpTrp: 1.049 ± 0.208
0.552TrpTyr: 0.552 ± 0.164
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.54TyrAla: 2.54 ± 0.39
0.276TyrCys: 0.276 ± 0.132
2.098TyrAsp: 2.098 ± 0.37
2.153TyrGlu: 2.153 ± 0.339
0.607TyrPhe: 0.607 ± 0.212
2.153TyrGly: 2.153 ± 0.438
0.387TyrHis: 0.387 ± 0.145
1.16TyrIle: 1.16 ± 0.215
0.828TyrLys: 0.828 ± 0.249
1.933TyrLeu: 1.933 ± 0.307
0.11TyrMet: 0.11 ± 0.073
0.828TyrAsn: 0.828 ± 0.204
1.38TyrPro: 1.38 ± 0.256
0.718TyrGln: 0.718 ± 0.2
2.153TyrArg: 2.153 ± 0.368
0.994TyrSer: 0.994 ± 0.248
1.877TyrThr: 1.877 ± 0.362
2.374TyrVal: 2.374 ± 0.367
0.607TyrTrp: 0.607 ± 0.18
0.718TyrTyr: 0.718 ± 0.172
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 111 proteins (18112 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski