Amino acid dipepetide frequency for Mycobacterium phage Quico

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.525AlaAla: 14.525 ± 1.891
1.126AlaCys: 1.126 ± 0.268
7.503AlaAsp: 7.503 ± 0.633
7.45AlaGlu: 7.45 ± 0.735
3.269AlaPhe: 3.269 ± 0.422
9.808AlaGly: 9.808 ± 1.351
2.412AlaHis: 2.412 ± 0.355
4.609AlaIle: 4.609 ± 0.532
3.913AlaLys: 3.913 ± 0.535
8.468AlaLeu: 8.468 ± 0.862
2.626AlaMet: 2.626 ± 0.393
3.001AlaAsn: 3.001 ± 0.437
4.984AlaPro: 4.984 ± 0.607
4.127AlaGln: 4.127 ± 0.505
6.753AlaArg: 6.753 ± 0.53
5.681AlaSer: 5.681 ± 0.51
5.735AlaThr: 5.735 ± 0.473
6.86AlaVal: 6.86 ± 0.576
2.519AlaTrp: 2.519 ± 0.399
2.573AlaTyr: 2.573 ± 0.34
0.0AlaXaa: 0.0 ± 0.0
Cys
1.018CysAla: 1.018 ± 0.232
0.054CysCys: 0.054 ± 0.043
1.179CysAsp: 1.179 ± 0.258
0.911CysGlu: 0.911 ± 0.207
0.161CysPhe: 0.161 ± 0.101
1.929CysGly: 1.929 ± 0.418
0.107CysHis: 0.107 ± 0.07
0.482CysIle: 0.482 ± 0.157
0.375CysLys: 0.375 ± 0.144
0.643CysLeu: 0.643 ± 0.176
0.268CysMet: 0.268 ± 0.103
0.429CysAsn: 0.429 ± 0.149
1.286CysPro: 1.286 ± 0.274
0.375CysGln: 0.375 ± 0.135
0.858CysArg: 0.858 ± 0.243
0.375CysSer: 0.375 ± 0.13
0.59CysThr: 0.59 ± 0.176
0.429CysVal: 0.429 ± 0.128
0.268CysTrp: 0.268 ± 0.133
0.268CysTyr: 0.268 ± 0.11
0.0CysXaa: 0.0 ± 0.0
Asp
7.021AspAla: 7.021 ± 0.703
1.286AspCys: 1.286 ± 0.248
4.877AspAsp: 4.877 ± 0.458
3.913AspGlu: 3.913 ± 0.479
1.554AspPhe: 1.554 ± 0.196
6.592AspGly: 6.592 ± 0.623
1.233AspHis: 1.233 ± 0.228
2.519AspIle: 2.519 ± 0.42
1.822AspLys: 1.822 ± 0.324
6.432AspLeu: 6.432 ± 0.573
1.179AspMet: 1.179 ± 0.217
1.929AspAsn: 1.929 ± 0.333
4.984AspPro: 4.984 ± 0.59
2.68AspGln: 2.68 ± 0.393
5.092AspArg: 5.092 ± 0.576
3.216AspSer: 3.216 ± 0.469
3.913AspThr: 3.913 ± 0.461
4.716AspVal: 4.716 ± 0.68
1.501AspTrp: 1.501 ± 0.281
2.09AspTyr: 2.09 ± 0.368
0.0AspXaa: 0.0 ± 0.0
Glu
7.182GluAla: 7.182 ± 0.804
0.697GluCys: 0.697 ± 0.222
3.055GluAsp: 3.055 ± 0.383
2.787GluGlu: 2.787 ± 0.383
1.876GluPhe: 1.876 ± 0.258
4.127GluGly: 4.127 ± 0.536
1.233GluHis: 1.233 ± 0.319
2.09GluIle: 2.09 ± 0.325
1.983GluLys: 1.983 ± 0.32
5.145GluLeu: 5.145 ± 0.564
1.661GluMet: 1.661 ± 0.295
1.715GluAsn: 1.715 ± 0.222
3.055GluPro: 3.055 ± 0.415
2.841GluGln: 2.841 ± 0.4
4.234GluArg: 4.234 ± 0.47
2.787GluSer: 2.787 ± 0.48
4.02GluThr: 4.02 ± 0.596
4.448GluVal: 4.448 ± 0.617
1.072GluTrp: 1.072 ± 0.218
1.286GluTyr: 1.286 ± 0.301
0.0GluXaa: 0.0 ± 0.0
Phe
2.787PheAla: 2.787 ± 0.406
0.268PheCys: 0.268 ± 0.122
2.251PheAsp: 2.251 ± 0.345
1.34PheGlu: 1.34 ± 0.249
0.911PhePhe: 0.911 ± 0.243
3.216PheGly: 3.216 ± 0.61
0.429PheHis: 0.429 ± 0.137
1.286PheIle: 1.286 ± 0.306
1.233PheLys: 1.233 ± 0.269
1.554PheLeu: 1.554 ± 0.253
0.804PheMet: 0.804 ± 0.234
1.018PheAsn: 1.018 ± 0.318
1.822PhePro: 1.822 ± 0.31
0.858PheGln: 0.858 ± 0.389
1.876PheArg: 1.876 ± 0.285
2.09PheSer: 2.09 ± 0.338
1.929PheThr: 1.929 ± 0.335
1.876PheVal: 1.876 ± 0.336
0.375PheTrp: 0.375 ± 0.141
0.965PheTyr: 0.965 ± 0.295
0.0PheXaa: 0.0 ± 0.0
Gly
9.862GlyAla: 9.862 ± 1.051
0.965GlyCys: 0.965 ± 0.231
6.753GlyAsp: 6.753 ± 0.627
3.752GlyGlu: 3.752 ± 0.437
3.001GlyPhe: 3.001 ± 0.483
11.309GlyGly: 11.309 ± 2.04
1.554GlyHis: 1.554 ± 0.238
4.609GlyIle: 4.609 ± 0.5
3.001GlyLys: 3.001 ± 0.365
5.52GlyLeu: 5.52 ± 0.479
2.733GlyMet: 2.733 ± 0.439
2.787GlyAsn: 2.787 ± 0.417
4.073GlyPro: 4.073 ± 0.61
2.519GlyGln: 2.519 ± 0.557
4.931GlyArg: 4.931 ± 0.584
5.949GlySer: 5.949 ± 0.832
6.378GlyThr: 6.378 ± 0.8
5.896GlyVal: 5.896 ± 0.595
2.733GlyTrp: 2.733 ± 0.446
2.251GlyTyr: 2.251 ± 0.427
0.0GlyXaa: 0.0 ± 0.0
His
1.769HisAla: 1.769 ± 0.33
0.429HisCys: 0.429 ± 0.157
1.233HisAsp: 1.233 ± 0.242
1.34HisGlu: 1.34 ± 0.244
0.59HisPhe: 0.59 ± 0.148
1.447HisGly: 1.447 ± 0.256
0.911HisHis: 0.911 ± 0.254
1.286HisIle: 1.286 ± 0.257
0.643HisLys: 0.643 ± 0.219
1.394HisLeu: 1.394 ± 0.259
0.429HisMet: 0.429 ± 0.137
0.965HisAsn: 0.965 ± 0.233
1.501HisPro: 1.501 ± 0.269
0.59HisGln: 0.59 ± 0.156
1.983HisArg: 1.983 ± 0.336
0.643HisSer: 0.643 ± 0.156
1.447HisThr: 1.447 ± 0.364
1.34HisVal: 1.34 ± 0.266
0.429HisTrp: 0.429 ± 0.153
0.858HisTyr: 0.858 ± 0.196
0.0HisXaa: 0.0 ± 0.0
Ile
5.306IleAla: 5.306 ± 0.49
0.59IleCys: 0.59 ± 0.206
3.645IleAsp: 3.645 ± 0.487
3.216IleGlu: 3.216 ± 0.446
0.697IlePhe: 0.697 ± 0.255
3.645IleGly: 3.645 ± 0.453
1.447IleHis: 1.447 ± 0.306
1.179IleIle: 1.179 ± 0.235
1.233IleLys: 1.233 ± 0.224
2.519IleLeu: 2.519 ± 0.394
0.482IleMet: 0.482 ± 0.156
2.037IleAsn: 2.037 ± 0.29
2.894IlePro: 2.894 ± 0.393
1.34IleGln: 1.34 ± 0.235
2.787IleArg: 2.787 ± 0.399
2.037IleSer: 2.037 ± 0.386
3.645IleThr: 3.645 ± 0.413
3.001IleVal: 3.001 ± 0.335
0.858IleTrp: 0.858 ± 0.24
0.804IleTyr: 0.804 ± 0.18
0.0IleXaa: 0.0 ± 0.0
Lys
3.752LysAla: 3.752 ± 0.514
0.322LysCys: 0.322 ± 0.158
1.661LysAsp: 1.661 ± 0.321
1.34LysGlu: 1.34 ± 0.27
1.126LysPhe: 1.126 ± 0.243
2.626LysGly: 2.626 ± 0.441
1.072LysHis: 1.072 ± 0.248
1.018LysIle: 1.018 ± 0.298
1.501LysLys: 1.501 ± 0.309
2.733LysLeu: 2.733 ± 0.522
0.59LysMet: 0.59 ± 0.157
1.018LysAsn: 1.018 ± 0.233
2.733LysPro: 2.733 ± 0.409
1.501LysGln: 1.501 ± 0.224
2.197LysArg: 2.197 ± 0.341
2.144LysSer: 2.144 ± 0.283
2.305LysThr: 2.305 ± 0.357
2.09LysVal: 2.09 ± 0.342
1.179LysTrp: 1.179 ± 0.247
0.804LysTyr: 0.804 ± 0.233
0.0LysXaa: 0.0 ± 0.0
Leu
7.611LeuAla: 7.611 ± 0.956
0.911LeuCys: 0.911 ± 0.21
5.735LeuAsp: 5.735 ± 0.612
3.752LeuGlu: 3.752 ± 0.427
1.769LeuPhe: 1.769 ± 0.259
5.735LeuGly: 5.735 ± 0.546
0.911LeuHis: 0.911 ± 0.249
3.269LeuIle: 3.269 ± 0.427
2.251LeuLys: 2.251 ± 0.358
4.984LeuLeu: 4.984 ± 0.577
1.447LeuMet: 1.447 ± 0.281
2.197LeuAsn: 2.197 ± 0.398
5.52LeuPro: 5.52 ± 0.582
2.68LeuGln: 2.68 ± 0.434
5.788LeuArg: 5.788 ± 0.61
4.502LeuSer: 4.502 ± 0.505
5.306LeuThr: 5.306 ± 0.529
5.52LeuVal: 5.52 ± 0.532
1.286LeuTrp: 1.286 ± 0.306
2.09LeuTyr: 2.09 ± 0.342
0.0LeuXaa: 0.0 ± 0.0
Met
1.983MetAla: 1.983 ± 0.35
0.322MetCys: 0.322 ± 0.158
1.233MetAsp: 1.233 ± 0.273
0.911MetGlu: 0.911 ± 0.177
0.643MetPhe: 0.643 ± 0.215
1.929MetGly: 1.929 ± 0.279
0.161MetHis: 0.161 ± 0.09
1.018MetIle: 1.018 ± 0.233
1.179MetLys: 1.179 ± 0.25
1.179MetLeu: 1.179 ± 0.194
0.59MetMet: 0.59 ± 0.248
1.34MetAsn: 1.34 ± 0.241
1.661MetPro: 1.661 ± 0.291
0.536MetGln: 0.536 ± 0.17
1.608MetArg: 1.608 ± 0.262
2.841MetSer: 2.841 ± 0.339
1.769MetThr: 1.769 ± 0.272
1.233MetVal: 1.233 ± 0.292
0.375MetTrp: 0.375 ± 0.121
0.322MetTyr: 0.322 ± 0.123
0.0MetXaa: 0.0 ± 0.0
Asn
3.537AsnAla: 3.537 ± 0.412
0.214AsnCys: 0.214 ± 0.093
1.822AsnAsp: 1.822 ± 0.248
1.876AsnGlu: 1.876 ± 0.268
0.75AsnPhe: 0.75 ± 0.304
4.395AsnGly: 4.395 ± 0.669
0.75AsnHis: 0.75 ± 0.168
1.394AsnIle: 1.394 ± 0.337
0.858AsnLys: 0.858 ± 0.196
2.412AsnLeu: 2.412 ± 0.331
0.75AsnMet: 0.75 ± 0.177
1.876AsnAsn: 1.876 ± 0.37
2.733AsnPro: 2.733 ± 0.416
1.179AsnGln: 1.179 ± 0.399
2.251AsnArg: 2.251 ± 0.368
2.037AsnSer: 2.037 ± 0.319
2.787AsnThr: 2.787 ± 0.321
1.929AsnVal: 1.929 ± 0.302
0.59AsnTrp: 0.59 ± 0.151
0.375AsnTyr: 0.375 ± 0.14
0.0AsnXaa: 0.0 ± 0.0
Pro
5.788ProAla: 5.788 ± 0.643
0.643ProCys: 0.643 ± 0.181
4.716ProAsp: 4.716 ± 0.493
4.073ProGlu: 4.073 ± 0.491
2.09ProPhe: 2.09 ± 0.426
6.217ProGly: 6.217 ± 0.65
1.447ProHis: 1.447 ± 0.242
2.144ProIle: 2.144 ± 0.264
2.251ProLys: 2.251 ± 0.325
4.127ProLeu: 4.127 ± 0.599
1.286ProMet: 1.286 ± 0.309
2.626ProAsn: 2.626 ± 0.348
3.752ProPro: 3.752 ± 0.53
2.251ProGln: 2.251 ± 0.34
3.43ProArg: 3.43 ± 0.427
3.377ProSer: 3.377 ± 0.434
3.537ProThr: 3.537 ± 0.439
4.234ProVal: 4.234 ± 0.438
1.233ProTrp: 1.233 ± 0.194
1.554ProTyr: 1.554 ± 0.317
0.0ProXaa: 0.0 ± 0.0
Gln
4.877GlnAla: 4.877 ± 0.71
0.322GlnCys: 0.322 ± 0.12
1.769GlnAsp: 1.769 ± 0.277
2.037GlnGlu: 2.037 ± 0.375
0.965GlnPhe: 0.965 ± 0.241
1.929GlnGly: 1.929 ± 0.404
0.858GlnHis: 0.858 ± 0.208
1.822GlnIle: 1.822 ± 0.348
1.233GlnLys: 1.233 ± 0.25
3.055GlnLeu: 3.055 ± 0.407
0.643GlnMet: 0.643 ± 0.195
0.75GlnAsn: 0.75 ± 0.173
2.305GlnPro: 2.305 ± 0.414
1.661GlnGln: 1.661 ± 0.389
2.841GlnArg: 2.841 ± 0.338
2.251GlnSer: 2.251 ± 0.36
1.715GlnThr: 1.715 ± 0.269
2.465GlnVal: 2.465 ± 0.405
0.75GlnTrp: 0.75 ± 0.161
0.965GlnTyr: 0.965 ± 0.229
0.0GlnXaa: 0.0 ± 0.0
Arg
6.753ArgAla: 6.753 ± 0.576
1.34ArgCys: 1.34 ± 0.349
4.663ArgAsp: 4.663 ± 0.504
4.288ArgGlu: 4.288 ± 0.491
2.358ArgPhe: 2.358 ± 0.315
4.288ArgGly: 4.288 ± 0.422
1.554ArgHis: 1.554 ± 0.335
3.377ArgIle: 3.377 ± 0.484
2.305ArgLys: 2.305 ± 0.393
5.092ArgLeu: 5.092 ± 0.619
2.465ArgMet: 2.465 ± 0.414
2.626ArgAsn: 2.626 ± 0.366
3.484ArgPro: 3.484 ± 0.423
2.305ArgGln: 2.305 ± 0.367
5.36ArgArg: 5.36 ± 0.732
4.127ArgSer: 4.127 ± 0.432
3.216ArgThr: 3.216 ± 0.47
4.395ArgVal: 4.395 ± 0.583
1.661ArgTrp: 1.661 ± 0.283
1.929ArgTyr: 1.929 ± 0.286
0.0ArgXaa: 0.0 ± 0.0
Ser
6.056SerAla: 6.056 ± 0.753
0.59SerCys: 0.59 ± 0.203
4.02SerAsp: 4.02 ± 0.477
3.162SerGlu: 3.162 ± 0.411
1.983SerPhe: 1.983 ± 0.372
6.807SerGly: 6.807 ± 1.018
1.126SerHis: 1.126 ± 0.24
2.841SerIle: 2.841 ± 0.451
2.251SerLys: 2.251 ± 0.312
3.698SerLeu: 3.698 ± 0.412
1.34SerMet: 1.34 ± 0.246
2.037SerAsn: 2.037 ± 0.496
3.162SerPro: 3.162 ± 0.364
1.501SerGln: 1.501 ± 0.199
3.269SerArg: 3.269 ± 0.377
3.698SerSer: 3.698 ± 0.558
3.43SerThr: 3.43 ± 0.44
4.609SerVal: 4.609 ± 0.674
1.554SerTrp: 1.554 ± 0.276
1.126SerTyr: 1.126 ± 0.251
0.0SerXaa: 0.0 ± 0.0
Thr
5.949ThrAla: 5.949 ± 0.658
0.375ThrCys: 0.375 ± 0.141
3.645ThrAsp: 3.645 ± 0.545
3.162ThrGlu: 3.162 ± 0.367
1.501ThrPhe: 1.501 ± 0.337
6.056ThrGly: 6.056 ± 0.497
1.661ThrHis: 1.661 ± 0.279
3.109ThrIle: 3.109 ± 0.41
2.251ThrLys: 2.251 ± 0.356
4.609ThrLeu: 4.609 ± 0.501
0.697ThrMet: 0.697 ± 0.213
2.358ThrAsn: 2.358 ± 0.413
4.395ThrPro: 4.395 ± 0.425
2.144ThrGln: 2.144 ± 0.283
4.02ThrArg: 4.02 ± 0.451
3.698ThrSer: 3.698 ± 0.382
4.931ThrThr: 4.931 ± 0.514
6.164ThrVal: 6.164 ± 0.662
1.233ThrTrp: 1.233 ± 0.255
2.144ThrTyr: 2.144 ± 0.361
0.0ThrXaa: 0.0 ± 0.0
Val
7.611ValAla: 7.611 ± 0.642
0.965ValCys: 0.965 ± 0.193
5.36ValAsp: 5.36 ± 0.53
4.663ValGlu: 4.663 ± 0.594
2.144ValPhe: 2.144 ± 0.39
5.52ValGly: 5.52 ± 0.638
1.286ValHis: 1.286 ± 0.276
2.841ValIle: 2.841 ± 0.378
2.197ValLys: 2.197 ± 0.335
6.324ValLeu: 6.324 ± 0.607
1.554ValMet: 1.554 ± 0.223
2.412ValAsn: 2.412 ± 0.35
4.02ValPro: 4.02 ± 0.36
2.251ValGln: 2.251 ± 0.388
4.127ValArg: 4.127 ± 0.496
4.448ValSer: 4.448 ± 0.586
4.395ValThr: 4.395 ± 0.433
6.378ValVal: 6.378 ± 0.65
1.822ValTrp: 1.822 ± 0.39
1.554ValTyr: 1.554 ± 0.3
0.0ValXaa: 0.0 ± 0.0
Trp
1.822TrpAla: 1.822 ± 0.308
0.268TrpCys: 0.268 ± 0.156
1.501TrpAsp: 1.501 ± 0.299
1.501TrpGlu: 1.501 ± 0.257
0.804TrpPhe: 0.804 ± 0.207
0.858TrpGly: 0.858 ± 0.202
0.643TrpHis: 0.643 ± 0.171
1.286TrpIle: 1.286 ± 0.259
0.536TrpLys: 0.536 ± 0.157
1.554TrpLeu: 1.554 ± 0.326
1.072TrpMet: 1.072 ± 0.25
0.643TrpAsn: 0.643 ± 0.214
1.126TrpPro: 1.126 ± 0.267
0.965TrpGln: 0.965 ± 0.253
2.251TrpArg: 2.251 ± 0.346
1.286TrpSer: 1.286 ± 0.292
1.447TrpThr: 1.447 ± 0.327
1.876TrpVal: 1.876 ± 0.374
0.697TrpTrp: 0.697 ± 0.194
0.643TrpTyr: 0.643 ± 0.189
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.733TyrAla: 2.733 ± 0.368
0.322TyrCys: 0.322 ± 0.141
1.929TyrAsp: 1.929 ± 0.45
1.822TyrGlu: 1.822 ± 0.318
0.697TyrPhe: 0.697 ± 0.175
2.144TyrGly: 2.144 ± 0.474
0.429TyrHis: 0.429 ± 0.129
1.179TyrIle: 1.179 ± 0.248
0.697TyrLys: 0.697 ± 0.221
1.822TyrLeu: 1.822 ± 0.303
0.161TyrMet: 0.161 ± 0.092
0.75TyrAsn: 0.75 ± 0.197
1.34TyrPro: 1.34 ± 0.265
0.911TyrGln: 0.911 ± 0.208
1.929TyrArg: 1.929 ± 0.307
1.072TyrSer: 1.072 ± 0.251
1.608TyrThr: 1.608 ± 0.35
2.412TyrVal: 2.412 ± 0.306
0.643TyrTrp: 0.643 ± 0.192
0.643TyrTyr: 0.643 ± 0.184
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 112 proteins (18659 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski