Amino acid dipepetide frequency for Mycobacterium phage Panchino

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.22AlaAla: 20.22 ± 2.813
0.922AlaCys: 0.922 ± 0.35
8.868AlaAsp: 8.868 ± 0.88
8.088AlaGlu: 8.088 ± 0.903
3.193AlaPhe: 3.193 ± 0.59
9.578AlaGly: 9.578 ± 1.317
2.199AlaHis: 2.199 ± 0.413
5.321AlaIle: 5.321 ± 0.682
3.476AlaLys: 3.476 ± 0.537
9.436AlaLeu: 9.436 ± 0.845
2.412AlaMet: 2.412 ± 0.477
3.335AlaAsn: 3.335 ± 0.571
6.456AlaPro: 6.456 ± 0.628
4.753AlaGln: 4.753 ± 0.729
7.946AlaArg: 7.946 ± 0.746
5.534AlaSer: 5.534 ± 0.62
8.088AlaThr: 8.088 ± 0.803
6.953AlaVal: 6.953 ± 0.645
2.412AlaTrp: 2.412 ± 0.472
2.057AlaTyr: 2.057 ± 0.449
0.0AlaXaa: 0.0 ± 0.0
Cys
1.419CysAla: 1.419 ± 0.43
0.142CysCys: 0.142 ± 0.099
0.851CysAsp: 0.851 ± 0.212
0.709CysGlu: 0.709 ± 0.23
0.071CysPhe: 0.071 ± 0.082
1.845CysGly: 1.845 ± 0.443
0.213CysHis: 0.213 ± 0.112
0.213CysIle: 0.213 ± 0.136
0.213CysLys: 0.213 ± 0.124
0.355CysLeu: 0.355 ± 0.156
0.213CysMet: 0.213 ± 0.107
0.426CysAsn: 0.426 ± 0.186
0.993CysPro: 0.993 ± 0.371
0.284CysGln: 0.284 ± 0.142
0.922CysArg: 0.922 ± 0.262
0.568CysSer: 0.568 ± 0.183
0.78CysThr: 0.78 ± 0.223
0.568CysVal: 0.568 ± 0.218
0.142CysTrp: 0.142 ± 0.098
0.284CysTyr: 0.284 ± 0.147
0.0CysXaa: 0.0 ± 0.0
Asp
7.449AspAla: 7.449 ± 0.664
0.993AspCys: 0.993 ± 0.306
5.676AspAsp: 5.676 ± 0.803
4.966AspGlu: 4.966 ± 0.596
2.057AspPhe: 2.057 ± 0.385
7.095AspGly: 7.095 ± 0.616
1.277AspHis: 1.277 ± 0.266
2.767AspIle: 2.767 ± 0.433
1.632AspLys: 1.632 ± 0.316
6.031AspLeu: 6.031 ± 0.556
1.348AspMet: 1.348 ± 0.324
1.845AspAsn: 1.845 ± 0.391
4.895AspPro: 4.895 ± 0.648
2.341AspGln: 2.341 ± 0.387
4.683AspArg: 4.683 ± 0.708
3.831AspSer: 3.831 ± 0.775
2.412AspThr: 2.412 ± 0.407
3.973AspVal: 3.973 ± 0.714
1.135AspTrp: 1.135 ± 0.269
1.703AspTyr: 1.703 ± 0.397
0.0AspXaa: 0.0 ± 0.0
Glu
6.243GluAla: 6.243 ± 0.652
0.709GluCys: 0.709 ± 0.209
3.051GluAsp: 3.051 ± 0.511
2.057GluGlu: 2.057 ± 0.436
2.199GluPhe: 2.199 ± 0.42
3.264GluGly: 3.264 ± 0.591
0.922GluHis: 0.922 ± 0.271
2.696GluIle: 2.696 ± 0.414
2.696GluLys: 2.696 ± 0.367
5.747GluLeu: 5.747 ± 0.587
1.845GluMet: 1.845 ± 0.345
1.845GluAsn: 1.845 ± 0.263
2.696GluPro: 2.696 ± 0.464
3.122GluGln: 3.122 ± 0.441
4.186GluArg: 4.186 ± 0.479
2.767GluSer: 2.767 ± 0.506
4.328GluThr: 4.328 ± 0.659
3.973GluVal: 3.973 ± 0.722
1.419GluTrp: 1.419 ± 0.358
1.561GluTyr: 1.561 ± 0.291
0.0GluXaa: 0.0 ± 0.0
Phe
3.831PheAla: 3.831 ± 0.554
0.213PheCys: 0.213 ± 0.157
2.199PheAsp: 2.199 ± 0.43
1.632PheGlu: 1.632 ± 0.346
0.922PhePhe: 0.922 ± 0.272
2.767PheGly: 2.767 ± 0.481
0.922PheHis: 0.922 ± 0.255
1.135PheIle: 1.135 ± 0.272
0.993PheLys: 0.993 ± 0.254
1.987PheLeu: 1.987 ± 0.443
0.213PheMet: 0.213 ± 0.168
1.064PheAsn: 1.064 ± 0.275
1.277PhePro: 1.277 ± 0.292
0.497PheGln: 0.497 ± 0.158
1.916PheArg: 1.916 ± 0.292
1.348PheSer: 1.348 ± 0.226
2.341PheThr: 2.341 ± 0.419
2.483PheVal: 2.483 ± 0.417
0.355PheTrp: 0.355 ± 0.139
0.78PheTyr: 0.78 ± 0.219
0.0PheXaa: 0.0 ± 0.0
Gly
8.088GlyAla: 8.088 ± 1.371
1.135GlyCys: 1.135 ± 0.336
6.101GlyAsp: 6.101 ± 0.603
3.973GlyGlu: 3.973 ± 0.567
2.767GlyPhe: 2.767 ± 0.523
10.5GlyGly: 10.5 ± 1.523
1.632GlyHis: 1.632 ± 0.354
4.257GlyIle: 4.257 ± 0.738
2.767GlyLys: 2.767 ± 0.528
6.598GlyLeu: 6.598 ± 0.666
1.774GlyMet: 1.774 ± 0.38
2.98GlyAsn: 2.98 ± 0.399
4.47GlyPro: 4.47 ± 0.831
3.902GlyGln: 3.902 ± 0.478
5.889GlyArg: 5.889 ± 0.628
6.101GlySer: 6.101 ± 0.819
6.456GlyThr: 6.456 ± 0.674
6.456GlyVal: 6.456 ± 0.711
2.057GlyTrp: 2.057 ± 0.383
2.909GlyTyr: 2.909 ± 0.578
0.0GlyXaa: 0.0 ± 0.0
His
1.632HisAla: 1.632 ± 0.367
0.355HisCys: 0.355 ± 0.15
1.348HisAsp: 1.348 ± 0.221
1.419HisGlu: 1.419 ± 0.328
0.284HisPhe: 0.284 ± 0.145
1.916HisGly: 1.916 ± 0.384
0.922HisHis: 0.922 ± 0.341
0.639HisIle: 0.639 ± 0.185
0.709HisLys: 0.709 ± 0.219
1.277HisLeu: 1.277 ± 0.365
0.426HisMet: 0.426 ± 0.183
0.426HisAsn: 0.426 ± 0.168
1.064HisPro: 1.064 ± 0.318
0.851HisGln: 0.851 ± 0.196
1.987HisArg: 1.987 ± 0.396
0.639HisSer: 0.639 ± 0.209
1.206HisThr: 1.206 ± 0.303
1.277HisVal: 1.277 ± 0.337
0.426HisTrp: 0.426 ± 0.157
0.709HisTyr: 0.709 ± 0.178
0.0HisXaa: 0.0 ± 0.0
Ile
5.25IleAla: 5.25 ± 0.546
0.497IleCys: 0.497 ± 0.2
3.547IleAsp: 3.547 ± 0.547
3.831IleGlu: 3.831 ± 0.448
0.78IlePhe: 0.78 ± 0.263
4.895IleGly: 4.895 ± 0.719
0.851IleHis: 0.851 ± 0.265
1.135IleIle: 1.135 ± 0.299
0.993IleLys: 0.993 ± 0.259
2.625IleLeu: 2.625 ± 0.421
0.497IleMet: 0.497 ± 0.193
1.703IleAsn: 1.703 ± 0.304
2.554IlePro: 2.554 ± 0.474
1.206IleGln: 1.206 ± 0.301
3.547IleArg: 3.547 ± 0.503
2.27IleSer: 2.27 ± 0.36
4.044IleThr: 4.044 ± 0.517
3.476IleVal: 3.476 ± 0.428
0.709IleTrp: 0.709 ± 0.191
0.922IleTyr: 0.922 ± 0.227
0.0IleXaa: 0.0 ± 0.0
Lys
4.541LysAla: 4.541 ± 0.889
0.213LysCys: 0.213 ± 0.127
1.135LysAsp: 1.135 ± 0.3
1.135LysGlu: 1.135 ± 0.266
0.851LysPhe: 0.851 ± 0.214
2.909LysGly: 2.909 ± 0.473
0.355LysHis: 0.355 ± 0.155
1.774LysIle: 1.774 ± 0.329
0.639LysLys: 0.639 ± 0.31
2.412LysLeu: 2.412 ± 0.562
0.78LysMet: 0.78 ± 0.236
0.78LysAsn: 0.78 ± 0.224
2.27LysPro: 2.27 ± 0.389
1.561LysGln: 1.561 ± 0.343
2.625LysArg: 2.625 ± 0.477
1.774LysSer: 1.774 ± 0.352
1.348LysThr: 1.348 ± 0.269
2.412LysVal: 2.412 ± 0.327
0.426LysTrp: 0.426 ± 0.16
0.851LysTyr: 0.851 ± 0.187
0.0LysXaa: 0.0 ± 0.0
Leu
10.358LeuAla: 10.358 ± 0.918
0.78LeuCys: 0.78 ± 0.293
5.392LeuAsp: 5.392 ± 0.546
3.547LeuGlu: 3.547 ± 0.361
2.838LeuPhe: 2.838 ± 0.454
8.017LeuGly: 8.017 ± 0.88
1.561LeuHis: 1.561 ± 0.3
3.76LeuIle: 3.76 ± 0.557
3.193LeuLys: 3.193 ± 0.503
7.52LeuLeu: 7.52 ± 0.804
1.419LeuMet: 1.419 ± 0.393
2.554LeuAsn: 2.554 ± 0.472
3.335LeuPro: 3.335 ± 0.517
2.554LeuGln: 2.554 ± 0.467
5.747LeuArg: 5.747 ± 0.681
4.115LeuSer: 4.115 ± 0.489
5.037LeuThr: 5.037 ± 0.626
5.179LeuVal: 5.179 ± 0.547
1.064LeuTrp: 1.064 ± 0.185
1.561LeuTyr: 1.561 ± 0.292
0.0LeuXaa: 0.0 ± 0.0
Met
3.051MetAla: 3.051 ± 0.512
0.0MetCys: 0.0 ± 0.0
0.851MetAsp: 0.851 ± 0.262
0.709MetGlu: 0.709 ± 0.239
0.851MetPhe: 0.851 ± 0.257
0.639MetGly: 0.639 ± 0.188
0.355MetHis: 0.355 ± 0.155
0.993MetIle: 0.993 ± 0.239
0.568MetLys: 0.568 ± 0.215
1.561MetLeu: 1.561 ± 0.346
0.568MetMet: 0.568 ± 0.213
0.639MetAsn: 0.639 ± 0.213
1.419MetPro: 1.419 ± 0.332
0.78MetGln: 0.78 ± 0.222
1.206MetArg: 1.206 ± 0.324
2.625MetSer: 2.625 ± 0.264
2.128MetThr: 2.128 ± 0.41
1.064MetVal: 1.064 ± 0.269
0.639MetTrp: 0.639 ± 0.266
0.213MetTyr: 0.213 ± 0.102
0.0MetXaa: 0.0 ± 0.0
Asn
3.831AsnAla: 3.831 ± 0.783
0.426AsnCys: 0.426 ± 0.164
1.774AsnAsp: 1.774 ± 0.414
1.064AsnGlu: 1.064 ± 0.31
0.993AsnPhe: 0.993 ± 0.248
4.044AsnGly: 4.044 ± 0.611
0.213AsnHis: 0.213 ± 0.132
1.774AsnIle: 1.774 ± 0.394
0.639AsnLys: 0.639 ± 0.219
2.412AsnLeu: 2.412 ± 0.465
0.426AsnMet: 0.426 ± 0.142
1.064AsnAsn: 1.064 ± 0.217
2.767AsnPro: 2.767 ± 0.391
1.348AsnGln: 1.348 ± 0.309
1.561AsnArg: 1.561 ± 0.326
1.277AsnSer: 1.277 ± 0.274
1.845AsnThr: 1.845 ± 0.345
1.845AsnVal: 1.845 ± 0.298
0.639AsnTrp: 0.639 ± 0.212
0.355AsnTyr: 0.355 ± 0.202
0.0AsnXaa: 0.0 ± 0.0
Pro
6.953ProAla: 6.953 ± 0.916
0.639ProCys: 0.639 ± 0.244
4.824ProAsp: 4.824 ± 0.531
4.044ProGlu: 4.044 ± 0.687
2.128ProPhe: 2.128 ± 0.342
5.889ProGly: 5.889 ± 0.812
0.78ProHis: 0.78 ± 0.239
1.987ProIle: 1.987 ± 0.364
1.206ProLys: 1.206 ± 0.351
4.044ProLeu: 4.044 ± 0.613
1.561ProMet: 1.561 ± 0.353
1.561ProAsn: 1.561 ± 0.305
3.051ProPro: 3.051 ± 0.471
1.632ProGln: 1.632 ± 0.337
3.193ProArg: 3.193 ± 0.437
2.767ProSer: 2.767 ± 0.469
3.831ProThr: 3.831 ± 0.543
4.044ProVal: 4.044 ± 0.48
1.49ProTrp: 1.49 ± 0.414
1.419ProTyr: 1.419 ± 0.471
0.0ProXaa: 0.0 ± 0.0
Gln
4.753GlnAla: 4.753 ± 0.938
0.568GlnCys: 0.568 ± 0.218
1.49GlnAsp: 1.49 ± 0.423
1.703GlnGlu: 1.703 ± 0.43
0.993GlnPhe: 0.993 ± 0.252
1.774GlnGly: 1.774 ± 0.323
1.135GlnHis: 1.135 ± 0.258
2.554GlnIle: 2.554 ± 0.379
1.348GlnLys: 1.348 ± 0.312
3.973GlnLeu: 3.973 ± 0.55
0.993GlnMet: 0.993 ± 0.277
0.497GlnAsn: 0.497 ± 0.238
2.412GlnPro: 2.412 ± 0.455
2.128GlnGln: 2.128 ± 0.403
2.767GlnArg: 2.767 ± 0.453
1.561GlnSer: 1.561 ± 0.326
1.987GlnThr: 1.987 ± 0.363
3.051GlnVal: 3.051 ± 0.429
0.639GlnTrp: 0.639 ± 0.193
0.993GlnTyr: 0.993 ± 0.262
0.0GlnXaa: 0.0 ± 0.0
Arg
6.456ArgAla: 6.456 ± 0.717
1.277ArgCys: 1.277 ± 0.371
5.037ArgAsp: 5.037 ± 0.647
4.47ArgGlu: 4.47 ± 0.729
1.703ArgPhe: 1.703 ± 0.403
4.612ArgGly: 4.612 ± 0.538
2.128ArgHis: 2.128 ± 0.436
3.547ArgIle: 3.547 ± 0.501
2.483ArgLys: 2.483 ± 0.477
5.747ArgLeu: 5.747 ± 0.553
2.27ArgMet: 2.27 ± 0.524
1.987ArgAsn: 1.987 ± 0.342
3.831ArgPro: 3.831 ± 0.47
2.767ArgGln: 2.767 ± 0.547
6.527ArgArg: 6.527 ± 0.835
2.838ArgSer: 2.838 ± 0.363
4.399ArgThr: 4.399 ± 0.57
4.399ArgVal: 4.399 ± 0.617
1.561ArgTrp: 1.561 ± 0.282
1.987ArgTyr: 1.987 ± 0.335
0.0ArgXaa: 0.0 ± 0.0
Ser
6.598SerAla: 6.598 ± 0.721
0.355SerCys: 0.355 ± 0.162
3.618SerAsp: 3.618 ± 0.5
1.916SerGlu: 1.916 ± 0.312
1.206SerPhe: 1.206 ± 0.285
5.818SerGly: 5.818 ± 0.998
0.78SerHis: 0.78 ± 0.255
2.199SerIle: 2.199 ± 0.361
1.632SerLys: 1.632 ± 0.333
4.186SerLeu: 4.186 ± 0.739
1.916SerMet: 1.916 ± 0.388
1.419SerAsn: 1.419 ± 0.346
2.838SerPro: 2.838 ± 0.431
1.774SerGln: 1.774 ± 0.364
2.838SerArg: 2.838 ± 0.465
3.264SerSer: 3.264 ± 0.507
3.902SerThr: 3.902 ± 0.563
3.335SerVal: 3.335 ± 0.453
0.851SerTrp: 0.851 ± 0.197
1.561SerTyr: 1.561 ± 0.271
0.0SerXaa: 0.0 ± 0.0
Thr
8.301ThrAla: 8.301 ± 0.747
0.639ThrCys: 0.639 ± 0.217
4.257ThrAsp: 4.257 ± 0.547
3.831ThrGlu: 3.831 ± 0.474
2.128ThrPhe: 2.128 ± 0.431
7.095ThrGly: 7.095 ± 0.791
0.78ThrHis: 0.78 ± 0.249
3.122ThrIle: 3.122 ± 0.558
2.27ThrLys: 2.27 ± 0.373
5.392ThrLeu: 5.392 ± 0.527
0.639ThrMet: 0.639 ± 0.19
1.632ThrAsn: 1.632 ± 0.39
4.612ThrPro: 4.612 ± 0.661
2.057ThrGln: 2.057 ± 0.447
4.47ThrArg: 4.47 ± 0.483
2.98ThrSer: 2.98 ± 0.48
3.618ThrThr: 3.618 ± 0.475
5.889ThrVal: 5.889 ± 0.789
0.851ThrTrp: 0.851 ± 0.312
1.632ThrTyr: 1.632 ± 0.314
0.0ThrXaa: 0.0 ± 0.0
Val
8.301ValAla: 8.301 ± 0.723
0.639ValCys: 0.639 ± 0.182
4.824ValAsp: 4.824 ± 0.835
5.534ValGlu: 5.534 ± 0.696
1.632ValPhe: 1.632 ± 0.342
4.895ValGly: 4.895 ± 0.594
1.206ValHis: 1.206 ± 0.275
3.689ValIle: 3.689 ± 0.456
2.341ValLys: 2.341 ± 0.521
4.541ValLeu: 4.541 ± 0.556
1.064ValMet: 1.064 ± 0.248
2.909ValAsn: 2.909 ± 0.492
3.973ValPro: 3.973 ± 0.529
2.341ValGln: 2.341 ± 0.4
4.753ValArg: 4.753 ± 0.716
3.689ValSer: 3.689 ± 0.524
5.179ValThr: 5.179 ± 0.629
6.385ValVal: 6.385 ± 0.72
1.206ValTrp: 1.206 ± 0.263
1.277ValTyr: 1.277 ± 0.306
0.0ValXaa: 0.0 ± 0.0
Trp
1.774TrpAla: 1.774 ± 0.333
0.426TrpCys: 0.426 ± 0.166
1.419TrpAsp: 1.419 ± 0.271
0.78TrpGlu: 0.78 ± 0.258
0.639TrpPhe: 0.639 ± 0.251
1.277TrpGly: 1.277 ± 0.284
0.709TrpHis: 0.709 ± 0.197
0.851TrpIle: 0.851 ± 0.234
0.426TrpLys: 0.426 ± 0.154
1.703TrpLeu: 1.703 ± 0.43
0.213TrpMet: 0.213 ± 0.127
0.639TrpAsn: 0.639 ± 0.167
0.922TrpPro: 0.922 ± 0.302
0.639TrpGln: 0.639 ± 0.212
1.348TrpArg: 1.348 ± 0.293
1.135TrpSer: 1.135 ± 0.284
1.49TrpThr: 1.49 ± 0.334
1.703TrpVal: 1.703 ± 0.315
0.639TrpTrp: 0.639 ± 0.209
0.426TrpTyr: 0.426 ± 0.179
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.199TyrAla: 2.199 ± 0.51
0.284TyrCys: 0.284 ± 0.134
1.845TyrAsp: 1.845 ± 0.455
1.916TyrGlu: 1.916 ± 0.382
0.639TyrPhe: 0.639 ± 0.228
1.987TyrGly: 1.987 ± 0.275
0.568TyrHis: 0.568 ± 0.203
0.922TyrIle: 0.922 ± 0.283
0.568TyrLys: 0.568 ± 0.175
1.987TyrLeu: 1.987 ± 0.326
0.213TyrMet: 0.213 ± 0.122
0.993TyrAsn: 0.993 ± 0.224
1.206TyrPro: 1.206 ± 0.302
0.78TyrGln: 0.78 ± 0.206
1.916TyrArg: 1.916 ± 0.326
0.922TyrSer: 0.922 ± 0.273
1.774TyrThr: 1.774 ± 0.394
1.916TyrVal: 1.916 ± 0.518
0.568TyrTrp: 0.568 ± 0.224
0.78TyrTyr: 0.78 ± 0.206
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (14096 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski