Amino acid dipepetide frequency for Mycobacterium virus JC27

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.847AlaAla: 12.847 ± 1.303
0.556AlaCys: 0.556 ± 0.158
7.227AlaAsp: 7.227 ± 0.703
5.744AlaGlu: 5.744 ± 0.744
3.027AlaPhe: 3.027 ± 0.505
7.412AlaGly: 7.412 ± 0.879
1.606AlaHis: 1.606 ± 0.371
4.324AlaIle: 4.324 ± 0.597
4.324AlaLys: 4.324 ± 0.593
9.018AlaLeu: 9.018 ± 0.84
2.656AlaMet: 2.656 ± 0.346
2.409AlaAsn: 2.409 ± 0.399
4.88AlaPro: 4.88 ± 0.759
3.088AlaGln: 3.088 ± 0.53
6.238AlaArg: 6.238 ± 0.507
5.127AlaSer: 5.127 ± 0.747
6.115AlaThr: 6.115 ± 0.583
8.277AlaVal: 8.277 ± 0.763
1.853AlaTrp: 1.853 ± 0.382
2.594AlaTyr: 2.594 ± 0.329
0.0AlaXaa: 0.0 ± 0.0
Cys
0.865CysAla: 0.865 ± 0.295
0.062CysCys: 0.062 ± 0.063
0.494CysAsp: 0.494 ± 0.184
0.618CysGlu: 0.618 ± 0.186
0.124CysPhe: 0.124 ± 0.087
0.556CysGly: 0.556 ± 0.233
0.247CysHis: 0.247 ± 0.111
0.124CysIle: 0.124 ± 0.093
0.247CysLys: 0.247 ± 0.12
0.309CysLeu: 0.309 ± 0.15
0.185CysMet: 0.185 ± 0.114
0.309CysAsn: 0.309 ± 0.125
0.247CysPro: 0.247 ± 0.116
0.247CysGln: 0.247 ± 0.118
0.494CysArg: 0.494 ± 0.191
0.309CysSer: 0.309 ± 0.135
0.247CysThr: 0.247 ± 0.132
0.247CysVal: 0.247 ± 0.116
0.124CysTrp: 0.124 ± 0.081
0.247CysTyr: 0.247 ± 0.136
0.0CysXaa: 0.0 ± 0.0
Asp
6.362AspAla: 6.362 ± 0.74
0.556AspCys: 0.556 ± 0.18
4.941AspAsp: 4.941 ± 0.546
3.83AspGlu: 3.83 ± 0.47
2.718AspPhe: 2.718 ± 0.419
5.621AspGly: 5.621 ± 0.674
1.297AspHis: 1.297 ± 0.286
2.779AspIle: 2.779 ± 0.427
3.027AspLys: 3.027 ± 0.423
6.362AspLeu: 6.362 ± 0.719
1.359AspMet: 1.359 ± 0.255
1.977AspAsn: 1.977 ± 0.38
4.941AspPro: 4.941 ± 0.625
1.606AspGln: 1.606 ± 0.385
4.2AspArg: 4.2 ± 0.467
3.644AspSer: 3.644 ± 0.564
3.706AspThr: 3.706 ± 0.351
5.127AspVal: 5.127 ± 0.539
1.729AspTrp: 1.729 ± 0.314
1.977AspTyr: 1.977 ± 0.328
0.0AspXaa: 0.0 ± 0.0
Glu
6.3GluAla: 6.3 ± 0.695
0.309GluCys: 0.309 ± 0.173
5.127GluAsp: 5.127 ± 0.473
5.806GluGlu: 5.806 ± 0.764
1.729GluPhe: 1.729 ± 0.323
4.2GluGly: 4.2 ± 0.462
1.482GluHis: 1.482 ± 0.312
3.768GluIle: 3.768 ± 0.488
2.532GluLys: 2.532 ± 0.388
7.041GluLeu: 7.041 ± 0.556
1.606GluMet: 1.606 ± 0.3
1.668GluAsn: 1.668 ± 0.325
2.841GluPro: 2.841 ± 0.476
2.718GluGln: 2.718 ± 0.4
3.582GluArg: 3.582 ± 0.578
3.088GluSer: 3.088 ± 0.383
3.706GluThr: 3.706 ± 0.447
5.744GluVal: 5.744 ± 0.548
1.297GluTrp: 1.297 ± 0.348
2.038GluTyr: 2.038 ± 0.412
0.0GluXaa: 0.0 ± 0.0
Phe
2.471PheAla: 2.471 ± 0.51
0.247PheCys: 0.247 ± 0.174
2.594PheAsp: 2.594 ± 0.323
2.285PheGlu: 2.285 ± 0.364
0.494PhePhe: 0.494 ± 0.209
3.397PheGly: 3.397 ± 0.445
0.679PheHis: 0.679 ± 0.219
1.05PheIle: 1.05 ± 0.249
1.421PheLys: 1.421 ± 0.334
2.718PheLeu: 2.718 ± 0.55
0.618PheMet: 0.618 ± 0.191
1.482PheAsn: 1.482 ± 0.361
1.729PhePro: 1.729 ± 0.272
0.988PheGln: 0.988 ± 0.299
2.038PheArg: 2.038 ± 0.367
1.606PheSer: 1.606 ± 0.291
2.1PheThr: 2.1 ± 0.417
2.1PheVal: 2.1 ± 0.392
0.556PheTrp: 0.556 ± 0.167
1.05PheTyr: 1.05 ± 0.256
0.0PheXaa: 0.0 ± 0.0
Gly
6.177GlyAla: 6.177 ± 0.99
0.494GlyCys: 0.494 ± 0.193
5.559GlyAsp: 5.559 ± 0.556
4.694GlyGlu: 4.694 ± 0.523
2.594GlyPhe: 2.594 ± 0.508
7.783GlyGly: 7.783 ± 1.04
1.791GlyHis: 1.791 ± 0.395
4.509GlyIle: 4.509 ± 0.784
3.274GlyLys: 3.274 ± 0.537
7.721GlyLeu: 7.721 ± 0.866
1.977GlyMet: 1.977 ± 0.4
3.274GlyAsn: 3.274 ± 0.483
3.521GlyPro: 3.521 ± 0.612
2.162GlyGln: 2.162 ± 0.343
5.374GlyArg: 5.374 ± 0.653
5.93GlySer: 5.93 ± 0.719
4.818GlyThr: 4.818 ± 0.642
5.683GlyVal: 5.683 ± 0.65
2.656GlyTrp: 2.656 ± 0.412
2.594GlyTyr: 2.594 ± 0.394
0.0GlyXaa: 0.0 ± 0.0
His
1.482HisAla: 1.482 ± 0.321
0.124HisCys: 0.124 ± 0.089
1.112HisAsp: 1.112 ± 0.279
1.791HisGlu: 1.791 ± 0.386
0.741HisPhe: 0.741 ± 0.209
1.297HisGly: 1.297 ± 0.347
0.803HisHis: 0.803 ± 0.245
0.679HisIle: 0.679 ± 0.172
1.112HisLys: 1.112 ± 0.319
1.421HisLeu: 1.421 ± 0.369
0.124HisMet: 0.124 ± 0.146
0.371HisAsn: 0.371 ± 0.141
1.421HisPro: 1.421 ± 0.3
0.741HisGln: 0.741 ± 0.253
1.544HisArg: 1.544 ± 0.364
0.741HisSer: 0.741 ± 0.21
1.112HisThr: 1.112 ± 0.251
1.791HisVal: 1.791 ± 0.399
0.494HisTrp: 0.494 ± 0.154
0.618HisTyr: 0.618 ± 0.232
0.0HisXaa: 0.0 ± 0.0
Ile
6.238IleAla: 6.238 ± 0.65
0.247IleCys: 0.247 ± 0.135
3.582IleAsp: 3.582 ± 0.45
3.582IleGlu: 3.582 ± 0.471
0.865IlePhe: 0.865 ± 0.185
4.262IleGly: 4.262 ± 0.646
0.926IleHis: 0.926 ± 0.231
1.915IleIle: 1.915 ± 0.315
1.729IleLys: 1.729 ± 0.304
3.15IleLeu: 3.15 ± 0.444
0.865IleMet: 0.865 ± 0.222
1.606IleAsn: 1.606 ± 0.305
3.15IlePro: 3.15 ± 0.439
1.482IleGln: 1.482 ± 0.304
3.335IleArg: 3.335 ± 0.475
3.088IleSer: 3.088 ± 0.481
3.397IleThr: 3.397 ± 0.407
2.903IleVal: 2.903 ± 0.542
0.618IleTrp: 0.618 ± 0.169
1.606IleTyr: 1.606 ± 0.277
0.0IleXaa: 0.0 ± 0.0
Lys
3.953LysAla: 3.953 ± 0.636
0.185LysCys: 0.185 ± 0.115
2.409LysAsp: 2.409 ± 0.42
2.285LysGlu: 2.285 ± 0.334
1.544LysPhe: 1.544 ± 0.376
2.779LysGly: 2.779 ± 0.364
1.174LysHis: 1.174 ± 0.309
2.594LysIle: 2.594 ± 0.464
2.409LysLys: 2.409 ± 0.507
3.459LysLeu: 3.459 ± 0.505
1.112LysMet: 1.112 ± 0.245
1.112LysAsn: 1.112 ± 0.255
2.409LysPro: 2.409 ± 0.4
1.977LysGln: 1.977 ± 0.405
2.656LysArg: 2.656 ± 0.511
2.347LysSer: 2.347 ± 0.296
2.347LysThr: 2.347 ± 0.369
3.212LysVal: 3.212 ± 0.542
0.679LysTrp: 0.679 ± 0.202
0.865LysTyr: 0.865 ± 0.274
0.0LysXaa: 0.0 ± 0.0
Leu
9.512LeuAla: 9.512 ± 0.754
0.494LeuCys: 0.494 ± 0.177
6.609LeuAsp: 6.609 ± 0.657
5.435LeuGlu: 5.435 ± 0.624
2.1LeuPhe: 2.1 ± 0.382
6.794LeuGly: 6.794 ± 0.707
1.359LeuHis: 1.359 ± 0.286
4.509LeuIle: 4.509 ± 0.549
4.138LeuLys: 4.138 ± 0.552
6.115LeuLeu: 6.115 ± 0.568
1.668LeuMet: 1.668 ± 0.288
2.779LeuAsn: 2.779 ± 0.364
5.374LeuPro: 5.374 ± 0.56
2.841LeuGln: 2.841 ± 0.512
6.238LeuArg: 6.238 ± 0.679
6.115LeuSer: 6.115 ± 0.602
5.683LeuThr: 5.683 ± 0.453
4.88LeuVal: 4.88 ± 0.718
1.235LeuTrp: 1.235 ± 0.329
2.347LeuTyr: 2.347 ± 0.404
0.0LeuXaa: 0.0 ± 0.0
Met
2.656MetAla: 2.656 ± 0.477
0.062MetCys: 0.062 ± 0.054
1.235MetAsp: 1.235 ± 0.287
1.668MetGlu: 1.668 ± 0.334
0.618MetPhe: 0.618 ± 0.156
1.359MetGly: 1.359 ± 0.301
0.247MetHis: 0.247 ± 0.135
0.556MetIle: 0.556 ± 0.203
1.174MetLys: 1.174 ± 0.261
0.988MetLeu: 0.988 ± 0.211
0.309MetMet: 0.309 ± 0.144
0.988MetAsn: 0.988 ± 0.234
1.05MetPro: 1.05 ± 0.263
0.432MetGln: 0.432 ± 0.168
1.421MetArg: 1.421 ± 0.329
2.532MetSer: 2.532 ± 0.42
2.1MetThr: 2.1 ± 0.295
1.235MetVal: 1.235 ± 0.291
0.185MetTrp: 0.185 ± 0.094
0.494MetTyr: 0.494 ± 0.179
0.0MetXaa: 0.0 ± 0.0
Asn
3.706AsnAla: 3.706 ± 0.519
0.062AsnCys: 0.062 ± 0.058
1.977AsnAsp: 1.977 ± 0.41
1.977AsnGlu: 1.977 ± 0.325
0.988AsnPhe: 0.988 ± 0.269
3.459AsnGly: 3.459 ± 0.468
0.679AsnHis: 0.679 ± 0.2
1.606AsnIle: 1.606 ± 0.397
0.494AsnLys: 0.494 ± 0.262
2.347AsnLeu: 2.347 ± 0.343
0.679AsnMet: 0.679 ± 0.225
0.741AsnAsn: 0.741 ± 0.194
2.471AsnPro: 2.471 ± 0.373
0.988AsnGln: 0.988 ± 0.283
1.482AsnArg: 1.482 ± 0.337
1.421AsnSer: 1.421 ± 0.342
1.977AsnThr: 1.977 ± 0.325
2.594AsnVal: 2.594 ± 0.395
0.803AsnTrp: 0.803 ± 0.198
1.482AsnTyr: 1.482 ± 0.345
0.0AsnXaa: 0.0 ± 0.0
Pro
4.818ProAla: 4.818 ± 0.582
0.432ProCys: 0.432 ± 0.172
4.2ProAsp: 4.2 ± 0.486
4.509ProGlu: 4.509 ± 0.54
2.347ProPhe: 2.347 ± 0.354
4.941ProGly: 4.941 ± 0.58
0.741ProHis: 0.741 ± 0.233
2.162ProIle: 2.162 ± 0.36
2.162ProLys: 2.162 ± 0.333
4.262ProLeu: 4.262 ± 0.573
1.297ProMet: 1.297 ± 0.312
1.668ProAsn: 1.668 ± 0.313
3.274ProPro: 3.274 ± 0.552
1.544ProGln: 1.544 ± 0.307
2.903ProArg: 2.903 ± 0.453
4.324ProSer: 4.324 ± 0.493
3.768ProThr: 3.768 ± 0.575
3.644ProVal: 3.644 ± 0.413
0.803ProTrp: 0.803 ± 0.258
1.482ProTyr: 1.482 ± 0.349
0.0ProXaa: 0.0 ± 0.0
Gln
2.965GlnAla: 2.965 ± 0.484
0.124GlnCys: 0.124 ± 0.086
1.421GlnAsp: 1.421 ± 0.369
1.729GlnGlu: 1.729 ± 0.314
1.235GlnPhe: 1.235 ± 0.272
2.594GlnGly: 2.594 ± 0.329
0.494GlnHis: 0.494 ± 0.162
2.779GlnIle: 2.779 ± 0.474
1.544GlnLys: 1.544 ± 0.389
4.015GlnLeu: 4.015 ± 0.492
0.865GlnMet: 0.865 ± 0.263
0.556GlnAsn: 0.556 ± 0.182
1.791GlnPro: 1.791 ± 0.34
1.977GlnGln: 1.977 ± 0.418
1.791GlnArg: 1.791 ± 0.364
1.853GlnSer: 1.853 ± 0.376
1.791GlnThr: 1.791 ± 0.384
2.224GlnVal: 2.224 ± 0.311
0.556GlnTrp: 0.556 ± 0.14
0.679GlnTyr: 0.679 ± 0.189
0.0GlnXaa: 0.0 ± 0.0
Arg
5.312ArgAla: 5.312 ± 0.624
0.926ArgCys: 0.926 ± 0.331
3.521ArgAsp: 3.521 ± 0.467
4.385ArgGlu: 4.385 ± 0.607
1.915ArgPhe: 1.915 ± 0.368
4.941ArgGly: 4.941 ± 0.676
1.05ArgHis: 1.05 ± 0.234
3.274ArgIle: 3.274 ± 0.494
2.903ArgLys: 2.903 ± 0.578
6.362ArgLeu: 6.362 ± 0.772
1.853ArgMet: 1.853 ± 0.345
2.224ArgAsn: 2.224 ± 0.439
2.594ArgPro: 2.594 ± 0.448
1.915ArgGln: 1.915 ± 0.346
5.868ArgArg: 5.868 ± 0.764
3.768ArgSer: 3.768 ± 0.556
3.274ArgThr: 3.274 ± 0.499
5.435ArgVal: 5.435 ± 0.58
1.297ArgTrp: 1.297 ± 0.292
1.853ArgTyr: 1.853 ± 0.318
0.0ArgXaa: 0.0 ± 0.0
Ser
6.3SerAla: 6.3 ± 0.871
0.432SerCys: 0.432 ± 0.187
3.582SerAsp: 3.582 ± 0.52
4.015SerGlu: 4.015 ± 0.577
2.1SerPhe: 2.1 ± 0.442
5.621SerGly: 5.621 ± 0.635
1.359SerHis: 1.359 ± 0.304
2.841SerIle: 2.841 ± 0.463
2.162SerLys: 2.162 ± 0.372
5.435SerLeu: 5.435 ± 0.498
1.112SerMet: 1.112 ± 0.285
2.594SerAsn: 2.594 ± 0.467
3.335SerPro: 3.335 ± 0.443
1.977SerGln: 1.977 ± 0.286
3.397SerArg: 3.397 ± 0.501
3.274SerSer: 3.274 ± 0.57
3.212SerThr: 3.212 ± 0.465
4.138SerVal: 4.138 ± 0.581
1.235SerTrp: 1.235 ± 0.363
1.297SerTyr: 1.297 ± 0.316
0.0SerXaa: 0.0 ± 0.0
Thr
5.868ThrAla: 5.868 ± 0.6
0.185ThrCys: 0.185 ± 0.114
4.2ThrAsp: 4.2 ± 0.546
4.324ThrGlu: 4.324 ± 0.548
2.409ThrPhe: 2.409 ± 0.377
6.177ThrGly: 6.177 ± 0.601
1.297ThrHis: 1.297 ± 0.326
2.285ThrIle: 2.285 ± 0.539
2.471ThrLys: 2.471 ± 0.321
5.435ThrLeu: 5.435 ± 0.717
0.865ThrMet: 0.865 ± 0.196
1.668ThrAsn: 1.668 ± 0.345
3.83ThrPro: 3.83 ± 0.51
1.544ThrGln: 1.544 ± 0.332
3.274ThrArg: 3.274 ± 0.583
3.274ThrSer: 3.274 ± 0.608
3.83ThrThr: 3.83 ± 0.577
5.93ThrVal: 5.93 ± 0.615
1.235ThrTrp: 1.235 ± 0.262
2.038ThrTyr: 2.038 ± 0.378
0.0ThrXaa: 0.0 ± 0.0
Val
7.35ValAla: 7.35 ± 0.683
0.247ValCys: 0.247 ± 0.128
5.25ValAsp: 5.25 ± 0.541
4.756ValGlu: 4.756 ± 0.535
2.594ValPhe: 2.594 ± 0.346
5.127ValGly: 5.127 ± 0.647
1.235ValHis: 1.235 ± 0.217
3.891ValIle: 3.891 ± 0.403
2.965ValLys: 2.965 ± 0.391
5.435ValLeu: 5.435 ± 0.604
1.112ValMet: 1.112 ± 0.272
3.027ValAsn: 3.027 ± 0.404
4.262ValPro: 4.262 ± 0.473
2.718ValGln: 2.718 ± 0.456
5.065ValArg: 5.065 ± 0.73
4.509ValSer: 4.509 ± 0.404
5.683ValThr: 5.683 ± 0.575
4.694ValVal: 4.694 ± 0.757
1.421ValTrp: 1.421 ± 0.293
2.471ValTyr: 2.471 ± 0.411
0.0ValXaa: 0.0 ± 0.0
Trp
1.668TrpAla: 1.668 ± 0.305
0.247TrpCys: 0.247 ± 0.118
1.606TrpAsp: 1.606 ± 0.317
1.05TrpGlu: 1.05 ± 0.272
0.803TrpPhe: 0.803 ± 0.217
1.668TrpGly: 1.668 ± 0.324
0.371TrpHis: 0.371 ± 0.158
1.235TrpIle: 1.235 ± 0.256
0.432TrpLys: 0.432 ± 0.196
2.038TrpLeu: 2.038 ± 0.342
0.247TrpMet: 0.247 ± 0.153
0.432TrpAsn: 0.432 ± 0.142
0.803TrpPro: 0.803 ± 0.253
0.803TrpGln: 0.803 ± 0.188
1.235TrpArg: 1.235 ± 0.271
0.988TrpSer: 0.988 ± 0.234
1.421TrpThr: 1.421 ± 0.399
1.853TrpVal: 1.853 ± 0.326
0.741TrpTrp: 0.741 ± 0.279
0.247TrpTyr: 0.247 ± 0.117
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.471TyrAla: 2.471 ± 0.403
0.309TyrCys: 0.309 ± 0.145
1.112TyrAsp: 1.112 ± 0.323
2.1TyrGlu: 2.1 ± 0.35
0.679TyrPhe: 0.679 ± 0.194
2.656TyrGly: 2.656 ± 0.431
0.679TyrHis: 0.679 ± 0.217
1.668TyrIle: 1.668 ± 0.316
0.988TyrLys: 0.988 ± 0.253
2.594TyrLeu: 2.594 ± 0.395
0.741TyrMet: 0.741 ± 0.213
1.05TyrAsn: 1.05 ± 0.299
1.359TyrPro: 1.359 ± 0.298
1.235TyrGln: 1.235 ± 0.281
2.532TyrArg: 2.532 ± 0.399
1.482TyrSer: 1.482 ± 0.253
1.853TyrThr: 1.853 ± 0.348
2.1TyrVal: 2.1 ± 0.389
0.371TyrTrp: 0.371 ± 0.147
0.618TyrTyr: 0.618 ± 0.203
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 97 proteins (16191 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski