Amino acid dipepetide frequency for Mycobacterium virus Billknuckles

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.003AlaAla: 13.003 ± 1.456
0.565AlaCys: 0.565 ± 0.2
6.658AlaAsp: 6.658 ± 0.72
7.475AlaGlu: 7.475 ± 0.749
3.58AlaPhe: 3.58 ± 0.49
8.417AlaGly: 8.417 ± 0.94
1.382AlaHis: 1.382 ± 0.338
4.585AlaIle: 4.585 ± 0.715
4.02AlaLys: 4.02 ± 0.458
8.543AlaLeu: 8.543 ± 0.715
2.513AlaMet: 2.513 ± 0.398
2.387AlaAsn: 2.387 ± 0.377
4.962AlaPro: 4.962 ± 0.553
2.701AlaGln: 2.701 ± 0.397
5.905AlaArg: 5.905 ± 0.606
5.214AlaSer: 5.214 ± 0.635
5.967AlaThr: 5.967 ± 0.574
7.349AlaVal: 7.349 ± 0.759
1.57AlaTrp: 1.57 ± 0.272
2.764AlaTyr: 2.764 ± 0.361
0.0AlaXaa: 0.0 ± 0.0
Cys
0.754CysAla: 0.754 ± 0.269
0.0CysCys: 0.0 ± 0.0
0.691CysAsp: 0.691 ± 0.198
0.754CysGlu: 0.754 ± 0.209
0.251CysPhe: 0.251 ± 0.119
0.565CysGly: 0.565 ± 0.254
0.251CysHis: 0.251 ± 0.13
0.44CysIle: 0.44 ± 0.175
0.314CysLys: 0.314 ± 0.192
0.44CysLeu: 0.44 ± 0.195
0.063CysMet: 0.063 ± 0.071
0.251CysAsn: 0.251 ± 0.13
0.251CysPro: 0.251 ± 0.12
0.251CysGln: 0.251 ± 0.138
0.565CysArg: 0.565 ± 0.228
0.314CysSer: 0.314 ± 0.141
0.314CysThr: 0.314 ± 0.136
0.377CysVal: 0.377 ± 0.157
0.126CysTrp: 0.126 ± 0.077
0.188CysTyr: 0.188 ± 0.112
0.0CysXaa: 0.0 ± 0.0
Asp
6.533AspAla: 6.533 ± 0.753
0.44AspCys: 0.44 ± 0.191
4.146AspAsp: 4.146 ± 0.457
3.392AspGlu: 3.392 ± 0.469
2.45AspPhe: 2.45 ± 0.375
6.595AspGly: 6.595 ± 0.667
1.256AspHis: 1.256 ± 0.317
2.701AspIle: 2.701 ± 0.385
2.513AspLys: 2.513 ± 0.554
5.905AspLeu: 5.905 ± 0.725
1.256AspMet: 1.256 ± 0.238
2.01AspAsn: 2.01 ± 0.321
5.088AspPro: 5.088 ± 0.649
1.759AspGln: 1.759 ± 0.451
3.58AspArg: 3.58 ± 0.41
3.015AspSer: 3.015 ± 0.479
3.894AspThr: 3.894 ± 0.423
4.146AspVal: 4.146 ± 0.545
1.759AspTrp: 1.759 ± 0.328
2.198AspTyr: 2.198 ± 0.301
0.0AspXaa: 0.0 ± 0.0
Glu
6.093GluAla: 6.093 ± 0.733
0.503GluCys: 0.503 ± 0.236
4.397GluAsp: 4.397 ± 0.557
4.962GluGlu: 4.962 ± 0.676
1.884GluPhe: 1.884 ± 0.371
4.209GluGly: 4.209 ± 0.545
1.445GluHis: 1.445 ± 0.294
3.204GluIle: 3.204 ± 0.401
2.764GluLys: 2.764 ± 0.369
6.91GluLeu: 6.91 ± 0.657
1.57GluMet: 1.57 ± 0.322
1.947GluAsn: 1.947 ± 0.326
2.387GluPro: 2.387 ± 0.419
2.575GluGln: 2.575 ± 0.414
3.643GluArg: 3.643 ± 0.565
3.455GluSer: 3.455 ± 0.484
3.769GluThr: 3.769 ± 0.436
5.465GluVal: 5.465 ± 0.588
1.633GluTrp: 1.633 ± 0.365
2.575GluTyr: 2.575 ± 0.469
0.0GluXaa: 0.0 ± 0.0
Phe
2.324PheAla: 2.324 ± 0.333
0.377PheCys: 0.377 ± 0.197
2.889PheAsp: 2.889 ± 0.391
1.822PheGlu: 1.822 ± 0.255
0.503PhePhe: 0.503 ± 0.178
3.455PheGly: 3.455 ± 0.547
0.817PheHis: 0.817 ± 0.263
1.382PheIle: 1.382 ± 0.282
1.508PheLys: 1.508 ± 0.273
2.45PheLeu: 2.45 ± 0.375
0.628PheMet: 0.628 ± 0.221
0.754PheAsn: 0.754 ± 0.22
1.319PhePro: 1.319 ± 0.293
1.068PheGln: 1.068 ± 0.245
1.884PheArg: 1.884 ± 0.347
2.324PheSer: 2.324 ± 0.496
2.198PheThr: 2.198 ± 0.335
2.45PheVal: 2.45 ± 0.416
0.628PheTrp: 0.628 ± 0.187
0.879PheTyr: 0.879 ± 0.245
0.0PheXaa: 0.0 ± 0.0
Gly
7.663GlyAla: 7.663 ± 1.289
0.628GlyCys: 0.628 ± 0.21
6.219GlyAsp: 6.219 ± 0.456
5.151GlyGlu: 5.151 ± 0.552
2.952GlyPhe: 2.952 ± 0.549
10.302GlyGly: 10.302 ± 2.801
2.387GlyHis: 2.387 ± 0.477
4.585GlyIle: 4.585 ± 0.774
3.769GlyLys: 3.769 ± 0.452
7.852GlyLeu: 7.852 ± 0.917
1.822GlyMet: 1.822 ± 0.279
3.329GlyAsn: 3.329 ± 0.487
3.58GlyPro: 3.58 ± 0.595
2.638GlyGln: 2.638 ± 0.378
4.334GlyArg: 4.334 ± 0.472
5.967GlySer: 5.967 ± 0.866
5.402GlyThr: 5.402 ± 0.766
5.088GlyVal: 5.088 ± 0.468
1.947GlyTrp: 1.947 ± 0.347
3.015GlyTyr: 3.015 ± 0.375
0.0GlyXaa: 0.0 ± 0.0
His
1.759HisAla: 1.759 ± 0.367
0.188HisCys: 0.188 ± 0.17
1.193HisAsp: 1.193 ± 0.224
1.759HisGlu: 1.759 ± 0.419
0.817HisPhe: 0.817 ± 0.218
2.136HisGly: 2.136 ± 0.429
0.691HisHis: 0.691 ± 0.193
0.817HisIle: 0.817 ± 0.194
1.005HisLys: 1.005 ± 0.299
1.319HisLeu: 1.319 ± 0.321
0.251HisMet: 0.251 ± 0.158
0.565HisAsn: 0.565 ± 0.215
1.256HisPro: 1.256 ± 0.257
0.754HisGln: 0.754 ± 0.214
1.696HisArg: 1.696 ± 0.331
0.754HisSer: 0.754 ± 0.216
1.256HisThr: 1.256 ± 0.251
1.822HisVal: 1.822 ± 0.376
0.565HisTrp: 0.565 ± 0.178
0.628HisTyr: 0.628 ± 0.219
0.0HisXaa: 0.0 ± 0.0
Ile
6.47IleAla: 6.47 ± 0.841
0.377IleCys: 0.377 ± 0.137
3.392IleAsp: 3.392 ± 0.331
3.329IleGlu: 3.329 ± 0.482
1.005IlePhe: 1.005 ± 0.273
3.392IleGly: 3.392 ± 0.536
1.068IleHis: 1.068 ± 0.282
1.759IleIle: 1.759 ± 0.298
1.57IleLys: 1.57 ± 0.334
4.083IleLeu: 4.083 ± 0.502
0.942IleMet: 0.942 ± 0.214
1.759IleAsn: 1.759 ± 0.291
3.769IlePro: 3.769 ± 0.486
1.633IleGln: 1.633 ± 0.383
3.643IleArg: 3.643 ± 0.425
3.141IleSer: 3.141 ± 0.489
3.455IleThr: 3.455 ± 0.425
2.701IleVal: 2.701 ± 0.51
0.691IleTrp: 0.691 ± 0.193
1.696IleTyr: 1.696 ± 0.285
0.0IleXaa: 0.0 ± 0.0
Lys
3.329LysAla: 3.329 ± 0.578
0.377LysCys: 0.377 ± 0.17
2.764LysAsp: 2.764 ± 0.479
2.01LysGlu: 2.01 ± 0.365
1.508LysPhe: 1.508 ± 0.344
2.638LysGly: 2.638 ± 0.39
1.256LysHis: 1.256 ± 0.382
2.701LysIle: 2.701 ± 0.419
2.261LysLys: 2.261 ± 0.399
3.392LysLeu: 3.392 ± 0.528
0.879LysMet: 0.879 ± 0.202
1.633LysAsn: 1.633 ± 0.315
2.701LysPro: 2.701 ± 0.446
1.382LysGln: 1.382 ± 0.304
2.764LysArg: 2.764 ± 0.502
2.45LysSer: 2.45 ± 0.4
2.073LysThr: 2.073 ± 0.431
3.329LysVal: 3.329 ± 0.414
0.942LysTrp: 0.942 ± 0.22
1.256LysTyr: 1.256 ± 0.288
0.0LysXaa: 0.0 ± 0.0
Leu
9.296LeuAla: 9.296 ± 0.823
0.251LeuCys: 0.251 ± 0.138
5.967LeuAsp: 5.967 ± 0.725
5.653LeuGlu: 5.653 ± 0.648
2.136LeuPhe: 2.136 ± 0.434
7.098LeuGly: 7.098 ± 0.876
1.57LeuHis: 1.57 ± 0.322
5.339LeuIle: 5.339 ± 0.61
3.329LeuLys: 3.329 ± 0.528
5.276LeuLeu: 5.276 ± 0.47
1.445LeuMet: 1.445 ± 0.323
3.078LeuAsn: 3.078 ± 0.397
5.025LeuPro: 5.025 ± 0.644
2.387LeuGln: 2.387 ± 0.44
5.842LeuArg: 5.842 ± 0.621
5.151LeuSer: 5.151 ± 0.486
5.528LeuThr: 5.528 ± 0.604
5.276LeuVal: 5.276 ± 0.703
1.131LeuTrp: 1.131 ± 0.332
2.261LeuTyr: 2.261 ± 0.441
0.0LeuXaa: 0.0 ± 0.0
Met
2.701MetAla: 2.701 ± 0.419
0.0MetCys: 0.0 ± 0.0
1.256MetAsp: 1.256 ± 0.277
1.445MetGlu: 1.445 ± 0.298
0.503MetPhe: 0.503 ± 0.189
1.508MetGly: 1.508 ± 0.317
0.44MetHis: 0.44 ± 0.21
0.503MetIle: 0.503 ± 0.187
1.256MetLys: 1.256 ± 0.252
1.319MetLeu: 1.319 ± 0.33
0.44MetMet: 0.44 ± 0.17
0.817MetAsn: 0.817 ± 0.206
1.068MetPro: 1.068 ± 0.256
0.691MetGln: 0.691 ± 0.206
1.319MetArg: 1.319 ± 0.311
1.759MetSer: 1.759 ± 0.313
2.073MetThr: 2.073 ± 0.333
1.256MetVal: 1.256 ± 0.273
0.126MetTrp: 0.126 ± 0.081
0.44MetTyr: 0.44 ± 0.187
0.0MetXaa: 0.0 ± 0.0
Asn
3.832AsnAla: 3.832 ± 0.459
0.063AsnCys: 0.063 ± 0.057
1.696AsnAsp: 1.696 ± 0.331
1.696AsnGlu: 1.696 ± 0.319
1.005AsnPhe: 1.005 ± 0.255
3.518AsnGly: 3.518 ± 0.501
0.691AsnHis: 0.691 ± 0.22
1.445AsnIle: 1.445 ± 0.345
0.817AsnLys: 0.817 ± 0.227
2.638AsnLeu: 2.638 ± 0.364
0.817AsnMet: 0.817 ± 0.176
0.942AsnAsn: 0.942 ± 0.201
2.701AsnPro: 2.701 ± 0.423
1.005AsnGln: 1.005 ± 0.254
1.822AsnArg: 1.822 ± 0.406
2.136AsnSer: 2.136 ± 0.447
1.193AsnThr: 1.193 ± 0.257
2.073AsnVal: 2.073 ± 0.409
0.565AsnTrp: 0.565 ± 0.173
1.382AsnTyr: 1.382 ± 0.274
0.0AsnXaa: 0.0 ± 0.0
Pro
4.774ProAla: 4.774 ± 0.476
0.377ProCys: 0.377 ± 0.166
3.643ProAsp: 3.643 ± 0.476
4.774ProGlu: 4.774 ± 0.641
2.01ProPhe: 2.01 ± 0.367
4.899ProGly: 4.899 ± 0.681
1.068ProHis: 1.068 ± 0.259
2.261ProIle: 2.261 ± 0.356
1.947ProLys: 1.947 ± 0.318
3.643ProLeu: 3.643 ± 0.49
0.754ProMet: 0.754 ± 0.219
1.822ProAsn: 1.822 ± 0.337
2.889ProPro: 2.889 ± 0.456
1.759ProGln: 1.759 ± 0.368
2.701ProArg: 2.701 ± 0.477
3.957ProSer: 3.957 ± 0.469
4.271ProThr: 4.271 ± 0.564
4.146ProVal: 4.146 ± 0.464
0.691ProTrp: 0.691 ± 0.252
1.445ProTyr: 1.445 ± 0.316
0.0ProXaa: 0.0 ± 0.0
Gln
2.764GlnAla: 2.764 ± 0.466
0.126GlnCys: 0.126 ± 0.087
1.256GlnAsp: 1.256 ± 0.285
1.445GlnGlu: 1.445 ± 0.258
1.131GlnPhe: 1.131 ± 0.205
2.638GlnGly: 2.638 ± 0.349
0.377GlnHis: 0.377 ± 0.126
3.015GlnIle: 3.015 ± 0.507
1.256GlnLys: 1.256 ± 0.32
3.643GlnLeu: 3.643 ± 0.486
0.817GlnMet: 0.817 ± 0.24
0.691GlnAsn: 0.691 ± 0.188
1.319GlnPro: 1.319 ± 0.257
1.822GlnGln: 1.822 ± 0.384
2.198GlnArg: 2.198 ± 0.442
1.759GlnSer: 1.759 ± 0.313
2.01GlnThr: 2.01 ± 0.314
2.575GlnVal: 2.575 ± 0.402
0.879GlnTrp: 0.879 ± 0.193
0.942GlnTyr: 0.942 ± 0.21
0.0GlnXaa: 0.0 ± 0.0
Arg
4.837ArgAla: 4.837 ± 0.631
0.942ArgCys: 0.942 ± 0.321
2.827ArgAsp: 2.827 ± 0.439
4.46ArgGlu: 4.46 ± 0.659
1.884ArgPhe: 1.884 ± 0.421
4.962ArgGly: 4.962 ± 0.583
1.319ArgHis: 1.319 ± 0.327
2.701ArgIle: 2.701 ± 0.369
3.58ArgLys: 3.58 ± 0.542
5.276ArgLeu: 5.276 ± 0.657
1.759ArgMet: 1.759 ± 0.365
1.696ArgAsn: 1.696 ± 0.414
2.827ArgPro: 2.827 ± 0.476
1.884ArgGln: 1.884 ± 0.332
5.528ArgArg: 5.528 ± 0.889
3.706ArgSer: 3.706 ± 0.511
3.455ArgThr: 3.455 ± 0.584
5.088ArgVal: 5.088 ± 0.588
1.131ArgTrp: 1.131 ± 0.264
2.01ArgTyr: 2.01 ± 0.394
0.0ArgXaa: 0.0 ± 0.0
Ser
7.035SerAla: 7.035 ± 0.847
0.503SerCys: 0.503 ± 0.189
3.204SerAsp: 3.204 ± 0.403
3.832SerGlu: 3.832 ± 0.479
2.198SerPhe: 2.198 ± 0.406
6.344SerGly: 6.344 ± 0.742
1.382SerHis: 1.382 ± 0.319
3.015SerIle: 3.015 ± 0.508
2.45SerLys: 2.45 ± 0.447
5.025SerLeu: 5.025 ± 0.617
1.319SerMet: 1.319 ± 0.264
2.324SerAsn: 2.324 ± 0.416
2.827SerPro: 2.827 ± 0.486
2.01SerGln: 2.01 ± 0.302
2.701SerArg: 2.701 ± 0.417
3.266SerSer: 3.266 ± 0.63
3.518SerThr: 3.518 ± 0.539
4.02SerVal: 4.02 ± 0.487
1.696SerTrp: 1.696 ± 0.35
1.256SerTyr: 1.256 ± 0.317
0.0SerXaa: 0.0 ± 0.0
Thr
6.281ThrAla: 6.281 ± 0.663
0.377ThrCys: 0.377 ± 0.187
3.706ThrAsp: 3.706 ± 0.472
3.894ThrGlu: 3.894 ± 0.508
2.01ThrPhe: 2.01 ± 0.401
6.784ThrGly: 6.784 ± 0.673
0.942ThrHis: 0.942 ± 0.266
3.141ThrIle: 3.141 ± 0.551
2.889ThrLys: 2.889 ± 0.454
5.779ThrLeu: 5.779 ± 0.548
1.193ThrMet: 1.193 ± 0.228
1.696ThrAsn: 1.696 ± 0.358
4.146ThrPro: 4.146 ± 0.473
1.947ThrGln: 1.947 ± 0.385
2.889ThrArg: 2.889 ± 0.414
3.518ThrSer: 3.518 ± 0.505
4.271ThrThr: 4.271 ± 0.579
5.528ThrVal: 5.528 ± 0.603
1.256ThrTrp: 1.256 ± 0.299
2.324ThrTyr: 2.324 ± 0.33
0.0ThrXaa: 0.0 ± 0.0
Val
6.658ValAla: 6.658 ± 0.6
0.503ValCys: 0.503 ± 0.175
5.842ValAsp: 5.842 ± 0.623
4.271ValGlu: 4.271 ± 0.482
2.198ValPhe: 2.198 ± 0.328
4.648ValGly: 4.648 ± 0.755
1.822ValHis: 1.822 ± 0.333
3.643ValIle: 3.643 ± 0.53
2.827ValLys: 2.827 ± 0.449
5.025ValLeu: 5.025 ± 0.575
1.319ValMet: 1.319 ± 0.334
2.638ValAsn: 2.638 ± 0.398
3.706ValPro: 3.706 ± 0.436
2.324ValGln: 2.324 ± 0.408
5.025ValArg: 5.025 ± 0.682
5.402ValSer: 5.402 ± 0.535
5.59ValThr: 5.59 ± 0.564
5.088ValVal: 5.088 ± 0.704
1.005ValTrp: 1.005 ± 0.249
2.073ValTyr: 2.073 ± 0.319
0.0ValXaa: 0.0 ± 0.0
Trp
1.256TrpAla: 1.256 ± 0.287
0.188TrpCys: 0.188 ± 0.096
1.445TrpAsp: 1.445 ± 0.275
1.068TrpGlu: 1.068 ± 0.253
0.942TrpPhe: 0.942 ± 0.257
1.696TrpGly: 1.696 ± 0.296
0.314TrpHis: 0.314 ± 0.128
1.068TrpIle: 1.068 ± 0.206
0.314TrpLys: 0.314 ± 0.158
1.759TrpLeu: 1.759 ± 0.343
0.377TrpMet: 0.377 ± 0.146
0.503TrpAsn: 0.503 ± 0.231
0.565TrpPro: 0.565 ± 0.215
0.942TrpGln: 0.942 ± 0.197
1.382TrpArg: 1.382 ± 0.376
1.068TrpSer: 1.068 ± 0.246
1.696TrpThr: 1.696 ± 0.32
1.884TrpVal: 1.884 ± 0.292
0.503TrpTrp: 0.503 ± 0.2
0.314TrpTyr: 0.314 ± 0.132
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.198TyrAla: 2.198 ± 0.406
0.377TyrCys: 0.377 ± 0.187
1.445TyrAsp: 1.445 ± 0.308
2.198TyrGlu: 2.198 ± 0.302
0.691TyrPhe: 0.691 ± 0.192
2.827TyrGly: 2.827 ± 0.37
0.754TyrHis: 0.754 ± 0.236
1.57TyrIle: 1.57 ± 0.298
1.256TyrLys: 1.256 ± 0.277
2.764TyrLeu: 2.764 ± 0.405
0.691TyrMet: 0.691 ± 0.183
1.193TyrAsn: 1.193 ± 0.285
1.445TyrPro: 1.445 ± 0.302
1.131TyrGln: 1.131 ± 0.273
2.513TyrArg: 2.513 ± 0.351
1.57TyrSer: 1.57 ± 0.289
2.638TyrThr: 2.638 ± 0.352
1.947TyrVal: 1.947 ± 0.365
0.44TyrTrp: 0.44 ± 0.167
0.817TyrTyr: 0.817 ± 0.224
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (15921 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski