Amino acid dipeptide frequency for Mycobacterium virus Che9c

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
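The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used by the database: the function names `dipeptide_per_mille` and `classify` are hypothetical, sequences are assumed to be given in one-letter code with `X` standing in for Xaa, and normalising by the total number of overlapping pairs is an assumption about how the frequencies are defined.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; "X" stands in for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Pairs are counted within each protein only, never across protein
    boundaries. Any non-standard letter is mapped to "X" (Xaa).
    Normalising by the total pair count is an assumption of this sketch.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, expected=1000.0 / 400):
    """Label a frequency relative to the uniform expectation (2.5 per mille)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


# Toy input, not taken from the Che9c proteome.
freqs = dipeptide_per_mille(["MAAG", "AAK"])
print(len(freqs))             # 441 dipeptide keys (21 x 21)
print(freqs["AA"])            # 2 of 5 pairs -> 400.0
print(classify(freqs["AA"]))  # over
```

With the table values, `classify(15.916)` (AlaAla) gives "over" and `classify(0.056)` (CysCys) gives "under", which corresponds to the red and blue marking described above.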

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
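As a small worked example of the unit conversion, the helpers below turn a per mille value into a fraction or a percentage. The function names are hypothetical and chosen only for illustration; the AlaAla value is taken from the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 15.916  # AlaAla entry from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # roughly 0.0159, i.e. about 1.6% of all dipeptides
print(per_mille_to_percent(2.5))       # 0.25, the uniform expectation in percent
```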

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± estimated error); rows give the first residue, columns the second.

| | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 15.916 ± 0.914 | 1.173 ± 0.327 | 8.88 ± 0.868 | 9.326 ± 0.912 | 3.072 ± 0.473 | 10.611 ± 1.236 | 1.955 ± 0.308 | 5.305 ± 0.519 | 3.518 ± 0.495 | 9.382 ± 0.637 | 2.513 ± 0.357 | 3.407 ± 0.471 | 6.59 ± 0.56 | 3.965 ± 0.733 | 8.433 ± 0.721 | 6.311 ± 0.7 | 6.311 ± 0.68 | 6.981 ± 0.749 | 2.066 ± 0.343 | 2.737 ± 0.34 | 0.0 ± 0.0 |
| Cys | 0.894 ± 0.217 | 0.056 ± 0.057 | 0.67 ± 0.204 | 0.558 ± 0.2 | 0.0 ± 0.0 | 1.508 ± 0.316 | 0.223 ± 0.119 | 0.503 ± 0.153 | 0.279 ± 0.128 | 0.726 ± 0.216 | 0.223 ± 0.099 | 0.223 ± 0.109 | 1.005 ± 0.266 | 0.391 ± 0.183 | 1.005 ± 0.237 | 0.223 ± 0.115 | 0.726 ± 0.225 | 0.558 ± 0.163 | 0.279 ± 0.128 | 0.279 ± 0.137 | 0.0 ± 0.0 |
| Asp | 8.042 ± 0.78 | 0.503 ± 0.176 | 4.803 ± 0.512 | 5.361 ± 0.67 | 1.62 ± 0.338 | 7.204 ± 0.741 | 1.396 ± 0.295 | 2.066 ± 0.313 | 1.452 ± 0.262 | 6.478 ± 0.617 | 1.005 ± 0.227 | 1.955 ± 0.386 | 4.356 ± 0.381 | 2.01 ± 0.355 | 3.742 ± 0.422 | 3.072 ± 0.415 | 3.239 ± 0.393 | 4.524 ± 0.522 | 1.452 ± 0.226 | 1.173 ± 0.267 | 0.0 ± 0.0 |
| Glu | 6.646 ± 0.687 | 1.173 ± 0.293 | 2.513 ± 0.351 | 2.737 ± 0.495 | 2.681 ± 0.368 | 3.407 ± 0.467 | 1.787 ± 0.354 | 2.569 ± 0.46 | 2.122 ± 0.345 | 6.087 ± 0.596 | 1.34 ± 0.238 | 1.843 ± 0.33 | 3.63 ± 0.432 | 2.178 ± 0.381 | 4.244 ± 0.45 | 3.183 ± 0.527 | 3.686 ± 0.387 | 4.412 ± 0.519 | 1.173 ± 0.234 | 1.452 ± 0.278 | 0.0 ± 0.0 |
| Phe | 3.463 ± 0.466 | 0.391 ± 0.143 | 2.066 ± 0.375 | 1.508 ± 0.277 | 0.447 ± 0.149 | 2.904 ± 0.386 | 0.447 ± 0.144 | 1.731 ± 0.322 | 1.005 ± 0.245 | 2.01 ± 0.37 | 0.558 ± 0.191 | 0.894 ± 0.206 | 1.005 ± 0.264 | 0.726 ± 0.222 | 2.569 ± 0.377 | 1.396 ± 0.323 | 2.01 ± 0.328 | 2.122 ± 0.296 | 0.726 ± 0.201 | 0.726 ± 0.208 | 0.0 ± 0.0 |
| Gly | 9.047 ± 1.087 | 0.726 ± 0.272 | 4.691 ± 0.491 | 3.909 ± 0.562 | 3.072 ± 0.484 | 9.159 ± 1.6 | 2.01 ± 0.358 | 4.747 ± 0.659 | 3.127 ± 0.527 | 6.869 ± 0.747 | 1.787 ± 0.292 | 2.96 ± 0.395 | 4.468 ± 0.543 | 3.183 ± 0.717 | 6.087 ± 0.581 | 4.579 ± 0.554 | 5.752 ± 0.608 | 5.976 ± 0.529 | 2.29 ± 0.394 | 2.904 ± 0.477 | 0.0 ± 0.0 |
| His | 2.513 ± 0.421 | 0.168 ± 0.086 | 1.229 ± 0.259 | 0.949 ± 0.228 | 0.503 ± 0.208 | 1.396 ± 0.27 | 0.558 ± 0.233 | 0.838 ± 0.225 | 0.838 ± 0.256 | 1.731 ± 0.343 | 0.391 ± 0.129 | 0.949 ± 0.229 | 1.899 ± 0.319 | 1.173 ± 0.294 | 1.787 ± 0.42 | 0.782 ± 0.223 | 1.061 ± 0.281 | 1.787 ± 0.317 | 0.503 ± 0.189 | 0.726 ± 0.216 | 0.0 ± 0.0 |
| Ile | 5.026 ± 0.597 | 0.335 ± 0.139 | 4.133 ± 0.532 | 3.127 ± 0.44 | 1.229 ± 0.28 | 4.021 ± 0.624 | 0.894 ± 0.217 | 1.396 ± 0.302 | 1.229 ± 0.222 | 2.346 ± 0.342 | 0.614 ± 0.184 | 1.62 ± 0.339 | 2.96 ± 0.431 | 1.675 ± 0.282 | 2.625 ± 0.356 | 2.457 ± 0.398 | 2.737 ± 0.404 | 3.518 ± 0.434 | 0.503 ± 0.175 | 0.782 ± 0.21 | 0.0 ± 0.0 |
| Lys | 3.853 ± 0.579 | 0.447 ± 0.228 | 1.675 ± 0.356 | 1.117 ± 0.223 | 0.949 ± 0.249 | 2.96 ± 0.377 | 0.782 ± 0.274 | 1.173 ± 0.273 | 1.508 ± 0.312 | 3.127 ± 0.402 | 0.558 ± 0.167 | 0.67 ± 0.183 | 2.401 ± 0.428 | 1.005 ± 0.266 | 2.904 ± 0.523 | 1.229 ± 0.343 | 2.01 ± 0.423 | 2.01 ± 0.331 | 0.67 ± 0.184 | 0.894 ± 0.24 | 0.0 ± 0.0 |
| Leu | 10.052 ± 0.742 | 0.726 ± 0.207 | 4.803 ± 0.581 | 3.965 ± 0.376 | 2.401 ± 0.368 | 7.148 ± 0.531 | 1.787 ± 0.371 | 3.351 ± 0.436 | 2.625 ± 0.541 | 6.702 ± 0.527 | 1.955 ± 0.337 | 3.127 ± 0.443 | 5.92 ± 0.665 | 2.457 ± 0.353 | 5.585 ± 0.603 | 3.909 ± 0.455 | 5.585 ± 0.418 | 5.641 ± 0.496 | 1.396 ± 0.32 | 1.396 ± 0.338 | 0.0 ± 0.0 |
| Met | 2.457 ± 0.344 | 0.168 ± 0.105 | 0.838 ± 0.192 | 1.117 ± 0.295 | 0.391 ± 0.141 | 1.564 ± 0.305 | 0.614 ± 0.196 | 0.726 ± 0.191 | 0.894 ± 0.229 | 1.173 ± 0.328 | 0.279 ± 0.123 | 0.782 ± 0.212 | 1.675 ± 0.308 | 0.726 ± 0.168 | 1.843 ± 0.367 | 2.457 ± 0.417 | 2.01 ± 0.269 | 0.894 ± 0.203 | 0.223 ± 0.107 | 0.168 ± 0.093 | 0.0 ± 0.0 |
| Asn | 3.909 ± 0.448 | 0.223 ± 0.116 | 1.843 ± 0.318 | 1.452 ± 0.276 | 0.67 ± 0.187 | 3.351 ± 0.442 | 1.005 ± 0.231 | 1.396 ± 0.303 | 0.614 ± 0.186 | 2.457 ± 0.449 | 0.391 ± 0.152 | 0.949 ± 0.238 | 3.295 ± 0.428 | 1.117 ± 0.243 | 1.843 ± 0.31 | 1.675 ± 0.3 | 1.005 ± 0.213 | 1.899 ± 0.273 | 0.782 ± 0.164 | 0.503 ± 0.177 | 0.0 ± 0.0 |
| Pro | 7.763 ± 0.808 | 0.223 ± 0.122 | 5.417 ± 0.604 | 4.468 ± 0.423 | 2.01 ± 0.276 | 6.143 ± 0.763 | 0.838 ± 0.21 | 2.234 ± 0.34 | 2.01 ± 0.409 | 4.579 ± 0.521 | 1.396 ± 0.298 | 1.62 ± 0.244 | 3.965 ± 0.533 | 2.29 ± 0.336 | 3.798 ± 0.578 | 3.798 ± 0.462 | 5.082 ± 0.687 | 4.524 ± 0.47 | 1.564 ± 0.334 | 0.949 ± 0.21 | 0.0 ± 0.0 |
| Gln | 4.859 ± 0.599 | 0.447 ± 0.168 | 1.34 ± 0.263 | 1.452 ± 0.277 | 0.782 ± 0.161 | 2.569 ± 0.46 | 0.838 ± 0.231 | 1.787 ± 0.233 | 1.117 ± 0.21 | 3.518 ± 0.46 | 0.67 ± 0.173 | 1.117 ± 0.194 | 2.29 ± 0.352 | 1.899 ± 0.472 | 3.965 ± 0.432 | 1.899 ± 0.306 | 1.564 ± 0.347 | 2.513 ± 0.436 | 0.726 ± 0.217 | 0.838 ± 0.231 | 0.0 ± 0.0 |
| Arg | 7.595 ± 0.852 | 0.726 ± 0.237 | 4.97 ± 0.576 | 3.686 ± 0.507 | 2.29 ± 0.342 | 5.473 ± 0.52 | 2.122 ± 0.449 | 3.127 ± 0.373 | 2.681 ± 0.308 | 6.143 ± 0.589 | 2.457 ± 0.353 | 2.457 ± 0.342 | 3.798 ± 0.415 | 2.625 ± 0.411 | 6.813 ± 0.693 | 3.518 ± 0.464 | 3.909 ± 0.46 | 4.915 ± 0.605 | 1.731 ± 0.328 | 1.955 ± 0.372 | 0.0 ± 0.0 |
| Ser | 5.92 ± 0.786 | 0.503 ± 0.153 | 4.133 ± 0.383 | 2.737 ± 0.398 | 1.284 ± 0.324 | 4.468 ± 0.583 | 0.949 ± 0.238 | 2.457 ± 0.364 | 1.731 ± 0.25 | 3.853 ± 0.522 | 1.675 ± 0.271 | 1.284 ± 0.246 | 3.686 ± 0.499 | 2.29 ± 0.418 | 2.96 ± 0.471 | 3.072 ± 0.46 | 3.742 ± 0.401 | 3.909 ± 0.509 | 1.173 ± 0.251 | 1.005 ± 0.266 | 0.0 ± 0.0 |
| Thr | 7.986 ± 0.757 | 0.503 ± 0.222 | 3.742 ± 0.531 | 3.463 ± 0.434 | 1.843 ± 0.389 | 4.412 ± 0.568 | 1.117 ± 0.213 | 2.681 ± 0.439 | 2.569 ± 0.329 | 5.417 ± 0.532 | 1.005 ± 0.274 | 1.675 ± 0.309 | 4.859 ± 0.469 | 2.178 ± 0.335 | 3.742 ± 0.49 | 2.904 ± 0.394 | 3.742 ± 0.496 | 4.859 ± 0.546 | 1.452 ± 0.279 | 1.843 ± 0.344 | 0.0 ± 0.0 |
| Val | 7.651 ± 0.686 | 1.005 ± 0.232 | 5.473 ± 0.588 | 4.747 ± 0.475 | 2.178 ± 0.296 | 5.696 ± 0.539 | 1.452 ± 0.275 | 3.072 ± 0.422 | 1.62 ± 0.319 | 4.133 ± 0.432 | 1.508 ± 0.28 | 1.899 ± 0.337 | 4.244 ± 0.424 | 2.178 ± 0.32 | 5.696 ± 0.647 | 3.965 ± 0.491 | 4.747 ± 0.528 | 5.417 ± 0.676 | 1.117 ± 0.222 | 1.564 ± 0.267 | 0.0 ± 0.0 |
| Trp | 2.122 ± 0.312 | 0.558 ± 0.172 | 0.949 ± 0.226 | 1.061 ± 0.246 | 0.726 ± 0.257 | 1.396 ± 0.305 | 0.558 ± 0.147 | 0.949 ± 0.261 | 0.447 ± 0.144 | 2.066 ± 0.293 | 0.503 ± 0.163 | 0.391 ± 0.142 | 0.949 ± 0.235 | 1.117 ± 0.299 | 1.731 ± 0.356 | 1.396 ± 0.331 | 1.508 ± 0.266 | 1.173 ± 0.246 | 0.503 ± 0.159 | 0.558 ± 0.159 | 0.0 ± 0.0 |
| Tyr | 2.737 ± 0.334 | 0.279 ± 0.121 | 1.284 ± 0.292 | 2.122 ± 0.351 | 0.558 ± 0.175 | 2.01 ± 0.308 | 0.447 ± 0.146 | 1.117 ± 0.238 | 0.558 ± 0.238 | 1.731 ± 0.319 | 0.112 ± 0.083 | 0.558 ± 0.212 | 1.675 ± 0.286 | 1.005 ± 0.227 | 1.62 ± 0.308 | 1.005 ± 0.248 | 1.564 ± 0.359 | 1.731 ± 0.325 | 0.335 ± 0.134 | 0.838 ± 0.228 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 84 proteins (17907 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
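A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the database's implementation: the function names are hypothetical, whole proteins are resampled with replacement, and the error is taken to be the standard deviation across replicates, which the note above does not confirm.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping residue pairs within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) proteins with replacement and recomputes
    the frequency on that resampled proteome. Using the standard deviation
    of the replicates as the error is an assumption of this sketch.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy sequences, not taken from the Che9c proteome.
toy_proteins = ["MAAG", "AAK", "MKLV"]
mean_aa, err_aa = bootstrap_error(toy_proteins, "AA")
print(f"AA: {mean_aa:.3f} ± {err_aa:.3f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so pairs are never formed across protein boundaries in a replicate.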

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski