Amino acid dipepetide frequency for Mycobacterium virus Drago

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.279AlaAla: 14.279 ± 1.8
1.204AlaCys: 1.204 ± 0.284
6.537AlaAsp: 6.537 ± 0.709
7.455AlaGlu: 7.455 ± 0.724
2.695AlaPhe: 2.695 ± 0.42
9.519AlaGly: 9.519 ± 1.193
2.581AlaHis: 2.581 ± 0.42
4.186AlaIle: 4.186 ± 0.512
4.072AlaLys: 4.072 ± 0.429
7.57AlaLeu: 7.57 ± 0.681
2.982AlaMet: 2.982 ± 0.48
3.326AlaAsn: 3.326 ± 0.622
4.989AlaPro: 4.989 ± 0.687
3.613AlaGln: 3.613 ± 0.496
7.226AlaArg: 7.226 ± 0.764
5.448AlaSer: 5.448 ± 0.703
6.767AlaThr: 6.767 ± 0.645
7.168AlaVal: 7.168 ± 0.65
2.523AlaTrp: 2.523 ± 0.405
2.236AlaTyr: 2.236 ± 0.337
0.0AlaXaa: 0.0 ± 0.0
Cys
0.918CysAla: 0.918 ± 0.319
0.115CysCys: 0.115 ± 0.074
1.032CysAsp: 1.032 ± 0.261
0.688CysGlu: 0.688 ± 0.23
0.287CysPhe: 0.287 ± 0.144
1.663CysGly: 1.663 ± 0.329
0.172CysHis: 0.172 ± 0.089
0.229CysIle: 0.229 ± 0.112
0.401CysLys: 0.401 ± 0.151
0.803CysLeu: 0.803 ± 0.236
0.115CysMet: 0.115 ± 0.077
0.516CysAsn: 0.516 ± 0.165
1.319CysPro: 1.319 ± 0.257
0.573CysGln: 0.573 ± 0.212
0.918CysArg: 0.918 ± 0.286
0.803CysSer: 0.803 ± 0.29
0.918CysThr: 0.918 ± 0.246
0.631CysVal: 0.631 ± 0.193
0.229CysTrp: 0.229 ± 0.103
0.287CysTyr: 0.287 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
6.824AspAla: 6.824 ± 0.587
1.032AspCys: 1.032 ± 0.251
4.645AspAsp: 4.645 ± 0.519
3.211AspGlu: 3.211 ± 0.41
1.204AspPhe: 1.204 ± 0.244
6.308AspGly: 6.308 ± 0.544
1.434AspHis: 1.434 ± 0.256
2.351AspIle: 2.351 ± 0.324
1.606AspLys: 1.606 ± 0.272
5.563AspLeu: 5.563 ± 0.436
1.147AspMet: 1.147 ± 0.248
1.95AspAsn: 1.95 ± 0.411
5.333AspPro: 5.333 ± 0.554
2.294AspGln: 2.294 ± 0.322
5.62AspArg: 5.62 ± 0.807
3.097AspSer: 3.097 ± 0.432
4.186AspThr: 4.186 ± 0.49
4.53AspVal: 4.53 ± 0.636
1.778AspTrp: 1.778 ± 0.338
2.294AspTyr: 2.294 ± 0.345
0.0AspXaa: 0.0 ± 0.0
Glu
6.48GluAla: 6.48 ± 0.746
1.032GluCys: 1.032 ± 0.249
3.097GluAsp: 3.097 ± 0.32
3.039GluGlu: 3.039 ± 0.453
2.179GluPhe: 2.179 ± 0.271
3.441GluGly: 3.441 ± 0.343
1.491GluHis: 1.491 ± 0.368
2.81GluIle: 2.81 ± 0.488
2.064GluLys: 2.064 ± 0.366
5.161GluLeu: 5.161 ± 0.705
1.491GluMet: 1.491 ± 0.276
1.95GluAsn: 1.95 ± 0.29
2.925GluPro: 2.925 ± 0.395
2.925GluGln: 2.925 ± 0.442
4.588GluArg: 4.588 ± 0.493
3.613GluSer: 3.613 ± 0.472
4.014GluThr: 4.014 ± 0.516
3.67GluVal: 3.67 ± 0.588
1.032GluTrp: 1.032 ± 0.175
1.434GluTyr: 1.434 ± 0.278
0.0GluXaa: 0.0 ± 0.0
Phe
3.097PheAla: 3.097 ± 0.398
0.344PheCys: 0.344 ± 0.124
2.523PheAsp: 2.523 ± 0.365
1.663PheGlu: 1.663 ± 0.309
0.86PhePhe: 0.86 ± 0.222
3.269PheGly: 3.269 ± 0.614
0.344PheHis: 0.344 ± 0.127
1.262PheIle: 1.262 ± 0.343
1.262PheLys: 1.262 ± 0.27
2.064PheLeu: 2.064 ± 0.334
0.631PheMet: 0.631 ± 0.195
1.319PheAsn: 1.319 ± 0.356
1.663PhePro: 1.663 ± 0.279
0.975PheGln: 0.975 ± 0.277
1.548PheArg: 1.548 ± 0.28
1.835PheSer: 1.835 ± 0.292
2.351PheThr: 2.351 ± 0.329
1.892PheVal: 1.892 ± 0.282
0.688PheTrp: 0.688 ± 0.178
0.86PheTyr: 0.86 ± 0.266
0.0PheXaa: 0.0 ± 0.0
Gly
9.577GlyAla: 9.577 ± 1.044
1.032GlyCys: 1.032 ± 0.335
6.251GlyAsp: 6.251 ± 0.524
3.67GlyGlu: 3.67 ± 0.535
2.982GlyPhe: 2.982 ± 0.489
10.265GlyGly: 10.265 ± 2.067
2.007GlyHis: 2.007 ± 0.322
4.129GlyIle: 4.129 ± 0.606
2.753GlyLys: 2.753 ± 0.369
5.849GlyLeu: 5.849 ± 0.638
2.466GlyMet: 2.466 ± 0.455
3.097GlyAsn: 3.097 ± 0.395
3.727GlyPro: 3.727 ± 0.476
1.835GlyGln: 1.835 ± 0.498
4.473GlyArg: 4.473 ± 0.61
5.505GlySer: 5.505 ± 0.706
6.709GlyThr: 6.709 ± 0.668
6.48GlyVal: 6.48 ± 0.565
2.695GlyTrp: 2.695 ± 0.368
2.294GlyTyr: 2.294 ± 0.409
0.0GlyXaa: 0.0 ± 0.0
His
1.95HisAla: 1.95 ± 0.37
0.401HisCys: 0.401 ± 0.182
1.319HisAsp: 1.319 ± 0.278
1.032HisGlu: 1.032 ± 0.224
0.344HisPhe: 0.344 ± 0.104
1.778HisGly: 1.778 ± 0.334
0.918HisHis: 0.918 ± 0.283
1.491HisIle: 1.491 ± 0.333
0.745HisLys: 0.745 ± 0.235
1.892HisLeu: 1.892 ± 0.294
0.459HisMet: 0.459 ± 0.136
0.803HisAsn: 0.803 ± 0.217
1.548HisPro: 1.548 ± 0.226
0.516HisGln: 0.516 ± 0.164
1.835HisArg: 1.835 ± 0.332
0.918HisSer: 0.918 ± 0.224
1.491HisThr: 1.491 ± 0.34
1.376HisVal: 1.376 ± 0.312
0.401HisTrp: 0.401 ± 0.132
0.86HisTyr: 0.86 ± 0.212
0.0HisXaa: 0.0 ± 0.0
Ile
5.218IleAla: 5.218 ± 0.554
0.688IleCys: 0.688 ± 0.223
3.441IleAsp: 3.441 ± 0.443
3.211IleGlu: 3.211 ± 0.418
1.032IlePhe: 1.032 ± 0.264
3.785IleGly: 3.785 ± 0.515
1.204IleHis: 1.204 ± 0.281
1.376IleIle: 1.376 ± 0.288
1.376IleLys: 1.376 ± 0.297
2.409IleLeu: 2.409 ± 0.418
0.344IleMet: 0.344 ± 0.129
1.95IleAsn: 1.95 ± 0.333
2.753IlePro: 2.753 ± 0.38
1.663IleGln: 1.663 ± 0.313
2.294IleArg: 2.294 ± 0.366
2.064IleSer: 2.064 ± 0.366
3.383IleThr: 3.383 ± 0.389
3.211IleVal: 3.211 ± 0.349
1.09IleTrp: 1.09 ± 0.243
0.975IleTyr: 0.975 ± 0.253
0.0IleXaa: 0.0 ± 0.0
Lys
3.957LysAla: 3.957 ± 0.529
0.401LysCys: 0.401 ± 0.137
1.72LysAsp: 1.72 ± 0.235
1.434LysGlu: 1.434 ± 0.272
1.032LysPhe: 1.032 ± 0.188
3.039LysGly: 3.039 ± 0.337
1.262LysHis: 1.262 ± 0.327
0.975LysIle: 0.975 ± 0.257
1.548LysLys: 1.548 ± 0.366
3.039LysLeu: 3.039 ± 0.472
0.803LysMet: 0.803 ± 0.206
0.86LysAsn: 0.86 ± 0.204
2.523LysPro: 2.523 ± 0.344
1.606LysGln: 1.606 ± 0.254
2.236LysArg: 2.236 ± 0.354
1.72LysSer: 1.72 ± 0.296
2.007LysThr: 2.007 ± 0.373
2.409LysVal: 2.409 ± 0.374
0.688LysTrp: 0.688 ± 0.247
0.86LysTyr: 0.86 ± 0.252
0.0LysXaa: 0.0 ± 0.0
Leu
7.283LeuAla: 7.283 ± 0.722
0.918LeuCys: 0.918 ± 0.234
5.391LeuAsp: 5.391 ± 0.554
3.785LeuGlu: 3.785 ± 0.498
2.294LeuPhe: 2.294 ± 0.31
5.505LeuGly: 5.505 ± 0.581
0.975LeuHis: 0.975 ± 0.264
3.154LeuIle: 3.154 ± 0.353
2.581LeuLys: 2.581 ± 0.378
4.989LeuLeu: 4.989 ± 0.59
1.663LeuMet: 1.663 ± 0.313
2.466LeuAsn: 2.466 ± 0.378
4.702LeuPro: 4.702 ± 0.72
2.925LeuGln: 2.925 ± 0.457
5.391LeuArg: 5.391 ± 0.611
4.645LeuSer: 4.645 ± 0.614
5.563LeuThr: 5.563 ± 0.534
5.161LeuVal: 5.161 ± 0.603
1.319LeuTrp: 1.319 ± 0.363
2.236LeuTyr: 2.236 ± 0.375
0.0LeuXaa: 0.0 ± 0.0
Met
2.007MetAla: 2.007 ± 0.385
0.172MetCys: 0.172 ± 0.179
1.204MetAsp: 1.204 ± 0.23
0.975MetGlu: 0.975 ± 0.214
0.745MetPhe: 0.745 ± 0.218
2.064MetGly: 2.064 ± 0.361
0.115MetHis: 0.115 ± 0.082
0.975MetIle: 0.975 ± 0.251
0.688MetLys: 0.688 ± 0.179
1.548MetLeu: 1.548 ± 0.227
0.459MetMet: 0.459 ± 0.209
1.204MetAsn: 1.204 ± 0.238
1.262MetPro: 1.262 ± 0.243
0.401MetGln: 0.401 ± 0.147
1.663MetArg: 1.663 ± 0.354
2.753MetSer: 2.753 ± 0.441
2.064MetThr: 2.064 ± 0.294
1.319MetVal: 1.319 ± 0.334
0.459MetTrp: 0.459 ± 0.175
0.344MetTyr: 0.344 ± 0.14
0.0MetXaa: 0.0 ± 0.0
Asn
3.498AsnAla: 3.498 ± 0.507
0.344AsnCys: 0.344 ± 0.157
1.835AsnAsp: 1.835 ± 0.26
2.064AsnGlu: 2.064 ± 0.386
0.918AsnPhe: 0.918 ± 0.276
4.014AsnGly: 4.014 ± 0.612
0.918AsnHis: 0.918 ± 0.192
1.491AsnIle: 1.491 ± 0.445
0.918AsnLys: 0.918 ± 0.252
2.581AsnLeu: 2.581 ± 0.353
0.688AsnMet: 0.688 ± 0.166
1.606AsnAsn: 1.606 ± 0.408
3.154AsnPro: 3.154 ± 0.39
1.09AsnGln: 1.09 ± 0.367
1.835AsnArg: 1.835 ± 0.316
1.663AsnSer: 1.663 ± 0.31
2.122AsnThr: 2.122 ± 0.331
1.663AsnVal: 1.663 ± 0.3
0.86AsnTrp: 0.86 ± 0.219
0.745AsnTyr: 0.745 ± 0.184
0.0AsnXaa: 0.0 ± 0.0
Pro
5.907ProAla: 5.907 ± 0.56
0.688ProCys: 0.688 ± 0.201
4.358ProAsp: 4.358 ± 0.522
4.817ProGlu: 4.817 ± 0.491
2.294ProPhe: 2.294 ± 0.377
6.251ProGly: 6.251 ± 0.698
1.319ProHis: 1.319 ± 0.285
1.95ProIle: 1.95 ± 0.293
2.064ProLys: 2.064 ± 0.301
3.957ProLeu: 3.957 ± 0.494
1.376ProMet: 1.376 ± 0.301
2.294ProAsn: 2.294 ± 0.334
4.645ProPro: 4.645 ± 0.675
2.409ProGln: 2.409 ± 0.352
3.039ProArg: 3.039 ± 0.437
3.326ProSer: 3.326 ± 0.5
2.982ProThr: 2.982 ± 0.446
4.53ProVal: 4.53 ± 0.548
1.09ProTrp: 1.09 ± 0.273
1.892ProTyr: 1.892 ± 0.354
0.0ProXaa: 0.0 ± 0.0
Gln
4.645GlnAla: 4.645 ± 0.468
0.287GlnCys: 0.287 ± 0.169
1.548GlnAsp: 1.548 ± 0.284
2.064GlnGlu: 2.064 ± 0.348
1.032GlnPhe: 1.032 ± 0.234
2.351GlnGly: 2.351 ± 0.452
0.745GlnHis: 0.745 ± 0.214
1.778GlnIle: 1.778 ± 0.355
1.606GlnLys: 1.606 ± 0.259
3.269GlnLeu: 3.269 ± 0.418
0.573GlnMet: 0.573 ± 0.168
0.803GlnAsn: 0.803 ± 0.203
2.064GlnPro: 2.064 ± 0.395
1.778GlnGln: 1.778 ± 0.405
2.466GlnArg: 2.466 ± 0.333
2.581GlnSer: 2.581 ± 0.352
1.491GlnThr: 1.491 ± 0.316
2.409GlnVal: 2.409 ± 0.361
0.745GlnTrp: 0.745 ± 0.172
0.918GlnTyr: 0.918 ± 0.229
0.0GlnXaa: 0.0 ± 0.0
Arg
6.939ArgAla: 6.939 ± 0.654
1.204ArgCys: 1.204 ± 0.373
4.817ArgAsp: 4.817 ± 0.579
5.333ArgGlu: 5.333 ± 0.774
2.294ArgPhe: 2.294 ± 0.337
3.9ArgGly: 3.9 ± 0.424
1.147ArgHis: 1.147 ± 0.28
3.957ArgIle: 3.957 ± 0.522
2.122ArgLys: 2.122 ± 0.319
4.301ArgLeu: 4.301 ± 0.558
2.409ArgMet: 2.409 ± 0.377
2.294ArgAsn: 2.294 ± 0.331
3.326ArgPro: 3.326 ± 0.454
2.236ArgGln: 2.236 ± 0.368
4.76ArgArg: 4.76 ± 0.71
4.129ArgSer: 4.129 ± 0.478
3.498ArgThr: 3.498 ± 0.51
4.301ArgVal: 4.301 ± 0.527
1.835ArgTrp: 1.835 ± 0.34
1.892ArgTyr: 1.892 ± 0.286
0.0ArgXaa: 0.0 ± 0.0
Ser
5.792SerAla: 5.792 ± 0.621
0.401SerCys: 0.401 ± 0.172
4.072SerAsp: 4.072 ± 0.539
3.842SerGlu: 3.842 ± 0.448
2.409SerPhe: 2.409 ± 0.458
6.136SerGly: 6.136 ± 0.746
1.319SerHis: 1.319 ± 0.247
2.409SerIle: 2.409 ± 0.407
2.294SerLys: 2.294 ± 0.345
3.498SerLeu: 3.498 ± 0.465
1.262SerMet: 1.262 ± 0.264
2.294SerAsn: 2.294 ± 0.468
3.613SerPro: 3.613 ± 0.432
1.663SerGln: 1.663 ± 0.284
3.842SerArg: 3.842 ± 0.43
3.842SerSer: 3.842 ± 0.597
3.498SerThr: 3.498 ± 0.429
4.53SerVal: 4.53 ± 0.519
1.434SerTrp: 1.434 ± 0.325
1.376SerTyr: 1.376 ± 0.266
0.0SerXaa: 0.0 ± 0.0
Thr
6.079ThrAla: 6.079 ± 0.63
0.803ThrCys: 0.803 ± 0.217
3.957ThrAsp: 3.957 ± 0.541
3.498ThrGlu: 3.498 ± 0.4
1.95ThrPhe: 1.95 ± 0.326
5.849ThrGly: 5.849 ± 0.724
1.778ThrHis: 1.778 ± 0.326
3.613ThrIle: 3.613 ± 0.432
2.351ThrLys: 2.351 ± 0.41
4.874ThrLeu: 4.874 ± 0.602
0.975ThrMet: 0.975 ± 0.207
2.236ThrAsn: 2.236 ± 0.389
4.645ThrPro: 4.645 ± 0.766
1.892ThrGln: 1.892 ± 0.328
4.014ThrArg: 4.014 ± 0.573
3.9ThrSer: 3.9 ± 0.39
4.989ThrThr: 4.989 ± 0.57
5.677ThrVal: 5.677 ± 0.64
1.204ThrTrp: 1.204 ± 0.29
1.835ThrTyr: 1.835 ± 0.272
0.0ThrXaa: 0.0 ± 0.0
Val
7.283ValAla: 7.283 ± 0.634
1.147ValCys: 1.147 ± 0.272
5.104ValAsp: 5.104 ± 0.613
3.9ValGlu: 3.9 ± 0.546
2.294ValPhe: 2.294 ± 0.382
5.677ValGly: 5.677 ± 0.691
1.491ValHis: 1.491 ± 0.268
2.925ValIle: 2.925 ± 0.363
2.236ValLys: 2.236 ± 0.387
5.333ValLeu: 5.333 ± 0.65
1.147ValMet: 1.147 ± 0.198
2.122ValAsn: 2.122 ± 0.358
4.244ValPro: 4.244 ± 0.428
2.925ValGln: 2.925 ± 0.38
4.645ValArg: 4.645 ± 0.706
4.932ValSer: 4.932 ± 0.555
4.76ValThr: 4.76 ± 0.485
5.792ValVal: 5.792 ± 0.673
1.835ValTrp: 1.835 ± 0.383
1.204ValTyr: 1.204 ± 0.28
0.0ValXaa: 0.0 ± 0.0
Trp
2.179TrpAla: 2.179 ± 0.302
0.229TrpCys: 0.229 ± 0.112
1.548TrpAsp: 1.548 ± 0.293
1.032TrpGlu: 1.032 ± 0.213
0.688TrpPhe: 0.688 ± 0.186
0.918TrpGly: 0.918 ± 0.214
0.516TrpHis: 0.516 ± 0.159
1.319TrpIle: 1.319 ± 0.273
0.688TrpLys: 0.688 ± 0.178
1.892TrpLeu: 1.892 ± 0.381
1.09TrpMet: 1.09 ± 0.288
0.401TrpAsn: 0.401 ± 0.184
1.491TrpPro: 1.491 ± 0.305
1.204TrpGln: 1.204 ± 0.252
2.064TrpArg: 2.064 ± 0.31
1.548TrpSer: 1.548 ± 0.32
1.491TrpThr: 1.491 ± 0.241
1.72TrpVal: 1.72 ± 0.396
0.86TrpTrp: 0.86 ± 0.21
0.516TrpTyr: 0.516 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.523TyrAla: 2.523 ± 0.375
0.172TyrCys: 0.172 ± 0.096
2.007TyrAsp: 2.007 ± 0.337
1.72TyrGlu: 1.72 ± 0.285
0.918TyrPhe: 0.918 ± 0.23
2.064TyrGly: 2.064 ± 0.336
0.344TyrHis: 0.344 ± 0.146
1.09TyrIle: 1.09 ± 0.255
0.745TyrLys: 0.745 ± 0.215
2.351TyrLeu: 2.351 ± 0.428
0.287TyrMet: 0.287 ± 0.131
0.516TyrAsn: 0.516 ± 0.154
1.262TyrPro: 1.262 ± 0.266
0.631TyrGln: 0.631 ± 0.196
2.236TyrArg: 2.236 ± 0.345
1.204TyrSer: 1.204 ± 0.234
1.835TyrThr: 1.835 ± 0.313
2.523TyrVal: 2.523 ± 0.331
0.631TyrTrp: 0.631 ± 0.191
0.745TyrTyr: 0.745 ± 0.18
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 103 proteins (17439 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski