Amino acid dipepetide frequency for Mycobacterium phage Violet

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.065AlaAla: 13.065 ± 1.54
0.783AlaCys: 0.783 ± 0.21
6.502AlaAsp: 6.502 ± 0.723
6.081AlaGlu: 6.081 ± 0.688
3.131AlaPhe: 3.131 ± 0.44
8.369AlaGly: 8.369 ± 0.763
1.204AlaHis: 1.204 ± 0.334
4.154AlaIle: 4.154 ± 0.588
4.034AlaLys: 4.034 ± 0.52
8.309AlaLeu: 8.309 ± 0.854
1.987AlaMet: 1.987 ± 0.315
2.89AlaAsn: 2.89 ± 0.444
5.178AlaPro: 5.178 ± 0.768
3.01AlaGln: 3.01 ± 0.47
6.081AlaArg: 6.081 ± 0.507
5.479AlaSer: 5.479 ± 0.715
5.78AlaThr: 5.78 ± 0.657
7.947AlaVal: 7.947 ± 0.724
1.927AlaTrp: 1.927 ± 0.366
2.77AlaTyr: 2.77 ± 0.406
0.0AlaXaa: 0.0 ± 0.0
Cys
0.963CysAla: 0.963 ± 0.335
0.06CysCys: 0.06 ± 0.062
0.662CysAsp: 0.662 ± 0.233
0.783CysGlu: 0.783 ± 0.184
0.12CysPhe: 0.12 ± 0.094
0.602CysGly: 0.602 ± 0.195
0.181CysHis: 0.181 ± 0.109
0.482CysIle: 0.482 ± 0.17
0.241CysLys: 0.241 ± 0.142
0.602CysLeu: 0.602 ± 0.233
0.12CysMet: 0.12 ± 0.105
0.241CysAsn: 0.241 ± 0.133
0.241CysPro: 0.241 ± 0.109
0.12CysGln: 0.12 ± 0.077
0.783CysArg: 0.783 ± 0.295
0.361CysSer: 0.361 ± 0.145
0.361CysThr: 0.361 ± 0.173
0.241CysVal: 0.241 ± 0.125
0.181CysTrp: 0.181 ± 0.102
0.12CysTyr: 0.12 ± 0.098
0.0CysXaa: 0.0 ± 0.0
Asp
5.84AspAla: 5.84 ± 0.623
0.722AspCys: 0.722 ± 0.246
4.215AspAsp: 4.215 ± 0.524
3.733AspGlu: 3.733 ± 0.437
2.228AspPhe: 2.228 ± 0.304
6.262AspGly: 6.262 ± 0.808
1.144AspHis: 1.144 ± 0.324
2.649AspIle: 2.649 ± 0.368
2.228AspLys: 2.228 ± 0.419
6.864AspLeu: 6.864 ± 0.835
1.144AspMet: 1.144 ± 0.24
1.686AspAsn: 1.686 ± 0.34
4.576AspPro: 4.576 ± 0.635
1.746AspGln: 1.746 ± 0.452
4.154AspArg: 4.154 ± 0.474
3.191AspSer: 3.191 ± 0.494
4.395AspThr: 4.395 ± 0.571
3.974AspVal: 3.974 ± 0.559
1.505AspTrp: 1.505 ± 0.345
1.866AspTyr: 1.866 ± 0.351
0.0AspXaa: 0.0 ± 0.0
Glu
6.081GluAla: 6.081 ± 0.863
0.361GluCys: 0.361 ± 0.205
4.395GluAsp: 4.395 ± 0.534
5.118GluGlu: 5.118 ± 0.62
1.987GluPhe: 1.987 ± 0.365
3.853GluGly: 3.853 ± 0.438
1.565GluHis: 1.565 ± 0.309
3.492GluIle: 3.492 ± 0.461
2.95GluLys: 2.95 ± 0.424
6.804GluLeu: 6.804 ± 0.5
1.385GluMet: 1.385 ± 0.277
1.686GluAsn: 1.686 ± 0.439
2.77GluPro: 2.77 ± 0.442
2.469GluGln: 2.469 ± 0.422
3.853GluArg: 3.853 ± 0.562
3.853GluSer: 3.853 ± 0.452
3.673GluThr: 3.673 ± 0.467
5.479GluVal: 5.479 ± 0.537
1.445GluTrp: 1.445 ± 0.343
2.649GluTyr: 2.649 ± 0.465
0.0GluXaa: 0.0 ± 0.0
Phe
2.408PheAla: 2.408 ± 0.33
0.361PheCys: 0.361 ± 0.168
3.071PheAsp: 3.071 ± 0.376
2.167PheGlu: 2.167 ± 0.417
0.662PhePhe: 0.662 ± 0.192
3.131PheGly: 3.131 ± 0.537
0.602PheHis: 0.602 ± 0.242
1.204PheIle: 1.204 ± 0.264
1.325PheLys: 1.325 ± 0.292
2.408PheLeu: 2.408 ± 0.525
0.662PheMet: 0.662 ± 0.231
1.325PheAsn: 1.325 ± 0.301
1.626PhePro: 1.626 ± 0.326
0.903PheGln: 0.903 ± 0.218
1.927PheArg: 1.927 ± 0.434
2.167PheSer: 2.167 ± 0.528
1.866PheThr: 1.866 ± 0.376
1.806PheVal: 1.806 ± 0.355
0.482PheTrp: 0.482 ± 0.17
0.963PheTyr: 0.963 ± 0.28
0.0PheXaa: 0.0 ± 0.0
Gly
7.646GlyAla: 7.646 ± 1.042
0.783GlyCys: 0.783 ± 0.308
5.961GlyAsp: 5.961 ± 0.539
4.877GlyGlu: 4.877 ± 0.542
2.89GlyPhe: 2.89 ± 0.503
9.874GlyGly: 9.874 ± 2.116
1.987GlyHis: 1.987 ± 0.41
4.636GlyIle: 4.636 ± 0.61
3.492GlyLys: 3.492 ± 0.538
7.827GlyLeu: 7.827 ± 0.863
2.167GlyMet: 2.167 ± 0.327
3.01GlyAsn: 3.01 ± 0.488
3.673GlyPro: 3.673 ± 0.538
2.469GlyGln: 2.469 ± 0.289
5.539GlyArg: 5.539 ± 0.576
6.262GlySer: 6.262 ± 0.901
5.78GlyThr: 5.78 ± 0.697
5.419GlyVal: 5.419 ± 0.591
2.228GlyTrp: 2.228 ± 0.374
2.89GlyTyr: 2.89 ± 0.455
0.0GlyXaa: 0.0 ± 0.0
His
1.626HisAla: 1.626 ± 0.383
0.06HisCys: 0.06 ± 0.057
0.963HisAsp: 0.963 ± 0.243
1.445HisGlu: 1.445 ± 0.298
0.722HisPhe: 0.722 ± 0.198
1.866HisGly: 1.866 ± 0.392
0.662HisHis: 0.662 ± 0.206
0.963HisIle: 0.963 ± 0.207
1.024HisLys: 1.024 ± 0.314
1.626HisLeu: 1.626 ± 0.365
0.06HisMet: 0.06 ± 0.07
0.301HisAsn: 0.301 ± 0.117
1.264HisPro: 1.264 ± 0.245
0.843HisGln: 0.843 ± 0.241
1.866HisArg: 1.866 ± 0.413
0.662HisSer: 0.662 ± 0.217
1.024HisThr: 1.024 ± 0.235
1.626HisVal: 1.626 ± 0.367
0.542HisTrp: 0.542 ± 0.163
0.542HisTyr: 0.542 ± 0.21
0.0HisXaa: 0.0 ± 0.0
Ile
6.081IleAla: 6.081 ± 0.777
0.301IleCys: 0.301 ± 0.15
3.372IleAsp: 3.372 ± 0.325
3.612IleGlu: 3.612 ± 0.435
0.903IlePhe: 0.903 ± 0.223
3.974IleGly: 3.974 ± 0.412
1.084IleHis: 1.084 ± 0.299
1.385IleIle: 1.385 ± 0.309
1.686IleLys: 1.686 ± 0.331
3.311IleLeu: 3.311 ± 0.456
0.722IleMet: 0.722 ± 0.207
1.686IleAsn: 1.686 ± 0.291
3.612IlePro: 3.612 ± 0.417
1.445IleGln: 1.445 ± 0.367
3.612IleArg: 3.612 ± 0.451
3.071IleSer: 3.071 ± 0.467
3.131IleThr: 3.131 ± 0.421
2.709IleVal: 2.709 ± 0.568
0.662IleTrp: 0.662 ± 0.138
1.565IleTyr: 1.565 ± 0.26
0.0IleXaa: 0.0 ± 0.0
Lys
3.853LysAla: 3.853 ± 0.567
0.181LysCys: 0.181 ± 0.127
2.288LysAsp: 2.288 ± 0.455
1.987LysGlu: 1.987 ± 0.383
1.626LysPhe: 1.626 ± 0.331
2.589LysGly: 2.589 ± 0.335
1.084LysHis: 1.084 ± 0.282
2.348LysIle: 2.348 ± 0.467
1.806LysLys: 1.806 ± 0.441
3.432LysLeu: 3.432 ± 0.429
0.903LysMet: 0.903 ± 0.212
1.746LysAsn: 1.746 ± 0.305
2.89LysPro: 2.89 ± 0.495
1.445LysGln: 1.445 ± 0.349
2.529LysArg: 2.529 ± 0.536
2.469LysSer: 2.469 ± 0.374
2.348LysThr: 2.348 ± 0.381
3.552LysVal: 3.552 ± 0.456
0.963LysTrp: 0.963 ± 0.227
0.843LysTyr: 0.843 ± 0.28
0.0LysXaa: 0.0 ± 0.0
Leu
9.152LeuAla: 9.152 ± 0.961
0.181LeuCys: 0.181 ± 0.099
6.442LeuAsp: 6.442 ± 0.532
5.961LeuGlu: 5.961 ± 0.714
2.228LeuPhe: 2.228 ± 0.402
7.767LeuGly: 7.767 ± 0.944
1.204LeuHis: 1.204 ± 0.296
4.516LeuIle: 4.516 ± 0.59
4.094LeuLys: 4.094 ± 0.596
5.66LeuLeu: 5.66 ± 0.491
1.626LeuMet: 1.626 ± 0.308
2.709LeuAsn: 2.709 ± 0.358
5.118LeuPro: 5.118 ± 0.636
2.589LeuGln: 2.589 ± 0.401
5.78LeuArg: 5.78 ± 0.609
5.479LeuSer: 5.479 ± 0.54
6.262LeuThr: 6.262 ± 0.493
5.057LeuVal: 5.057 ± 0.633
1.144LeuTrp: 1.144 ± 0.296
2.228LeuTyr: 2.228 ± 0.352
0.0LeuXaa: 0.0 ± 0.0
Met
2.107MetAla: 2.107 ± 0.355
0.0MetCys: 0.0 ± 0.0
1.024MetAsp: 1.024 ± 0.263
1.385MetGlu: 1.385 ± 0.31
0.602MetPhe: 0.602 ± 0.213
1.385MetGly: 1.385 ± 0.322
0.12MetHis: 0.12 ± 0.108
0.542MetIle: 0.542 ± 0.171
1.024MetLys: 1.024 ± 0.268
1.144MetLeu: 1.144 ± 0.305
0.181MetMet: 0.181 ± 0.11
1.084MetAsn: 1.084 ± 0.234
1.024MetPro: 1.024 ± 0.253
0.542MetGln: 0.542 ± 0.16
1.385MetArg: 1.385 ± 0.276
2.047MetSer: 2.047 ± 0.358
2.167MetThr: 2.167 ± 0.323
1.024MetVal: 1.024 ± 0.247
0.361MetTrp: 0.361 ± 0.144
0.421MetTyr: 0.421 ± 0.134
0.0MetXaa: 0.0 ± 0.0
Asn
3.251AsnAla: 3.251 ± 0.494
0.06AsnCys: 0.06 ± 0.055
2.107AsnAsp: 2.107 ± 0.403
1.445AsnGlu: 1.445 ± 0.271
0.843AsnPhe: 0.843 ± 0.252
3.612AsnGly: 3.612 ± 0.54
0.903AsnHis: 0.903 ± 0.28
1.204AsnIle: 1.204 ± 0.293
0.722AsnLys: 0.722 ± 0.191
2.408AsnLeu: 2.408 ± 0.398
0.602AsnMet: 0.602 ± 0.209
0.783AsnAsn: 0.783 ± 0.202
2.649AsnPro: 2.649 ± 0.456
0.903AsnGln: 0.903 ± 0.261
1.505AsnArg: 1.505 ± 0.35
2.047AsnSer: 2.047 ± 0.416
2.047AsnThr: 2.047 ± 0.333
2.589AsnVal: 2.589 ± 0.409
0.783AsnTrp: 0.783 ± 0.189
1.084AsnTyr: 1.084 ± 0.285
0.0AsnXaa: 0.0 ± 0.0
Pro
5.238ProAla: 5.238 ± 0.605
0.361ProCys: 0.361 ± 0.179
4.275ProAsp: 4.275 ± 0.492
4.215ProGlu: 4.215 ± 0.666
2.047ProPhe: 2.047 ± 0.366
4.817ProGly: 4.817 ± 0.563
0.903ProHis: 0.903 ± 0.236
2.469ProIle: 2.469 ± 0.439
2.167ProLys: 2.167 ± 0.342
4.576ProLeu: 4.576 ± 0.599
0.903ProMet: 0.903 ± 0.235
1.445ProAsn: 1.445 ± 0.297
2.89ProPro: 2.89 ± 0.492
1.204ProGln: 1.204 ± 0.343
3.01ProArg: 3.01 ± 0.589
3.914ProSer: 3.914 ± 0.531
3.974ProThr: 3.974 ± 0.452
4.034ProVal: 4.034 ± 0.612
0.783ProTrp: 0.783 ± 0.263
1.565ProTyr: 1.565 ± 0.333
0.0ProXaa: 0.0 ± 0.0
Gln
3.071GlnAla: 3.071 ± 0.488
0.181GlnCys: 0.181 ± 0.095
1.264GlnAsp: 1.264 ± 0.429
1.686GlnGlu: 1.686 ± 0.325
1.204GlnPhe: 1.204 ± 0.257
2.529GlnGly: 2.529 ± 0.304
0.542GlnHis: 0.542 ± 0.207
2.589GlnIle: 2.589 ± 0.52
0.963GlnLys: 0.963 ± 0.272
3.793GlnLeu: 3.793 ± 0.502
0.903GlnMet: 0.903 ± 0.264
0.361GlnAsn: 0.361 ± 0.132
1.806GlnPro: 1.806 ± 0.27
1.746GlnGln: 1.746 ± 0.406
2.107GlnArg: 2.107 ± 0.419
1.626GlnSer: 1.626 ± 0.286
1.806GlnThr: 1.806 ± 0.337
2.348GlnVal: 2.348 ± 0.407
0.542GlnTrp: 0.542 ± 0.154
0.662GlnTyr: 0.662 ± 0.153
0.0GlnXaa: 0.0 ± 0.0
Arg
5.72ArgAla: 5.72 ± 0.612
1.144ArgCys: 1.144 ± 0.386
2.649ArgAsp: 2.649 ± 0.481
5.178ArgGlu: 5.178 ± 0.643
2.047ArgPhe: 2.047 ± 0.487
5.84ArgGly: 5.84 ± 0.725
1.204ArgHis: 1.204 ± 0.308
3.372ArgIle: 3.372 ± 0.456
3.492ArgLys: 3.492 ± 0.678
5.961ArgLeu: 5.961 ± 0.669
1.626ArgMet: 1.626 ± 0.327
2.167ArgAsn: 2.167 ± 0.427
2.408ArgPro: 2.408 ± 0.366
2.047ArgGln: 2.047 ± 0.381
5.72ArgArg: 5.72 ± 0.881
4.094ArgSer: 4.094 ± 0.504
3.251ArgThr: 3.251 ± 0.441
5.419ArgVal: 5.419 ± 0.453
1.445ArgTrp: 1.445 ± 0.267
1.686ArgTyr: 1.686 ± 0.271
0.0ArgXaa: 0.0 ± 0.0
Ser
6.141SerAla: 6.141 ± 0.942
0.482SerCys: 0.482 ± 0.2
3.251SerAsp: 3.251 ± 0.395
3.733SerGlu: 3.733 ± 0.469
2.107SerPhe: 2.107 ± 0.401
7.105SerGly: 7.105 ± 1.105
1.505SerHis: 1.505 ± 0.326
3.131SerIle: 3.131 ± 0.538
2.288SerLys: 2.288 ± 0.434
4.817SerLeu: 4.817 ± 0.634
1.144SerMet: 1.144 ± 0.242
2.288SerAsn: 2.288 ± 0.377
3.251SerPro: 3.251 ± 0.507
1.806SerGln: 1.806 ± 0.271
3.432SerArg: 3.432 ± 0.47
3.071SerSer: 3.071 ± 0.639
3.251SerThr: 3.251 ± 0.478
4.576SerVal: 4.576 ± 0.563
1.445SerTrp: 1.445 ± 0.336
1.264SerTyr: 1.264 ± 0.33
0.0SerXaa: 0.0 ± 0.0
Thr
6.502ThrAla: 6.502 ± 0.912
0.301ThrCys: 0.301 ± 0.155
4.335ThrAsp: 4.335 ± 0.583
4.215ThrGlu: 4.215 ± 0.482
2.047ThrPhe: 2.047 ± 0.36
6.262ThrGly: 6.262 ± 0.58
1.264ThrHis: 1.264 ± 0.321
2.77ThrIle: 2.77 ± 0.592
2.77ThrLys: 2.77 ± 0.417
5.539ThrLeu: 5.539 ± 0.635
1.204ThrMet: 1.204 ± 0.249
1.987ThrAsn: 1.987 ± 0.354
3.853ThrPro: 3.853 ± 0.504
2.107ThrGln: 2.107 ± 0.376
3.432ThrArg: 3.432 ± 0.533
3.191ThrSer: 3.191 ± 0.448
4.756ThrThr: 4.756 ± 0.572
5.84ThrVal: 5.84 ± 0.686
1.144ThrTrp: 1.144 ± 0.229
1.686ThrTyr: 1.686 ± 0.382
0.0ThrXaa: 0.0 ± 0.0
Val
6.502ValAla: 6.502 ± 0.768
0.602ValCys: 0.602 ± 0.222
5.178ValAsp: 5.178 ± 0.556
4.696ValGlu: 4.696 ± 0.551
2.288ValPhe: 2.288 ± 0.378
5.178ValGly: 5.178 ± 0.767
1.385ValHis: 1.385 ± 0.235
3.311ValIle: 3.311 ± 0.454
3.01ValLys: 3.01 ± 0.431
5.72ValLeu: 5.72 ± 0.733
1.204ValMet: 1.204 ± 0.323
2.529ValAsn: 2.529 ± 0.367
3.914ValPro: 3.914 ± 0.489
2.288ValGln: 2.288 ± 0.428
5.961ValArg: 5.961 ± 0.72
5.057ValSer: 5.057 ± 0.427
5.298ValThr: 5.298 ± 0.617
4.877ValVal: 4.877 ± 0.663
1.264ValTrp: 1.264 ± 0.288
2.348ValTyr: 2.348 ± 0.349
0.0ValXaa: 0.0 ± 0.0
Trp
1.264TrpAla: 1.264 ± 0.291
0.241TrpCys: 0.241 ± 0.12
1.264TrpAsp: 1.264 ± 0.293
1.084TrpGlu: 1.084 ± 0.236
0.903TrpPhe: 0.903 ± 0.248
1.746TrpGly: 1.746 ± 0.289
0.602TrpHis: 0.602 ± 0.197
1.084TrpIle: 1.084 ± 0.225
0.361TrpLys: 0.361 ± 0.163
2.047TrpLeu: 2.047 ± 0.398
0.361TrpMet: 0.361 ± 0.171
0.421TrpAsn: 0.421 ± 0.179
0.843TrpPro: 0.843 ± 0.28
0.843TrpGln: 0.843 ± 0.21
1.264TrpArg: 1.264 ± 0.25
0.602TrpSer: 0.602 ± 0.221
1.866TrpThr: 1.866 ± 0.386
2.047TrpVal: 2.047 ± 0.274
0.482TrpTrp: 0.482 ± 0.185
0.301TrpTyr: 0.301 ± 0.127
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.167TyrAla: 2.167 ± 0.387
0.301TyrCys: 0.301 ± 0.17
0.903TyrAsp: 0.903 ± 0.249
2.469TyrGlu: 2.469 ± 0.351
0.482TyrPhe: 0.482 ± 0.141
2.649TyrGly: 2.649 ± 0.376
0.542TyrHis: 0.542 ± 0.176
1.626TyrIle: 1.626 ± 0.378
1.204TyrLys: 1.204 ± 0.248
2.348TyrLeu: 2.348 ± 0.36
0.602TyrMet: 0.602 ± 0.173
1.204TyrAsn: 1.204 ± 0.328
1.204TyrPro: 1.204 ± 0.271
1.204TyrGln: 1.204 ± 0.266
2.529TyrArg: 2.529 ± 0.351
1.445TyrSer: 1.445 ± 0.264
2.167TyrThr: 2.167 ± 0.392
2.047TyrVal: 2.047 ± 0.348
0.361TyrTrp: 0.361 ± 0.167
0.602TyrTyr: 0.602 ± 0.17
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 90 proteins (16610 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski