Amino acid dipeptide frequency for Mycobacterium phage VA6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones in blue, in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
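The counting and the uniform expectation described above can be sketched in Python. This is only an illustration, not the database's own code: the names `dipeptide_permille`, `EXPECTED_PERMILLE` and `classify` are hypothetical, and counting overlapping pairs within each protein and normalizing by the total number of pairs are assumptions (the exact normalization used for the table is not stated here).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids (one-letter codes)

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs.

    Assumption: pairs are counted with overlap and never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2

def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"
```

Under these assumptions, a table value such as AlaAla (10.573‰) would be labelled "over" and CysCys (0.122‰) "under", since the uniform expectation is 2.5‰.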

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second; each cell is frequency ± error in ‰.

First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 10.573 ± 1.104 | 0.55 ± 0.173 | 4.462 ± 0.542 | 8.251 ± 0.687 | 3.728 ± 0.469 | 8.618 ± 0.923 | 1.711 ± 0.335 | 4.523 ± 0.588 | 4.462 ± 0.54 | 9.962 ± 1.123 | 2.873 ± 0.409 | 3.056 ± 0.465 | 5.256 ± 0.558 | 3.912 ± 0.556 | 6.784 ± 0.663 | 4.828 ± 0.638 | 5.439 ± 0.51 | 7.09 ± 0.657 | 1.589 ± 0.339 | 2.322 ± 0.394 | 0.0 ± 0.0
Cys | 0.733 ± 0.18 | 0.122 ± 0.117 | 0.733 ± 0.205 | 0.428 ± 0.16 | 0.244 ± 0.141 | 0.733 ± 0.211 | 0.244 ± 0.137 | 0.244 ± 0.155 | 0.672 ± 0.178 | 0.917 ± 0.283 | 0.244 ± 0.178 | 0.55 ± 0.162 | 0.55 ± 0.239 | 0.061 ± 0.068 | 0.428 ± 0.173 | 0.795 ± 0.234 | 0.55 ± 0.201 | 0.55 ± 0.181 | 0.306 ± 0.141 | 0.611 ± 0.173 | 0.0 ± 0.0
Asp | 6.295 ± 0.745 | 0.733 ± 0.2 | 3.667 ± 0.528 | 5.256 ± 0.753 | 2.506 ± 0.469 | 5.867 ± 0.574 | 1.895 ± 0.405 | 3.117 ± 0.43 | 2.384 ± 0.346 | 4.889 ± 0.533 | 1.834 ± 0.33 | 1.65 ± 0.383 | 4.4 ± 0.553 | 2.261 ± 0.306 | 3.239 ± 0.437 | 2.628 ± 0.376 | 2.811 ± 0.409 | 4.156 ± 0.51 | 1.406 ± 0.298 | 1.956 ± 0.288 | 0.0 ± 0.0
Glu | 7.64 ± 0.735 | 0.244 ± 0.123 | 5.623 ± 0.719 | 5.195 ± 0.643 | 3.361 ± 0.472 | 5.317 ± 0.564 | 2.078 ± 0.343 | 3.912 ± 0.462 | 2.934 ± 0.422 | 6.784 ± 0.673 | 2.384 ± 0.403 | 2.078 ± 0.359 | 2.506 ± 0.44 | 2.567 ± 0.326 | 4.645 ± 0.441 | 2.934 ± 0.482 | 3.912 ± 0.434 | 4.523 ± 0.533 | 1.467 ± 0.323 | 1.895 ± 0.34 | 0.0 ± 0.0
Phe | 3.361 ± 0.474 | 0.367 ± 0.137 | 2.873 ± 0.512 | 2.322 ± 0.361 | 0.856 ± 0.269 | 3.423 ± 0.452 | 0.978 ± 0.298 | 1.222 ± 0.33 | 1.834 ± 0.357 | 2.934 ± 0.403 | 0.55 ± 0.169 | 1.467 ± 0.339 | 1.834 ± 0.353 | 1.161 ± 0.268 | 1.956 ± 0.317 | 1.895 ± 0.356 | 1.956 ± 0.271 | 2.384 ± 0.404 | 0.55 ± 0.178 | 0.978 ± 0.207 | 0.0 ± 0.0
Gly | 7.273 ± 0.895 | 0.733 ± 0.218 | 5.501 ± 0.645 | 5.806 ± 0.64 | 3.3 ± 0.511 | 8.556 ± 1.644 | 1.528 ± 0.29 | 4.4 ± 0.636 | 4.278 ± 0.515 | 6.967 ± 0.705 | 1.65 ± 0.308 | 2.995 ± 0.531 | 3.667 ± 0.382 | 3.117 ± 0.458 | 3.973 ± 0.401 | 4.339 ± 0.606 | 5.073 ± 0.666 | 6.601 ± 0.735 | 1.406 ± 0.268 | 2.445 ± 0.339 | 0.0 ± 0.0
His | 1.589 ± 0.321 | 0.306 ± 0.124 | 1.406 ± 0.289 | 1.406 ± 0.307 | 0.55 ± 0.161 | 1.956 ± 0.433 | 0.55 ± 0.196 | 1.283 ± 0.253 | 1.039 ± 0.244 | 1.528 ± 0.356 | 0.428 ± 0.18 | 0.672 ± 0.203 | 1.161 ± 0.246 | 0.795 ± 0.194 | 1.65 ± 0.367 | 0.856 ± 0.276 | 1.345 ± 0.222 | 1.039 ± 0.264 | 0.428 ± 0.164 | 0.55 ± 0.243 | 0.0 ± 0.0
Ile | 5.623 ± 0.429 | 0.672 ± 0.222 | 3.423 ± 0.453 | 4.339 ± 0.506 | 1.589 ± 0.311 | 4.706 ± 0.661 | 1.1 ± 0.206 | 1.772 ± 0.276 | 2.384 ± 0.391 | 3.423 ± 0.467 | 0.795 ± 0.19 | 2.2 ± 0.297 | 3.361 ± 0.484 | 1.528 ± 0.342 | 3.178 ± 0.384 | 2.567 ± 0.321 | 3.178 ± 0.377 | 2.139 ± 0.338 | 0.489 ± 0.139 | 0.917 ± 0.233 | 0.0 ± 0.0
Lys | 4.95 ± 0.603 | 0.306 ± 0.111 | 2.873 ± 0.396 | 2.322 ± 0.295 | 1.161 ± 0.234 | 3.606 ± 0.557 | 0.611 ± 0.203 | 2.811 ± 0.427 | 2.811 ± 0.598 | 4.156 ± 0.435 | 1.161 ± 0.297 | 1.345 ± 0.27 | 2.445 ± 0.433 | 1.834 ± 0.326 | 3.423 ± 0.454 | 2.567 ± 0.381 | 2.873 ± 0.428 | 3.85 ± 0.366 | 0.611 ± 0.211 | 1.406 ± 0.317 | 0.0 ± 0.0
Leu | 8.434 ± 0.717 | 0.795 ± 0.22 | 5.378 ± 0.694 | 6.295 ± 0.74 | 2.75 ± 0.334 | 5.562 ± 0.726 | 1.895 ± 0.354 | 4.095 ± 0.435 | 3.423 ± 0.431 | 6.295 ± 0.634 | 2.567 ± 0.451 | 2.384 ± 0.366 | 4.339 ± 0.478 | 2.75 ± 0.494 | 5.623 ± 0.578 | 5.378 ± 0.724 | 5.562 ± 0.596 | 5.439 ± 0.575 | 1.589 ± 0.271 | 2.2 ± 0.363 | 0.0 ± 0.0
Met | 3.056 ± 0.466 | 0.061 ± 0.068 | 1.283 ± 0.281 | 1.345 ± 0.263 | 0.611 ± 0.179 | 1.711 ± 0.32 | 0.489 ± 0.135 | 1.467 ± 0.304 | 1.406 ± 0.293 | 1.283 ± 0.355 | 0.55 ± 0.153 | 0.55 ± 0.187 | 1.467 ± 0.322 | 1.1 ± 0.299 | 1.65 ± 0.308 | 2.445 ± 0.283 | 2.506 ± 0.353 | 0.856 ± 0.219 | 0.183 ± 0.096 | 0.733 ± 0.201 | 0.0 ± 0.0
Asn | 2.75 ± 0.424 | 0.489 ± 0.185 | 1.956 ± 0.413 | 2.139 ± 0.348 | 0.733 ± 0.221 | 3.117 ± 0.453 | 0.856 ± 0.191 | 1.1 ± 0.211 | 1.161 ± 0.249 | 3.239 ± 0.434 | 0.672 ± 0.192 | 0.611 ± 0.211 | 2.445 ± 0.377 | 0.856 ± 0.214 | 2.261 ± 0.453 | 1.345 ± 0.269 | 2.322 ± 0.454 | 2.078 ± 0.37 | 0.978 ± 0.241 | 0.917 ± 0.227 | 0.0 ± 0.0
Pro | 4.889 ± 0.524 | 0.428 ± 0.198 | 4.339 ± 0.516 | 5.012 ± 0.547 | 1.711 ± 0.354 | 4.462 ± 0.635 | 0.856 ± 0.254 | 2.384 ± 0.337 | 2.261 ± 0.382 | 3.178 ± 0.488 | 1.222 ± 0.263 | 1.834 ± 0.335 | 2.017 ± 0.399 | 1.528 ± 0.314 | 3.484 ± 0.477 | 2.445 ± 0.345 | 3.178 ± 0.438 | 3.85 ± 0.425 | 1.222 ± 0.325 | 1.467 ± 0.288 | 0.0 ± 0.0
Gln | 4.584 ± 0.611 | 0.306 ± 0.152 | 1.222 ± 0.271 | 1.834 ± 0.291 | 1.283 ± 0.279 | 2.75 ± 0.443 | 0.489 ± 0.131 | 2.75 ± 0.349 | 1.772 ± 0.367 | 3.361 ± 0.643 | 0.978 ± 0.224 | 0.611 ± 0.198 | 1.039 ± 0.32 | 2.261 ± 0.428 | 2.445 ± 0.433 | 1.528 ± 0.315 | 2.322 ± 0.378 | 3.056 ± 0.377 | 0.795 ± 0.254 | 0.978 ± 0.218 | 0.0 ± 0.0
Arg | 5.256 ± 0.672 | 1.1 ± 0.284 | 3.973 ± 0.46 | 4.95 ± 0.596 | 2.506 ± 0.45 | 4.034 ± 0.511 | 1.283 ± 0.306 | 3.361 ± 0.448 | 3.667 ± 0.487 | 5.195 ± 0.615 | 1.711 ± 0.325 | 2.384 ± 0.361 | 2.934 ± 0.411 | 2.139 ± 0.337 | 4.95 ± 0.626 | 2.995 ± 0.434 | 3.361 ± 0.485 | 4.645 ± 0.586 | 1.222 ± 0.262 | 2.384 ± 0.382 | 0.0 ± 0.0
Ser | 5.195 ± 0.676 | 0.55 ± 0.171 | 3.117 ± 0.361 | 3.361 ± 0.525 | 1.834 ± 0.319 | 4.889 ± 0.501 | 0.856 ± 0.199 | 1.834 ± 0.303 | 2.811 ± 0.608 | 3.973 ± 0.68 | 1.528 ± 0.242 | 1.406 ± 0.276 | 2.75 ± 0.381 | 2.078 ± 0.297 | 3.239 ± 0.42 | 2.567 ± 0.411 | 3.3 ± 0.412 | 3.545 ± 0.491 | 1.039 ± 0.278 | 1.528 ± 0.263 | 0.0 ± 0.0
Thr | 6.173 ± 0.586 | 0.489 ± 0.161 | 3.423 ± 0.529 | 3.484 ± 0.402 | 2.017 ± 0.35 | 5.256 ± 0.619 | 1.222 ± 0.313 | 3.056 ± 0.456 | 2.873 ± 0.486 | 5.256 ± 0.523 | 1.345 ± 0.268 | 1.528 ± 0.286 | 4.523 ± 0.545 | 1.772 ± 0.322 | 3.239 ± 0.393 | 2.506 ± 0.354 | 2.873 ± 0.429 | 5.623 ± 0.626 | 1.161 ± 0.257 | 1.895 ± 0.289 | 0.0 ± 0.0
Val | 6.723 ± 0.851 | 0.672 ± 0.195 | 4.645 ± 0.554 | 4.95 ± 0.522 | 2.567 ± 0.444 | 5.012 ± 0.498 | 1.161 ± 0.271 | 3.361 ± 0.507 | 3.728 ± 0.391 | 5.806 ± 0.648 | 1.222 ± 0.298 | 3.056 ± 0.461 | 2.689 ± 0.458 | 2.322 ± 0.451 | 4.767 ± 0.582 | 4.156 ± 0.453 | 4.4 ± 0.376 | 5.012 ± 0.452 | 1.406 ± 0.306 | 2.017 ± 0.427 | 0.0 ± 0.0
Trp | 1.834 ± 0.418 | 0.367 ± 0.168 | 0.978 ± 0.238 | 1.406 ± 0.247 | 0.672 ± 0.212 | 1.528 ± 0.349 | 0.367 ± 0.165 | 0.978 ± 0.234 | 0.489 ± 0.193 | 1.406 ± 0.298 | 0.367 ± 0.133 | 0.672 ± 0.213 | 0.917 ± 0.24 | 1.283 ± 0.254 | 1.283 ± 0.249 | 0.978 ± 0.272 | 1.1 ± 0.258 | 1.283 ± 0.246 | 0.489 ± 0.196 | 0.489 ± 0.181 | 0.0 ± 0.0
Tyr | 3.056 ± 0.375 | 0.428 ± 0.163 | 2.078 ± 0.314 | 1.895 ± 0.352 | 0.978 ± 0.21 | 2.506 ± 0.361 | 0.367 ± 0.167 | 1.711 ± 0.302 | 0.733 ± 0.201 | 2.2 ± 0.362 | 0.611 ± 0.187 | 0.978 ± 0.244 | 1.467 ± 0.291 | 1.039 ± 0.224 | 1.834 ± 0.355 | 1.65 ± 0.334 | 1.65 ± 0.278 | 1.956 ± 0.366 | 0.55 ± 0.251 | 0.733 ± 0.193 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 93 proteins (16363 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
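A protein-level bootstrap of this kind can be sketched in Python as follows. This is an illustration under stated assumptions, not the database's implementation: `pair_permille` and `bootstrap_error` are hypothetical names, whole proteins are resampled with replacement 100 times, and the error is taken to be the standard deviation of the resampled frequencies (the exact error measure used for the table is not stated here).

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over all adjacent residue pairs."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    if total == 0:
        return 0.0
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins.

    Assumption: the error is the sample standard deviation across n_boot resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the estimate reflects how much a frequency depends on which proteins happen to be in the proteome.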

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski