Amino acid dipepetide frequency for Tsukamurella phage TPA2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.946AlaAla: 17.946 ± 1.33
0.873AlaCys: 0.873 ± 0.284
7.036AlaAsp: 7.036 ± 0.626
8.237AlaGlu: 8.237 ± 1.039
2.782AlaPhe: 2.782 ± 0.466
9.218AlaGly: 9.218 ± 1.1
2.018AlaHis: 2.018 ± 0.44
4.091AlaIle: 4.091 ± 0.58
3.927AlaLys: 3.927 ± 0.496
8.182AlaLeu: 8.182 ± 0.77
2.782AlaMet: 2.782 ± 0.444
3.436AlaAsn: 3.436 ± 0.665
6.873AlaPro: 6.873 ± 0.728
5.182AlaGln: 5.182 ± 0.548
8.291AlaArg: 8.291 ± 0.663
6.491AlaSer: 6.491 ± 0.898
7.582AlaThr: 7.582 ± 0.7
9.164AlaVal: 9.164 ± 0.794
2.618AlaTrp: 2.618 ± 0.57
2.455AlaTyr: 2.455 ± 0.408
0.0AlaXaa: 0.0 ± 0.0
Cys
1.036CysAla: 1.036 ± 0.294
0.109CysCys: 0.109 ± 0.078
0.709CysAsp: 0.709 ± 0.204
0.382CysGlu: 0.382 ± 0.169
0.109CysPhe: 0.109 ± 0.08
1.364CysGly: 1.364 ± 0.354
0.327CysHis: 0.327 ± 0.159
0.436CysIle: 0.436 ± 0.155
0.109CysLys: 0.109 ± 0.076
0.655CysLeu: 0.655 ± 0.192
0.164CysMet: 0.164 ± 0.097
0.273CysAsn: 0.273 ± 0.149
1.418CysPro: 1.418 ± 0.312
0.218CysGln: 0.218 ± 0.111
0.491CysArg: 0.491 ± 0.196
0.491CysSer: 0.491 ± 0.193
0.545CysThr: 0.545 ± 0.167
0.927CysVal: 0.927 ± 0.277
0.164CysTrp: 0.164 ± 0.093
0.109CysTyr: 0.109 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
6.818AspAla: 6.818 ± 0.548
0.491AspCys: 0.491 ± 0.177
4.255AspAsp: 4.255 ± 0.653
4.746AspGlu: 4.746 ± 0.65
2.127AspPhe: 2.127 ± 0.315
5.782AspGly: 5.782 ± 0.788
1.145AspHis: 1.145 ± 0.255
2.291AspIle: 2.291 ± 0.365
1.255AspLys: 1.255 ± 0.228
5.127AspLeu: 5.127 ± 0.616
1.091AspMet: 1.091 ± 0.196
1.2AspAsn: 1.2 ± 0.322
4.964AspPro: 4.964 ± 0.661
1.964AspGln: 1.964 ± 0.315
5.291AspArg: 5.291 ± 0.625
3.0AspSer: 3.0 ± 0.389
4.255AspThr: 4.255 ± 0.464
4.364AspVal: 4.364 ± 0.442
0.982AspTrp: 0.982 ± 0.296
1.418AspTyr: 1.418 ± 0.238
0.0AspXaa: 0.0 ± 0.0
Glu
5.018GluAla: 5.018 ± 0.541
0.818GluCys: 0.818 ± 0.241
3.218GluAsp: 3.218 ± 0.466
2.836GluGlu: 2.836 ± 0.571
1.745GluPhe: 1.745 ± 0.342
4.418GluGly: 4.418 ± 0.574
1.473GluHis: 1.473 ± 0.349
4.255GluIle: 4.255 ± 0.396
1.909GluLys: 1.909 ± 0.281
5.509GluLeu: 5.509 ± 0.714
1.364GluMet: 1.364 ± 0.269
1.527GluAsn: 1.527 ± 0.362
3.491GluPro: 3.491 ± 0.611
2.4GluGln: 2.4 ± 0.351
4.418GluArg: 4.418 ± 0.467
2.564GluSer: 2.564 ± 0.348
4.364GluThr: 4.364 ± 0.587
4.636GluVal: 4.636 ± 0.457
1.145GluTrp: 1.145 ± 0.235
1.2GluTyr: 1.2 ± 0.283
0.0GluXaa: 0.0 ± 0.0
Phe
3.436PheAla: 3.436 ± 0.493
0.436PheCys: 0.436 ± 0.153
2.345PheAsp: 2.345 ± 0.41
1.036PheGlu: 1.036 ± 0.262
0.655PhePhe: 0.655 ± 0.263
2.891PheGly: 2.891 ± 0.496
0.273PheHis: 0.273 ± 0.135
0.982PheIle: 0.982 ± 0.241
0.6PheLys: 0.6 ± 0.239
1.964PheLeu: 1.964 ± 0.35
0.655PheMet: 0.655 ± 0.203
0.873PheAsn: 0.873 ± 0.3
1.418PhePro: 1.418 ± 0.286
0.873PheGln: 0.873 ± 0.199
1.418PheArg: 1.418 ± 0.247
1.691PheSer: 1.691 ± 0.282
1.636PheThr: 1.636 ± 0.295
1.473PheVal: 1.473 ± 0.243
0.327PheTrp: 0.327 ± 0.147
0.327PheTyr: 0.327 ± 0.148
0.0PheXaa: 0.0 ± 0.0
Gly
9.0GlyAla: 9.0 ± 0.877
1.309GlyCys: 1.309 ± 0.414
5.127GlyAsp: 5.127 ± 0.46
4.527GlyGlu: 4.527 ± 0.529
2.455GlyPhe: 2.455 ± 0.434
10.691GlyGly: 10.691 ± 1.786
2.291GlyHis: 2.291 ± 0.53
3.873GlyIle: 3.873 ± 0.84
3.327GlyLys: 3.327 ± 0.533
6.382GlyLeu: 6.382 ± 0.889
1.8GlyMet: 1.8 ± 0.259
2.618GlyAsn: 2.618 ± 0.505
4.8GlyPro: 4.8 ± 0.507
3.818GlyGln: 3.818 ± 0.453
7.364GlyArg: 7.364 ± 1.075
5.291GlySer: 5.291 ± 0.533
6.927GlyThr: 6.927 ± 0.748
6.6GlyVal: 6.6 ± 0.499
1.909GlyTrp: 1.909 ± 0.387
2.455GlyTyr: 2.455 ± 0.412
0.0GlyXaa: 0.0 ± 0.0
His
1.309HisAla: 1.309 ± 0.288
0.109HisCys: 0.109 ± 0.07
1.364HisAsp: 1.364 ± 0.359
0.873HisGlu: 0.873 ± 0.22
0.436HisPhe: 0.436 ± 0.184
1.691HisGly: 1.691 ± 0.299
0.873HisHis: 0.873 ± 0.373
0.709HisIle: 0.709 ± 0.19
0.436HisLys: 0.436 ± 0.23
1.473HisLeu: 1.473 ± 0.369
0.436HisMet: 0.436 ± 0.141
0.6HisAsn: 0.6 ± 0.175
1.309HisPro: 1.309 ± 0.262
0.545HisGln: 0.545 ± 0.171
1.855HisArg: 1.855 ± 0.535
1.255HisSer: 1.255 ± 0.315
1.364HisThr: 1.364 ± 0.385
1.2HisVal: 1.2 ± 0.333
0.709HisTrp: 0.709 ± 0.156
0.491HisTyr: 0.491 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
6.327IleAla: 6.327 ± 0.687
0.273IleCys: 0.273 ± 0.128
3.6IleAsp: 3.6 ± 0.42
3.164IleGlu: 3.164 ± 0.498
1.309IlePhe: 1.309 ± 0.363
4.527IleGly: 4.527 ± 0.868
0.709IleHis: 0.709 ± 0.201
1.418IleIle: 1.418 ± 0.312
1.473IleLys: 1.473 ± 0.596
2.727IleLeu: 2.727 ± 0.514
0.6IleMet: 0.6 ± 0.19
1.036IleAsn: 1.036 ± 0.336
2.073IlePro: 2.073 ± 0.409
1.2IleGln: 1.2 ± 0.256
2.891IleArg: 2.891 ± 0.369
2.182IleSer: 2.182 ± 0.339
3.491IleThr: 3.491 ± 0.434
3.164IleVal: 3.164 ± 0.403
0.382IleTrp: 0.382 ± 0.131
0.6IleTyr: 0.6 ± 0.231
0.0IleXaa: 0.0 ± 0.0
Lys
4.473LysAla: 4.473 ± 0.704
0.273LysCys: 0.273 ± 0.13
1.527LysAsp: 1.527 ± 0.208
0.764LysGlu: 0.764 ± 0.206
0.709LysPhe: 0.709 ± 0.241
2.782LysGly: 2.782 ± 0.48
0.709LysHis: 0.709 ± 0.264
1.2LysIle: 1.2 ± 0.342
0.436LysLys: 0.436 ± 0.176
2.455LysLeu: 2.455 ± 0.326
0.764LysMet: 0.764 ± 0.209
0.873LysAsn: 0.873 ± 0.216
2.509LysPro: 2.509 ± 0.52
0.545LysGln: 0.545 ± 0.223
2.018LysArg: 2.018 ± 0.471
1.964LysSer: 1.964 ± 0.488
2.236LysThr: 2.236 ± 0.375
2.455LysVal: 2.455 ± 0.388
0.436LysTrp: 0.436 ± 0.163
0.982LysTyr: 0.982 ± 0.292
0.0LysXaa: 0.0 ± 0.0
Leu
10.637LeuAla: 10.637 ± 0.851
0.491LeuCys: 0.491 ± 0.164
6.055LeuAsp: 6.055 ± 0.524
3.164LeuGlu: 3.164 ± 0.519
1.582LeuPhe: 1.582 ± 0.41
7.036LeuGly: 7.036 ± 0.763
1.364LeuHis: 1.364 ± 0.324
3.927LeuIle: 3.927 ± 0.468
2.727LeuLys: 2.727 ± 0.494
5.073LeuLeu: 5.073 ± 0.606
1.364LeuMet: 1.364 ± 0.268
2.018LeuAsn: 2.018 ± 0.342
5.236LeuPro: 5.236 ± 0.661
2.345LeuGln: 2.345 ± 0.43
4.146LeuArg: 4.146 ± 0.667
3.927LeuSer: 3.927 ± 0.454
5.509LeuThr: 5.509 ± 0.577
6.273LeuVal: 6.273 ± 0.614
1.691LeuTrp: 1.691 ± 0.41
1.145LeuTyr: 1.145 ± 0.299
0.0LeuXaa: 0.0 ± 0.0
Met
2.564MetAla: 2.564 ± 0.346
0.218MetCys: 0.218 ± 0.1
1.091MetAsp: 1.091 ± 0.225
1.473MetGlu: 1.473 ± 0.389
0.709MetPhe: 0.709 ± 0.176
1.473MetGly: 1.473 ± 0.341
0.436MetHis: 0.436 ± 0.16
1.145MetIle: 1.145 ± 0.249
0.818MetLys: 0.818 ± 0.209
1.582MetLeu: 1.582 ± 0.358
0.273MetMet: 0.273 ± 0.132
0.436MetAsn: 0.436 ± 0.133
1.2MetPro: 1.2 ± 0.274
0.491MetGln: 0.491 ± 0.152
1.2MetArg: 1.2 ± 0.24
1.909MetSer: 1.909 ± 0.276
2.291MetThr: 2.291 ± 0.456
1.255MetVal: 1.255 ± 0.256
0.382MetTrp: 0.382 ± 0.134
0.545MetTyr: 0.545 ± 0.173
0.0MetXaa: 0.0 ± 0.0
Asn
2.836AsnAla: 2.836 ± 0.297
0.327AsnCys: 0.327 ± 0.154
1.473AsnAsp: 1.473 ± 0.296
1.364AsnGlu: 1.364 ± 0.247
0.818AsnPhe: 0.818 ± 0.253
3.436AsnGly: 3.436 ± 0.546
0.436AsnHis: 0.436 ± 0.14
0.873AsnIle: 0.873 ± 0.294
0.818AsnLys: 0.818 ± 0.263
1.8AsnLeu: 1.8 ± 0.471
0.327AsnMet: 0.327 ± 0.123
0.436AsnAsn: 0.436 ± 0.189
2.4AsnPro: 2.4 ± 0.417
0.273AsnGln: 0.273 ± 0.118
0.982AsnArg: 0.982 ± 0.258
2.073AsnSer: 2.073 ± 0.569
2.073AsnThr: 2.073 ± 0.394
1.964AsnVal: 1.964 ± 0.358
0.382AsnTrp: 0.382 ± 0.142
0.764AsnTyr: 0.764 ± 0.203
0.0AsnXaa: 0.0 ± 0.0
Pro
7.8ProAla: 7.8 ± 0.654
0.491ProCys: 0.491 ± 0.206
4.527ProAsp: 4.527 ± 0.556
5.564ProGlu: 5.564 ± 0.491
1.527ProPhe: 1.527 ± 0.338
5.836ProGly: 5.836 ± 0.621
0.873ProHis: 0.873 ± 0.168
2.727ProIle: 2.727 ± 0.389
1.636ProLys: 1.636 ± 0.389
3.764ProLeu: 3.764 ± 0.499
1.364ProMet: 1.364 ± 0.295
1.473ProAsn: 1.473 ± 0.3
4.2ProPro: 4.2 ± 0.741
1.527ProGln: 1.527 ± 0.251
5.127ProArg: 5.127 ± 1.071
4.473ProSer: 4.473 ± 0.605
4.855ProThr: 4.855 ± 0.642
4.8ProVal: 4.8 ± 0.496
0.982ProTrp: 0.982 ± 0.287
1.418ProTyr: 1.418 ± 0.273
0.0ProXaa: 0.0 ± 0.0
Gln
3.818GlnAla: 3.818 ± 0.542
0.164GlnCys: 0.164 ± 0.101
1.527GlnAsp: 1.527 ± 0.283
1.091GlnGlu: 1.091 ± 0.234
1.036GlnPhe: 1.036 ± 0.212
2.782GlnGly: 2.782 ± 0.431
0.545GlnHis: 0.545 ± 0.143
1.309GlnIle: 1.309 ± 0.304
0.491GlnLys: 0.491 ± 0.161
3.927GlnLeu: 3.927 ± 0.464
0.982GlnMet: 0.982 ± 0.276
0.873GlnAsn: 0.873 ± 0.433
2.127GlnPro: 2.127 ± 0.324
1.745GlnGln: 1.745 ± 0.269
2.564GlnArg: 2.564 ± 0.379
1.418GlnSer: 1.418 ± 0.279
2.236GlnThr: 2.236 ± 0.333
3.327GlnVal: 3.327 ± 0.443
0.436GlnTrp: 0.436 ± 0.14
0.982GlnTyr: 0.982 ± 0.168
0.0GlnXaa: 0.0 ± 0.0
Arg
8.018ArgAla: 8.018 ± 0.924
1.255ArgCys: 1.255 ± 0.305
4.255ArgAsp: 4.255 ± 0.639
4.418ArgGlu: 4.418 ± 0.537
1.473ArgPhe: 1.473 ± 0.33
6.055ArgGly: 6.055 ± 0.779
1.473ArgHis: 1.473 ± 0.29
2.727ArgIle: 2.727 ± 0.335
1.745ArgLys: 1.745 ± 0.334
5.782ArgLeu: 5.782 ± 0.672
1.964ArgMet: 1.964 ± 0.4
1.527ArgAsn: 1.527 ± 0.282
4.309ArgPro: 4.309 ± 0.837
2.073ArgGln: 2.073 ± 0.412
8.727ArgArg: 8.727 ± 1.884
3.873ArgSer: 3.873 ± 0.401
4.527ArgThr: 4.527 ± 0.671
5.018ArgVal: 5.018 ± 0.614
1.473ArgTrp: 1.473 ± 0.32
1.909ArgTyr: 1.909 ± 0.296
0.0ArgXaa: 0.0 ± 0.0
Ser
5.836SerAla: 5.836 ± 0.672
0.491SerCys: 0.491 ± 0.183
3.436SerAsp: 3.436 ± 0.437
2.345SerGlu: 2.345 ± 0.414
1.036SerPhe: 1.036 ± 0.357
5.564SerGly: 5.564 ± 0.714
0.818SerHis: 0.818 ± 0.203
2.345SerIle: 2.345 ± 0.34
2.291SerLys: 2.291 ± 0.436
4.2SerLeu: 4.2 ± 0.539
1.691SerMet: 1.691 ± 0.278
1.036SerAsn: 1.036 ± 0.224
3.818SerPro: 3.818 ± 0.517
1.691SerGln: 1.691 ± 0.226
4.036SerArg: 4.036 ± 0.572
3.709SerSer: 3.709 ± 0.535
4.255SerThr: 4.255 ± 0.512
4.636SerVal: 4.636 ± 0.543
1.636SerTrp: 1.636 ± 0.298
1.309SerTyr: 1.309 ± 0.246
0.0SerXaa: 0.0 ± 0.0
Thr
8.509ThrAla: 8.509 ± 0.82
0.491ThrCys: 0.491 ± 0.199
3.436ThrAsp: 3.436 ± 0.371
4.691ThrGlu: 4.691 ± 0.48
1.582ThrPhe: 1.582 ± 0.31
7.473ThrGly: 7.473 ± 0.578
0.818ThrHis: 0.818 ± 0.244
3.327ThrIle: 3.327 ± 0.357
2.4ThrLys: 2.4 ± 0.434
5.291ThrLeu: 5.291 ± 0.437
1.309ThrMet: 1.309 ± 0.313
1.582ThrAsn: 1.582 ± 0.284
6.055ThrPro: 6.055 ± 0.621
1.855ThrGln: 1.855 ± 0.371
3.546ThrArg: 3.546 ± 0.552
4.091ThrSer: 4.091 ± 0.587
5.455ThrThr: 5.455 ± 0.605
6.655ThrVal: 6.655 ± 0.614
1.255ThrTrp: 1.255 ± 0.251
1.418ThrTyr: 1.418 ± 0.334
0.0ThrXaa: 0.0 ± 0.0
Val
8.946ValAla: 8.946 ± 0.808
0.709ValCys: 0.709 ± 0.227
4.8ValAsp: 4.8 ± 0.786
4.855ValGlu: 4.855 ± 0.564
2.073ValPhe: 2.073 ± 0.491
6.873ValGly: 6.873 ± 0.678
1.855ValHis: 1.855 ± 0.36
3.927ValIle: 3.927 ± 0.574
2.727ValLys: 2.727 ± 0.351
5.727ValLeu: 5.727 ± 0.716
1.964ValMet: 1.964 ± 0.332
2.782ValAsn: 2.782 ± 0.404
5.073ValPro: 5.073 ± 0.568
2.891ValGln: 2.891 ± 0.476
5.182ValArg: 5.182 ± 0.741
3.327ValSer: 3.327 ± 0.499
4.473ValThr: 4.473 ± 0.608
6.6ValVal: 6.6 ± 0.832
1.636ValTrp: 1.636 ± 0.291
1.255ValTyr: 1.255 ± 0.233
0.0ValXaa: 0.0 ± 0.0
Trp
2.182TrpAla: 2.182 ± 0.357
0.436TrpCys: 0.436 ± 0.149
1.036TrpAsp: 1.036 ± 0.216
1.309TrpGlu: 1.309 ± 0.424
0.6TrpPhe: 0.6 ± 0.232
0.982TrpGly: 0.982 ± 0.269
0.327TrpHis: 0.327 ± 0.131
0.709TrpIle: 0.709 ± 0.224
0.436TrpLys: 0.436 ± 0.147
2.236TrpLeu: 2.236 ± 0.474
0.109TrpMet: 0.109 ± 0.084
0.764TrpAsn: 0.764 ± 0.352
0.818TrpPro: 0.818 ± 0.281
0.818TrpGln: 0.818 ± 0.163
1.091TrpArg: 1.091 ± 0.235
1.8TrpSer: 1.8 ± 0.34
1.691TrpThr: 1.691 ± 0.297
1.036TrpVal: 1.036 ± 0.27
0.436TrpTrp: 0.436 ± 0.188
0.382TrpTyr: 0.382 ± 0.128
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.673TyrAla: 2.673 ± 0.415
0.273TyrCys: 0.273 ± 0.135
1.745TyrAsp: 1.745 ± 0.271
1.473TyrGlu: 1.473 ± 0.402
0.545TyrPhe: 0.545 ± 0.209
1.691TyrGly: 1.691 ± 0.295
0.327TyrHis: 0.327 ± 0.116
0.873TyrIle: 0.873 ± 0.233
0.545TyrLys: 0.545 ± 0.159
1.855TyrLeu: 1.855 ± 0.358
0.436TyrMet: 0.436 ± 0.195
0.491TyrAsn: 0.491 ± 0.18
0.873TyrPro: 0.873 ± 0.219
0.927TyrGln: 0.927 ± 0.262
1.964TyrArg: 1.964 ± 0.337
0.655TyrSer: 0.655 ± 0.191
1.473TyrThr: 1.473 ± 0.335
2.127TyrVal: 2.127 ± 0.398
0.218TyrTrp: 0.218 ± 0.138
0.273TyrTyr: 0.273 ± 0.131
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 78 proteins (18334 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski