Amino acid dipepetide frequency for Mycobacterium phage Qyrzula

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.819AlaAla: 17.819 ± 1.355
1.408AlaCys: 1.408 ± 0.335
7.768AlaAsp: 7.768 ± 0.667
7.866AlaGlu: 7.866 ± 0.736
3.496AlaPhe: 3.496 ± 0.472
10.099AlaGly: 10.099 ± 0.977
2.622AlaHis: 2.622 ± 0.435
5.292AlaIle: 5.292 ± 0.755
3.981AlaLys: 3.981 ± 0.569
11.167AlaLeu: 11.167 ± 0.681
2.913AlaMet: 2.913 ± 0.359
4.321AlaAsn: 4.321 ± 0.432
7.332AlaPro: 7.332 ± 0.546
4.37AlaGln: 4.37 ± 0.396
8.982AlaArg: 8.982 ± 0.835
6.943AlaSer: 6.943 ± 0.641
8.74AlaThr: 8.74 ± 0.646
10.585AlaVal: 10.585 ± 0.656
2.233AlaTrp: 2.233 ± 0.303
2.913AlaTyr: 2.913 ± 0.394
0.0AlaXaa: 0.0 ± 0.0
Cys
1.068CysAla: 1.068 ± 0.274
0.194CysCys: 0.194 ± 0.131
0.777CysAsp: 0.777 ± 0.225
0.68CysGlu: 0.68 ± 0.173
0.097CysPhe: 0.097 ± 0.101
1.165CysGly: 1.165 ± 0.288
0.194CysHis: 0.194 ± 0.11
0.243CysIle: 0.243 ± 0.094
0.146CysLys: 0.146 ± 0.077
0.583CysLeu: 0.583 ± 0.167
0.194CysMet: 0.194 ± 0.095
0.34CysAsn: 0.34 ± 0.111
0.728CysPro: 0.728 ± 0.192
0.243CysGln: 0.243 ± 0.104
0.728CysArg: 0.728 ± 0.24
0.291CysSer: 0.291 ± 0.118
0.534CysThr: 0.534 ± 0.202
0.825CysVal: 0.825 ± 0.187
0.34CysTrp: 0.34 ± 0.189
0.243CysTyr: 0.243 ± 0.106
0.0CysXaa: 0.0 ± 0.0
Asp
7.283AspAla: 7.283 ± 0.594
0.631AspCys: 0.631 ± 0.155
4.613AspAsp: 4.613 ± 0.537
3.641AspGlu: 3.641 ± 0.477
1.651AspPhe: 1.651 ± 0.265
5.681AspGly: 5.681 ± 0.47
1.311AspHis: 1.311 ± 0.251
1.991AspIle: 1.991 ± 0.285
2.039AspLys: 2.039 ± 0.274
6.118AspLeu: 6.118 ± 0.477
1.748AspMet: 1.748 ± 0.257
1.748AspAsn: 1.748 ± 0.322
4.127AspPro: 4.127 ± 0.371
1.894AspGln: 1.894 ± 0.309
3.884AspArg: 3.884 ± 0.426
2.816AspSer: 2.816 ± 0.387
4.418AspThr: 4.418 ± 0.369
4.078AspVal: 4.078 ± 0.447
1.262AspTrp: 1.262 ± 0.251
1.359AspTyr: 1.359 ± 0.321
0.0AspXaa: 0.0 ± 0.0
Glu
8.205GluAla: 8.205 ± 0.851
0.534GluCys: 0.534 ± 0.145
3.399GluAsp: 3.399 ± 0.471
2.573GluGlu: 2.573 ± 0.35
2.233GluPhe: 2.233 ± 0.298
3.593GluGly: 3.593 ± 0.387
1.554GluHis: 1.554 ± 0.275
2.233GluIle: 2.233 ± 0.313
1.214GluLys: 1.214 ± 0.271
6.652GluLeu: 6.652 ± 0.661
0.971GluMet: 0.971 ± 0.195
1.311GluAsn: 1.311 ± 0.275
3.107GluPro: 3.107 ± 0.434
2.816GluGln: 2.816 ± 0.367
4.321GluArg: 4.321 ± 0.491
2.088GluSer: 2.088 ± 0.327
2.768GluThr: 2.768 ± 0.378
5.778GluVal: 5.778 ± 0.507
1.311GluTrp: 1.311 ± 0.222
1.262GluTyr: 1.262 ± 0.242
0.0GluXaa: 0.0 ± 0.0
Phe
4.078PheAla: 4.078 ± 0.502
0.049PheCys: 0.049 ± 0.047
1.991PheAsp: 1.991 ± 0.324
1.651PheGlu: 1.651 ± 0.334
0.583PhePhe: 0.583 ± 0.18
2.525PheGly: 2.525 ± 0.471
0.486PheHis: 0.486 ± 0.143
0.923PheIle: 0.923 ± 0.249
0.68PheLys: 0.68 ± 0.197
1.942PheLeu: 1.942 ± 0.343
0.388PheMet: 0.388 ± 0.159
0.971PheAsn: 0.971 ± 0.203
1.117PhePro: 1.117 ± 0.18
1.117PheGln: 1.117 ± 0.172
1.894PheArg: 1.894 ± 0.343
1.262PheSer: 1.262 ± 0.241
1.748PheThr: 1.748 ± 0.276
1.894PheVal: 1.894 ± 0.27
0.486PheTrp: 0.486 ± 0.158
0.388PheTyr: 0.388 ± 0.117
0.0PheXaa: 0.0 ± 0.0
Gly
8.837GlyAla: 8.837 ± 1.008
0.631GlyCys: 0.631 ± 0.181
5.292GlyAsp: 5.292 ± 0.46
5.292GlyGlu: 5.292 ± 0.488
2.136GlyPhe: 2.136 ± 0.42
10.779GlyGly: 10.779 ± 2.184
1.894GlyHis: 1.894 ± 0.28
3.739GlyIle: 3.739 ± 0.42
2.962GlyLys: 2.962 ± 0.435
7.866GlyLeu: 7.866 ± 0.718
1.942GlyMet: 1.942 ± 0.279
2.962GlyAsn: 2.962 ± 0.385
4.273GlyPro: 4.273 ± 0.566
3.933GlyGln: 3.933 ± 0.563
4.515GlyArg: 4.515 ± 0.56
4.03GlySer: 4.03 ± 0.505
7.72GlyThr: 7.72 ± 0.62
6.846GlyVal: 6.846 ± 0.658
2.136GlyTrp: 2.136 ± 0.325
2.525GlyTyr: 2.525 ± 0.401
0.0GlyXaa: 0.0 ± 0.0
His
2.088HisAla: 2.088 ± 0.43
0.194HisCys: 0.194 ± 0.101
1.117HisAsp: 1.117 ± 0.248
1.214HisGlu: 1.214 ± 0.236
0.583HisPhe: 0.583 ± 0.179
1.796HisGly: 1.796 ± 0.332
0.68HisHis: 0.68 ± 0.168
0.68HisIle: 0.68 ± 0.16
0.388HisLys: 0.388 ± 0.141
2.185HisLeu: 2.185 ± 0.532
0.437HisMet: 0.437 ± 0.145
0.728HisAsn: 0.728 ± 0.21
1.117HisPro: 1.117 ± 0.262
0.583HisGln: 0.583 ± 0.183
1.554HisArg: 1.554 ± 0.301
1.068HisSer: 1.068 ± 0.263
1.699HisThr: 1.699 ± 0.343
1.311HisVal: 1.311 ± 0.246
0.825HisTrp: 0.825 ± 0.205
0.534HisTyr: 0.534 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
5.632IleAla: 5.632 ± 0.45
0.194IleCys: 0.194 ± 0.106
3.107IleAsp: 3.107 ± 0.323
2.428IleGlu: 2.428 ± 0.339
0.486IlePhe: 0.486 ± 0.14
3.447IleGly: 3.447 ± 0.338
0.631IleHis: 0.631 ± 0.155
1.117IleIle: 1.117 ± 0.302
1.408IleLys: 1.408 ± 0.267
2.719IleLeu: 2.719 ± 0.347
0.583IleMet: 0.583 ± 0.176
1.359IleAsn: 1.359 ± 0.231
1.554IlePro: 1.554 ± 0.255
1.165IleGln: 1.165 ± 0.262
1.894IleArg: 1.894 ± 0.319
2.331IleSer: 2.331 ± 0.403
3.107IleThr: 3.107 ± 0.437
2.768IleVal: 2.768 ± 0.417
0.243IleTrp: 0.243 ± 0.108
0.437IleTyr: 0.437 ± 0.139
0.0IleXaa: 0.0 ± 0.0
Lys
4.273LysAla: 4.273 ± 0.644
0.388LysCys: 0.388 ± 0.145
1.359LysAsp: 1.359 ± 0.29
1.117LysGlu: 1.117 ± 0.24
0.583LysPhe: 0.583 ± 0.162
2.428LysGly: 2.428 ± 0.357
0.971LysHis: 0.971 ± 0.233
0.728LysIle: 0.728 ± 0.242
0.971LysLys: 0.971 ± 0.28
2.719LysLeu: 2.719 ± 0.329
0.534LysMet: 0.534 ± 0.135
0.874LysAsn: 0.874 ± 0.23
0.971LysPro: 0.971 ± 0.256
0.583LysGln: 0.583 ± 0.189
2.088LysArg: 2.088 ± 0.379
1.311LysSer: 1.311 ± 0.289
1.796LysThr: 1.796 ± 0.324
2.816LysVal: 2.816 ± 0.357
0.534LysTrp: 0.534 ± 0.149
0.631LysTyr: 0.631 ± 0.14
0.0LysXaa: 0.0 ± 0.0
Leu
13.935LeuAla: 13.935 ± 0.74
0.631LeuCys: 0.631 ± 0.186
5.195LeuAsp: 5.195 ± 0.676
4.904LeuGlu: 4.904 ± 0.569
1.845LeuPhe: 1.845 ± 0.312
7.137LeuGly: 7.137 ± 0.746
1.262LeuHis: 1.262 ± 0.327
3.641LeuIle: 3.641 ± 0.485
1.894LeuLys: 1.894 ± 0.385
7.137LeuLeu: 7.137 ± 0.534
1.991LeuMet: 1.991 ± 0.309
2.136LeuAsn: 2.136 ± 0.342
5.778LeuPro: 5.778 ± 0.461
2.379LeuGln: 2.379 ± 0.359
5.875LeuArg: 5.875 ± 0.581
4.564LeuSer: 4.564 ± 0.514
5.244LeuThr: 5.244 ± 0.6
6.652LeuVal: 6.652 ± 0.478
1.408LeuTrp: 1.408 ± 0.274
1.554LeuTyr: 1.554 ± 0.259
0.0LeuXaa: 0.0 ± 0.0
Met
3.593MetAla: 3.593 ± 0.416
0.146MetCys: 0.146 ± 0.073
1.117MetAsp: 1.117 ± 0.213
0.728MetGlu: 0.728 ± 0.197
0.486MetPhe: 0.486 ± 0.162
0.874MetGly: 0.874 ± 0.209
0.68MetHis: 0.68 ± 0.189
0.583MetIle: 0.583 ± 0.15
0.388MetLys: 0.388 ± 0.118
1.894MetLeu: 1.894 ± 0.259
0.291MetMet: 0.291 ± 0.135
0.534MetAsn: 0.534 ± 0.145
1.942MetPro: 1.942 ± 0.322
0.68MetGln: 0.68 ± 0.178
1.214MetArg: 1.214 ± 0.262
1.942MetSer: 1.942 ± 0.301
1.942MetThr: 1.942 ± 0.344
1.748MetVal: 1.748 ± 0.27
0.243MetTrp: 0.243 ± 0.139
0.631MetTyr: 0.631 ± 0.181
0.0MetXaa: 0.0 ± 0.0
Asn
4.127AsnAla: 4.127 ± 0.389
0.291AsnCys: 0.291 ± 0.12
1.845AsnAsp: 1.845 ± 0.291
1.311AsnGlu: 1.311 ± 0.258
0.631AsnPhe: 0.631 ± 0.145
3.544AsnGly: 3.544 ± 0.387
0.534AsnHis: 0.534 ± 0.195
1.262AsnIle: 1.262 ± 0.263
0.583AsnLys: 0.583 ± 0.172
2.331AsnLeu: 2.331 ± 0.306
0.631AsnMet: 0.631 ± 0.166
0.583AsnAsn: 0.583 ± 0.159
1.845AsnPro: 1.845 ± 0.289
1.068AsnGln: 1.068 ± 0.2
1.651AsnArg: 1.651 ± 0.299
1.554AsnSer: 1.554 ± 0.293
1.991AsnThr: 1.991 ± 0.35
1.991AsnVal: 1.991 ± 0.339
0.631AsnTrp: 0.631 ± 0.172
0.437AsnTyr: 0.437 ± 0.143
0.0AsnXaa: 0.0 ± 0.0
Pro
6.506ProAla: 6.506 ± 0.577
0.388ProCys: 0.388 ± 0.126
3.884ProAsp: 3.884 ± 0.444
5.098ProGlu: 5.098 ± 0.488
1.262ProPhe: 1.262 ± 0.226
6.603ProGly: 6.603 ± 0.636
1.117ProHis: 1.117 ± 0.359
1.651ProIle: 1.651 ± 0.32
1.699ProLys: 1.699 ± 0.312
3.787ProLeu: 3.787 ± 0.487
0.825ProMet: 0.825 ± 0.179
1.699ProAsn: 1.699 ± 0.31
5.098ProPro: 5.098 ± 0.666
1.699ProGln: 1.699 ± 0.333
2.67ProArg: 2.67 ± 0.357
3.059ProSer: 3.059 ± 0.395
4.661ProThr: 4.661 ± 0.481
6.263ProVal: 6.263 ± 0.459
0.971ProTrp: 0.971 ± 0.251
1.068ProTyr: 1.068 ± 0.195
0.0ProXaa: 0.0 ± 0.0
Gln
5.584GlnAla: 5.584 ± 0.503
0.146GlnCys: 0.146 ± 0.097
1.602GlnAsp: 1.602 ± 0.291
1.699GlnGlu: 1.699 ± 0.264
1.311GlnPhe: 1.311 ± 0.245
2.428GlnGly: 2.428 ± 0.308
0.583GlnHis: 0.583 ± 0.178
1.214GlnIle: 1.214 ± 0.227
0.728GlnLys: 0.728 ± 0.167
3.787GlnLeu: 3.787 ± 0.46
1.02GlnMet: 1.02 ± 0.226
0.631GlnAsn: 0.631 ± 0.189
1.748GlnPro: 1.748 ± 0.268
1.991GlnGln: 1.991 ± 0.329
2.719GlnArg: 2.719 ± 0.28
1.359GlnSer: 1.359 ± 0.301
1.651GlnThr: 1.651 ± 0.302
3.01GlnVal: 3.01 ± 0.391
1.02GlnTrp: 1.02 ± 0.223
0.728GlnTyr: 0.728 ± 0.204
0.0GlnXaa: 0.0 ± 0.0
Arg
8.254ArgAla: 8.254 ± 0.587
1.068ArgCys: 1.068 ± 0.291
3.933ArgAsp: 3.933 ± 0.422
4.078ArgGlu: 4.078 ± 0.574
1.845ArgPhe: 1.845 ± 0.312
5.244ArgGly: 5.244 ± 0.626
1.894ArgHis: 1.894 ± 0.323
2.719ArgIle: 2.719 ± 0.34
1.505ArgLys: 1.505 ± 0.282
5.875ArgLeu: 5.875 ± 0.48
2.039ArgMet: 2.039 ± 0.358
2.136ArgAsn: 2.136 ± 0.302
3.107ArgPro: 3.107 ± 0.419
2.428ArgGln: 2.428 ± 0.352
5.972ArgArg: 5.972 ± 0.771
2.768ArgSer: 2.768 ± 0.33
4.078ArgThr: 4.078 ± 0.399
4.321ArgVal: 4.321 ± 0.482
1.408ArgTrp: 1.408 ± 0.289
2.039ArgTyr: 2.039 ± 0.348
0.0ArgXaa: 0.0 ± 0.0
Ser
6.021SerAla: 6.021 ± 0.642
0.534SerCys: 0.534 ± 0.151
3.593SerAsp: 3.593 ± 0.411
2.67SerGlu: 2.67 ± 0.401
1.602SerPhe: 1.602 ± 0.405
6.263SerGly: 6.263 ± 0.552
0.437SerHis: 0.437 ± 0.147
1.942SerIle: 1.942 ± 0.295
1.748SerLys: 1.748 ± 0.328
3.593SerLeu: 3.593 ± 0.554
1.505SerMet: 1.505 ± 0.26
1.651SerAsn: 1.651 ± 0.288
2.816SerPro: 2.816 ± 0.423
1.359SerGln: 1.359 ± 0.25
2.962SerArg: 2.962 ± 0.404
2.233SerSer: 2.233 ± 0.373
3.205SerThr: 3.205 ± 0.32
4.321SerVal: 4.321 ± 0.495
1.068SerTrp: 1.068 ± 0.241
0.971SerTyr: 0.971 ± 0.236
0.0SerXaa: 0.0 ± 0.0
Thr
7.768ThrAla: 7.768 ± 0.6
0.631ThrCys: 0.631 ± 0.204
3.641ThrAsp: 3.641 ± 0.411
3.593ThrGlu: 3.593 ± 0.437
2.088ThrPhe: 2.088 ± 0.332
7.04ThrGly: 7.04 ± 0.612
1.457ThrHis: 1.457 ± 0.284
3.059ThrIle: 3.059 ± 0.385
2.525ThrLys: 2.525 ± 0.386
5.05ThrLeu: 5.05 ± 0.572
1.311ThrMet: 1.311 ± 0.239
1.602ThrAsn: 1.602 ± 0.266
5.681ThrPro: 5.681 ± 0.624
2.185ThrGln: 2.185 ± 0.384
3.884ThrArg: 3.884 ± 0.468
4.176ThrSer: 4.176 ± 0.655
4.564ThrThr: 4.564 ± 0.489
5.195ThrVal: 5.195 ± 0.554
1.699ThrTrp: 1.699 ± 0.352
1.554ThrTyr: 1.554 ± 0.232
0.0ThrXaa: 0.0 ± 0.0
Val
11.313ValAla: 11.313 ± 0.853
0.825ValCys: 0.825 ± 0.176
5.244ValAsp: 5.244 ± 0.467
4.758ValGlu: 4.758 ± 0.522
1.796ValPhe: 1.796 ± 0.279
6.118ValGly: 6.118 ± 0.566
1.457ValHis: 1.457 ± 0.277
2.476ValIle: 2.476 ± 0.335
1.651ValLys: 1.651 ± 0.262
6.312ValLeu: 6.312 ± 0.451
1.554ValMet: 1.554 ± 0.314
1.845ValAsn: 1.845 ± 0.26
5.438ValPro: 5.438 ± 0.61
2.913ValGln: 2.913 ± 0.415
6.166ValArg: 6.166 ± 0.649
4.758ValSer: 4.758 ± 0.587
5.632ValThr: 5.632 ± 0.539
6.021ValVal: 6.021 ± 0.615
1.117ValTrp: 1.117 ± 0.233
1.942ValTyr: 1.942 ± 0.353
0.0ValXaa: 0.0 ± 0.0
Trp
2.185TrpAla: 2.185 ± 0.377
0.437TrpCys: 0.437 ± 0.148
1.117TrpAsp: 1.117 ± 0.173
1.359TrpGlu: 1.359 ± 0.226
0.825TrpPhe: 0.825 ± 0.231
1.068TrpGly: 1.068 ± 0.286
0.728TrpHis: 0.728 ± 0.199
0.534TrpIle: 0.534 ± 0.121
0.437TrpLys: 0.437 ± 0.129
1.602TrpLeu: 1.602 ± 0.3
0.34TrpMet: 0.34 ± 0.13
0.728TrpAsn: 0.728 ± 0.185
1.214TrpPro: 1.214 ± 0.226
0.728TrpGln: 0.728 ± 0.177
1.262TrpArg: 1.262 ± 0.24
1.408TrpSer: 1.408 ± 0.3
1.214TrpThr: 1.214 ± 0.271
1.359TrpVal: 1.359 ± 0.301
0.534TrpTrp: 0.534 ± 0.186
0.583TrpTyr: 0.583 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.282TyrAla: 2.282 ± 0.386
0.291TyrCys: 0.291 ± 0.1
1.699TyrAsp: 1.699 ± 0.264
1.262TyrGlu: 1.262 ± 0.198
0.631TyrPhe: 0.631 ± 0.197
2.476TyrGly: 2.476 ± 0.315
0.291TyrHis: 0.291 ± 0.135
0.583TyrIle: 0.583 ± 0.162
0.728TyrLys: 0.728 ± 0.18
1.845TyrLeu: 1.845 ± 0.341
0.388TyrMet: 0.388 ± 0.144
0.68TyrAsn: 0.68 ± 0.176
1.068TyrPro: 1.068 ± 0.222
0.825TyrGln: 0.825 ± 0.223
2.476TyrArg: 2.476 ± 0.368
0.534TyrSer: 0.534 ± 0.141
1.991TyrThr: 1.991 ± 0.277
1.554TyrVal: 1.554 ± 0.262
0.194TyrTrp: 0.194 ± 0.079
0.437TyrTyr: 0.437 ± 0.174
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 81 proteins (20597 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski