Amino acid dipepetide frequency for Microcystis phage Mic1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.141AlaAla: 2.141 ± 0.525
0.42AlaCys: 0.42 ± 0.137
2.141AlaAsp: 2.141 ± 0.241
3.652AlaGlu: 3.652 ± 0.563
2.854AlaPhe: 2.854 ± 0.336
3.274AlaGly: 3.274 ± 0.407
0.839AlaHis: 0.839 ± 0.234
5.247AlaIle: 5.247 ± 0.476
4.995AlaLys: 4.995 ± 0.651
6.842AlaLeu: 6.842 ± 0.754
1.091AlaMet: 1.091 ± 0.338
3.442AlaAsn: 3.442 ± 0.476
1.889AlaPro: 1.889 ± 0.323
2.015AlaGln: 2.015 ± 0.445
2.225AlaArg: 2.225 ± 0.312
4.239AlaSer: 4.239 ± 0.408
3.4AlaThr: 3.4 ± 0.564
3.442AlaVal: 3.442 ± 0.435
0.798AlaTrp: 0.798 ± 0.22
1.973AlaTyr: 1.973 ± 0.278
0.0AlaXaa: 0.0 ± 0.0
Cys
0.881CysAla: 0.881 ± 0.259
0.084CysCys: 0.084 ± 0.052
0.42CysAsp: 0.42 ± 0.162
0.714CysGlu: 0.714 ± 0.226
0.504CysPhe: 0.504 ± 0.118
0.462CysGly: 0.462 ± 0.158
0.168CysHis: 0.168 ± 0.077
0.63CysIle: 0.63 ± 0.156
0.63CysLys: 0.63 ± 0.23
1.049CysLeu: 1.049 ± 0.217
0.126CysMet: 0.126 ± 0.077
0.504CysAsn: 0.504 ± 0.143
0.714CysPro: 0.714 ± 0.23
0.588CysGln: 0.588 ± 0.2
0.546CysArg: 0.546 ± 0.173
0.798CysSer: 0.798 ± 0.167
0.504CysThr: 0.504 ± 0.147
0.756CysVal: 0.756 ± 0.2
0.042CysTrp: 0.042 ± 0.046
0.672CysTyr: 0.672 ± 0.188
0.0CysXaa: 0.0 ± 0.0
Asp
2.644AspAla: 2.644 ± 0.376
0.798AspCys: 0.798 ± 0.209
2.518AspAsp: 2.518 ± 0.318
2.938AspGlu: 2.938 ± 0.46
3.484AspPhe: 3.484 ± 0.376
2.938AspGly: 2.938 ± 0.287
0.462AspHis: 0.462 ± 0.132
4.03AspIle: 4.03 ± 0.386
4.365AspLys: 4.365 ± 0.407
5.751AspLeu: 5.751 ± 0.536
1.175AspMet: 1.175 ± 0.185
3.022AspAsn: 3.022 ± 0.335
1.259AspPro: 1.259 ± 0.252
0.881AspGln: 0.881 ± 0.213
1.553AspArg: 1.553 ± 0.214
4.155AspSer: 4.155 ± 0.501
3.106AspThr: 3.106 ± 0.321
3.148AspVal: 3.148 ± 0.369
0.965AspTrp: 0.965 ± 0.268
2.476AspTyr: 2.476 ± 0.293
0.0AspXaa: 0.0 ± 0.0
Glu
4.155GluAla: 4.155 ± 0.653
0.965GluCys: 0.965 ± 0.239
3.568GluAsp: 3.568 ± 0.554
4.365GluGlu: 4.365 ± 0.82
3.106GluPhe: 3.106 ± 0.307
3.862GluGly: 3.862 ± 0.433
0.965GluHis: 0.965 ± 0.253
5.541GluIle: 5.541 ± 0.368
4.743GluLys: 4.743 ± 0.563
7.01GluLeu: 7.01 ± 0.649
0.839GluMet: 0.839 ± 0.166
4.827GluAsn: 4.827 ± 0.517
1.469GluPro: 1.469 ± 0.256
2.77GluGln: 2.77 ± 0.37
2.476GluArg: 2.476 ± 0.303
4.533GluSer: 4.533 ± 0.455
4.281GluThr: 4.281 ± 0.47
4.323GluVal: 4.323 ± 0.369
0.965GluTrp: 0.965 ± 0.212
2.518GluTyr: 2.518 ± 0.375
0.0GluXaa: 0.0 ± 0.0
Phe
2.393PheAla: 2.393 ± 0.366
0.546PheCys: 0.546 ± 0.147
2.98PheAsp: 2.98 ± 0.407
3.988PheGlu: 3.988 ± 0.375
2.267PhePhe: 2.267 ± 0.282
2.98PheGly: 2.98 ± 0.407
1.049PheHis: 1.049 ± 0.244
2.896PheIle: 2.896 ± 0.434
3.652PheLys: 3.652 ± 0.339
4.617PheLeu: 4.617 ± 0.614
1.007PheMet: 1.007 ± 0.199
3.736PheAsn: 3.736 ± 0.373
2.057PhePro: 2.057 ± 0.343
2.141PheGln: 2.141 ± 0.331
2.015PheArg: 2.015 ± 0.224
4.323PheSer: 4.323 ± 0.415
2.854PheThr: 2.854 ± 0.428
3.148PheVal: 3.148 ± 0.34
1.007PheTrp: 1.007 ± 0.252
1.595PheTyr: 1.595 ± 0.342
0.0PheXaa: 0.0 ± 0.0
Gly
2.98GlyAla: 2.98 ± 0.617
0.546GlyCys: 0.546 ± 0.154
3.19GlyAsp: 3.19 ± 0.321
3.694GlyGlu: 3.694 ± 0.36
3.778GlyPhe: 3.778 ± 0.389
4.617GlyGly: 4.617 ± 0.862
0.756GlyHis: 0.756 ± 0.151
4.03GlyIle: 4.03 ± 0.464
4.701GlyLys: 4.701 ± 0.565
6.128GlyLeu: 6.128 ± 0.739
1.343GlyMet: 1.343 ± 0.374
2.896GlyAsn: 2.896 ± 0.368
0.042GlyPro: 0.042 ± 0.039
2.896GlyGln: 2.896 ± 0.448
2.183GlyArg: 2.183 ± 0.285
4.155GlySer: 4.155 ± 0.446
4.239GlyThr: 4.239 ± 0.42
3.82GlyVal: 3.82 ± 0.472
1.049GlyTrp: 1.049 ± 0.19
2.225GlyTyr: 2.225 ± 0.275
0.0GlyXaa: 0.0 ± 0.0
His
0.588HisAla: 0.588 ± 0.144
0.21HisCys: 0.21 ± 0.114
0.588HisAsp: 0.588 ± 0.157
0.378HisGlu: 0.378 ± 0.124
0.798HisPhe: 0.798 ± 0.166
0.504HisGly: 0.504 ± 0.151
0.252HisHis: 0.252 ± 0.094
1.133HisIle: 1.133 ± 0.234
1.049HisLys: 1.049 ± 0.258
1.973HisLeu: 1.973 ± 0.369
0.336HisMet: 0.336 ± 0.133
0.714HisAsn: 0.714 ± 0.251
1.217HisPro: 1.217 ± 0.223
0.714HisGln: 0.714 ± 0.151
0.923HisArg: 0.923 ± 0.239
1.217HisSer: 1.217 ± 0.237
0.839HisThr: 0.839 ± 0.174
0.462HisVal: 0.462 ± 0.157
0.462HisTrp: 0.462 ± 0.146
0.588HisTyr: 0.588 ± 0.131
0.0HisXaa: 0.0 ± 0.0
Ile
4.323IleAla: 4.323 ± 0.419
0.672IleCys: 0.672 ± 0.192
3.988IleAsp: 3.988 ± 0.377
4.995IleGlu: 4.995 ± 0.52
3.19IlePhe: 3.19 ± 0.358
3.19IleGly: 3.19 ± 0.37
0.839IleHis: 0.839 ± 0.169
3.736IleIle: 3.736 ± 0.458
5.625IleLys: 5.625 ± 0.469
6.422IleLeu: 6.422 ± 0.602
1.091IleMet: 1.091 ± 0.169
3.652IleAsn: 3.652 ± 0.36
3.652IlePro: 3.652 ± 0.335
3.442IleGln: 3.442 ± 0.446
2.896IleArg: 2.896 ± 0.292
6.338IleSer: 6.338 ± 0.622
3.568IleThr: 3.568 ± 0.446
3.358IleVal: 3.358 ± 0.354
0.63IleTrp: 0.63 ± 0.163
1.889IleTyr: 1.889 ± 0.376
0.0IleXaa: 0.0 ± 0.0
Lys
5.667LysAla: 5.667 ± 0.608
0.42LysCys: 0.42 ± 0.163
4.113LysAsp: 4.113 ± 0.501
4.827LysGlu: 4.827 ± 0.59
2.812LysPhe: 2.812 ± 0.374
3.778LysGly: 3.778 ± 0.452
0.965LysHis: 0.965 ± 0.248
4.995LysIle: 4.995 ± 0.593
4.743LysLys: 4.743 ± 0.556
7.681LysLeu: 7.681 ± 0.746
1.175LysMet: 1.175 ± 0.218
4.701LysAsn: 4.701 ± 0.563
2.351LysPro: 2.351 ± 0.3
3.442LysGln: 3.442 ± 0.53
2.518LysArg: 2.518 ± 0.314
4.995LysSer: 4.995 ± 0.482
4.743LysThr: 4.743 ± 0.474
3.862LysVal: 3.862 ± 0.517
0.798LysTrp: 0.798 ± 0.205
2.518LysTyr: 2.518 ± 0.317
0.0LysXaa: 0.0 ± 0.0
Leu
6.128LeuAla: 6.128 ± 0.569
0.798LeuCys: 0.798 ± 0.187
5.163LeuAsp: 5.163 ± 0.478
9.612LeuGlu: 9.612 ± 0.882
4.575LeuPhe: 4.575 ± 0.467
7.178LeuGly: 7.178 ± 0.906
1.595LeuHis: 1.595 ± 0.325
6.044LeuIle: 6.044 ± 0.658
9.57LeuLys: 9.57 ± 0.598
8.437LeuLeu: 8.437 ± 0.61
1.553LeuMet: 1.553 ± 0.274
7.052LeuAsn: 7.052 ± 0.598
4.953LeuPro: 4.953 ± 0.543
3.4LeuGln: 3.4 ± 0.383
3.148LeuArg: 3.148 ± 0.368
8.269LeuSer: 8.269 ± 0.799
6.632LeuThr: 6.632 ± 0.506
4.743LeuVal: 4.743 ± 0.402
0.839LeuTrp: 0.839 ± 0.206
3.106LeuTyr: 3.106 ± 0.325
0.0LeuXaa: 0.0 ± 0.0
Met
0.839MetAla: 0.839 ± 0.195
0.126MetCys: 0.126 ± 0.078
0.839MetAsp: 0.839 ± 0.187
0.839MetGlu: 0.839 ± 0.301
1.007MetPhe: 1.007 ± 0.239
1.133MetGly: 1.133 ± 0.268
0.168MetHis: 0.168 ± 0.089
1.175MetIle: 1.175 ± 0.239
0.798MetLys: 0.798 ± 0.215
1.469MetLeu: 1.469 ± 0.252
0.21MetMet: 0.21 ± 0.072
0.965MetAsn: 0.965 ± 0.149
1.049MetPro: 1.049 ± 0.204
0.462MetGln: 0.462 ± 0.158
0.923MetArg: 0.923 ± 0.178
1.973MetSer: 1.973 ± 0.287
1.133MetThr: 1.133 ± 0.225
0.881MetVal: 0.881 ± 0.217
0.0MetTrp: 0.0 ± 0.0
0.588MetTyr: 0.588 ± 0.138
0.0MetXaa: 0.0 ± 0.0
Asn
4.03AsnAla: 4.03 ± 0.563
0.588AsnCys: 0.588 ± 0.17
2.141AsnAsp: 2.141 ± 0.37
2.644AsnGlu: 2.644 ± 0.295
3.61AsnPhe: 3.61 ± 0.428
3.862AsnGly: 3.862 ± 0.483
1.343AsnHis: 1.343 ± 0.219
3.316AsnIle: 3.316 ± 0.47
2.812AsnLys: 2.812 ± 0.342
8.269AsnLeu: 8.269 ± 0.643
0.798AsnMet: 0.798 ± 0.205
3.61AsnAsn: 3.61 ± 0.483
3.778AsnPro: 3.778 ± 0.444
2.728AsnGln: 2.728 ± 0.327
2.896AsnArg: 2.896 ± 0.288
5.205AsnSer: 5.205 ± 0.433
3.106AsnThr: 3.106 ± 0.359
3.274AsnVal: 3.274 ± 0.379
0.965AsnTrp: 0.965 ± 0.207
2.98AsnTyr: 2.98 ± 0.353
0.0AsnXaa: 0.0 ± 0.0
Pro
1.973ProAla: 1.973 ± 0.321
0.294ProCys: 0.294 ± 0.129
2.602ProAsp: 2.602 ± 0.322
3.106ProGlu: 3.106 ± 0.354
1.763ProPhe: 1.763 ± 0.27
1.049ProGly: 1.049 ± 0.215
0.504ProHis: 0.504 ± 0.158
2.225ProIle: 2.225 ± 0.368
2.938ProLys: 2.938 ± 0.335
4.072ProLeu: 4.072 ± 0.613
0.546ProMet: 0.546 ± 0.171
2.938ProAsn: 2.938 ± 0.346
1.805ProPro: 1.805 ± 0.4
1.805ProGln: 1.805 ± 0.287
1.175ProArg: 1.175 ± 0.203
3.568ProSer: 3.568 ± 0.568
1.973ProThr: 1.973 ± 0.203
2.77ProVal: 2.77 ± 0.35
0.168ProTrp: 0.168 ± 0.085
1.721ProTyr: 1.721 ± 0.372
0.0ProXaa: 0.0 ± 0.0
Gln
2.56GlnAla: 2.56 ± 0.667
0.756GlnCys: 0.756 ± 0.199
1.679GlnAsp: 1.679 ± 0.307
2.98GlnGlu: 2.98 ± 0.488
1.805GlnPhe: 1.805 ± 0.253
2.686GlnGly: 2.686 ± 0.324
0.294GlnHis: 0.294 ± 0.106
3.106GlnIle: 3.106 ± 0.365
2.896GlnLys: 2.896 ± 0.389
4.072GlnLeu: 4.072 ± 0.484
0.756GlnMet: 0.756 ± 0.179
2.518GlnAsn: 2.518 ± 0.338
1.091GlnPro: 1.091 ± 0.211
2.099GlnGln: 2.099 ± 0.42
1.343GlnArg: 1.343 ± 0.192
3.358GlnSer: 3.358 ± 0.362
2.602GlnThr: 2.602 ± 0.425
2.602GlnVal: 2.602 ± 0.296
0.42GlnTrp: 0.42 ± 0.121
1.385GlnTyr: 1.385 ± 0.234
0.0GlnXaa: 0.0 ± 0.0
Arg
1.595ArgAla: 1.595 ± 0.285
0.63ArgCys: 0.63 ± 0.162
1.637ArgAsp: 1.637 ± 0.218
2.435ArgGlu: 2.435 ± 0.386
2.267ArgPhe: 2.267 ± 0.323
1.805ArgGly: 1.805 ± 0.219
0.798ArgHis: 0.798 ± 0.22
3.4ArgIle: 3.4 ± 0.369
2.98ArgLys: 2.98 ± 0.413
4.323ArgLeu: 4.323 ± 0.323
0.546ArgMet: 0.546 ± 0.151
2.225ArgAsn: 2.225 ± 0.342
1.049ArgPro: 1.049 ± 0.168
2.141ArgGln: 2.141 ± 0.316
1.637ArgArg: 1.637 ± 0.284
1.931ArgSer: 1.931 ± 0.268
2.351ArgThr: 2.351 ± 0.282
2.476ArgVal: 2.476 ± 0.282
0.588ArgTrp: 0.588 ± 0.171
1.553ArgTyr: 1.553 ± 0.258
0.0ArgXaa: 0.0 ± 0.0
Ser
4.491SerAla: 4.491 ± 0.518
0.923SerCys: 0.923 ± 0.214
4.365SerAsp: 4.365 ± 0.514
5.457SerGlu: 5.457 ± 0.506
4.323SerPhe: 4.323 ± 0.513
5.331SerGly: 5.331 ± 0.58
1.301SerHis: 1.301 ± 0.236
5.625SerIle: 5.625 ± 0.488
4.617SerLys: 4.617 ± 0.588
8.353SerLeu: 8.353 ± 0.845
1.343SerMet: 1.343 ± 0.285
4.743SerAsn: 4.743 ± 0.486
3.232SerPro: 3.232 ± 0.361
3.316SerGln: 3.316 ± 0.337
3.148SerArg: 3.148 ± 0.347
6.212SerSer: 6.212 ± 0.511
3.652SerThr: 3.652 ± 0.458
5.121SerVal: 5.121 ± 0.523
1.343SerTrp: 1.343 ± 0.318
2.518SerTyr: 2.518 ± 0.336
0.0SerXaa: 0.0 ± 0.0
Thr
3.652ThrAla: 3.652 ± 0.569
0.672ThrCys: 0.672 ± 0.211
2.938ThrAsp: 2.938 ± 0.355
3.526ThrGlu: 3.526 ± 0.38
3.736ThrPhe: 3.736 ± 0.422
3.82ThrGly: 3.82 ± 0.51
0.756ThrHis: 0.756 ± 0.221
4.239ThrIle: 4.239 ± 0.414
3.484ThrLys: 3.484 ± 0.464
6.128ThrLeu: 6.128 ± 0.552
0.798ThrMet: 0.798 ± 0.17
2.98ThrAsn: 2.98 ± 0.495
3.106ThrPro: 3.106 ± 0.332
2.476ThrGln: 2.476 ± 0.293
1.595ThrArg: 1.595 ± 0.256
4.827ThrSer: 4.827 ± 0.538
4.281ThrThr: 4.281 ± 0.517
4.491ThrVal: 4.491 ± 0.4
0.546ThrTrp: 0.546 ± 0.152
2.183ThrTyr: 2.183 ± 0.305
0.0ThrXaa: 0.0 ± 0.0
Val
3.736ValAla: 3.736 ± 0.443
0.588ValCys: 0.588 ± 0.164
3.526ValAsp: 3.526 ± 0.368
3.778ValGlu: 3.778 ± 0.421
2.98ValPhe: 2.98 ± 0.404
3.442ValGly: 3.442 ± 0.434
0.839ValHis: 0.839 ± 0.211
3.904ValIle: 3.904 ± 0.518
3.904ValLys: 3.904 ± 0.44
4.995ValLeu: 4.995 ± 0.446
1.133ValMet: 1.133 ± 0.224
4.323ValAsn: 4.323 ± 0.413
2.393ValPro: 2.393 ± 0.379
1.763ValGln: 1.763 ± 0.259
2.476ValArg: 2.476 ± 0.305
5.205ValSer: 5.205 ± 0.452
4.155ValThr: 4.155 ± 0.523
4.113ValVal: 4.113 ± 0.473
0.714ValTrp: 0.714 ± 0.188
2.309ValTyr: 2.309 ± 0.354
0.0ValXaa: 0.0 ± 0.0
Trp
0.504TrpAla: 0.504 ± 0.145
0.168TrpCys: 0.168 ± 0.079
0.839TrpAsp: 0.839 ± 0.161
1.217TrpGlu: 1.217 ± 0.246
0.714TrpPhe: 0.714 ± 0.183
1.091TrpGly: 1.091 ± 0.215
0.21TrpHis: 0.21 ± 0.094
0.672TrpIle: 0.672 ± 0.169
0.336TrpLys: 0.336 ± 0.101
1.511TrpLeu: 1.511 ± 0.337
0.252TrpMet: 0.252 ± 0.111
0.63TrpAsn: 0.63 ± 0.128
0.042TrpPro: 0.042 ± 0.038
0.546TrpGln: 0.546 ± 0.144
0.504TrpArg: 0.504 ± 0.177
1.133TrpSer: 1.133 ± 0.267
0.881TrpThr: 0.881 ± 0.197
1.007TrpVal: 1.007 ± 0.217
0.126TrpTrp: 0.126 ± 0.072
0.672TrpTyr: 0.672 ± 0.216
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.679TyrAla: 1.679 ± 0.251
0.588TyrCys: 0.588 ± 0.144
2.435TyrAsp: 2.435 ± 0.402
2.141TyrGlu: 2.141 ± 0.31
1.679TyrPhe: 1.679 ± 0.25
2.267TyrGly: 2.267 ± 0.278
0.839TyrHis: 0.839 ± 0.223
1.553TyrIle: 1.553 ± 0.306
2.267TyrLys: 2.267 ± 0.314
3.694TyrLeu: 3.694 ± 0.426
0.42TyrMet: 0.42 ± 0.136
2.476TyrAsn: 2.476 ± 0.318
1.847TyrPro: 1.847 ± 0.368
1.385TyrGln: 1.385 ± 0.169
2.183TyrArg: 2.183 ± 0.333
2.98TyrSer: 2.98 ± 0.303
1.931TyrThr: 1.931 ± 0.295
2.476TyrVal: 2.476 ± 0.376
0.63TyrTrp: 0.63 ± 0.164
1.133TyrTyr: 1.133 ± 0.188
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 98 proteins (23825 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski