Amino acid dipepetide frequency for Dishui Lake phycodnavirus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.664AlaAla: 5.664 ± 0.656
0.918AlaCys: 0.918 ± 0.15
2.91AlaAsp: 2.91 ± 0.224
3.308AlaGlu: 3.308 ± 0.748
2.806AlaPhe: 2.806 ± 0.186
3.863AlaGly: 3.863 ± 0.35
1.195AlaHis: 1.195 ± 0.145
4.244AlaIle: 4.244 ± 0.283
5.093AlaLys: 5.093 ± 0.709
5.543AlaLeu: 5.543 ± 0.369
1.957AlaMet: 1.957 ± 0.207
4.261AlaAsn: 4.261 ± 1.125
2.338AlaPro: 2.338 ± 0.309
2.356AlaGln: 2.356 ± 0.238
3.239AlaArg: 3.239 ± 0.297
4.348AlaSer: 4.348 ± 0.343
3.984AlaThr: 3.984 ± 0.47
4.504AlaVal: 4.504 ± 0.359
0.797AlaTrp: 0.797 ± 0.12
2.2AlaTyr: 2.2 ± 0.211
0.0AlaXaa: 0.0 ± 0.0
Cys
0.779CysAla: 0.779 ± 0.128
0.243CysCys: 0.243 ± 0.081
1.057CysAsp: 1.057 ± 0.157
1.247CysGlu: 1.247 ± 0.215
0.641CysPhe: 0.641 ± 0.109
0.866CysGly: 0.866 ± 0.158
0.312CysHis: 0.312 ± 0.079
0.658CysIle: 0.658 ± 0.131
0.883CysLys: 0.883 ± 0.15
1.23CysLeu: 1.23 ± 0.178
0.416CysMet: 0.416 ± 0.076
0.641CysAsn: 0.641 ± 0.098
0.797CysPro: 0.797 ± 0.15
0.346CysGln: 0.346 ± 0.085
0.953CysArg: 0.953 ± 0.154
0.797CysSer: 0.797 ± 0.142
0.641CysThr: 0.641 ± 0.129
1.022CysVal: 1.022 ± 0.157
0.139CysTrp: 0.139 ± 0.048
0.537CysTyr: 0.537 ± 0.113
0.0CysXaa: 0.0 ± 0.0
Asp
4.14AspAla: 4.14 ± 0.247
0.589AspCys: 0.589 ± 0.109
4.14AspAsp: 4.14 ± 0.366
4.313AspGlu: 4.313 ± 0.33
2.581AspPhe: 2.581 ± 0.272
4.469AspGly: 4.469 ± 0.401
1.057AspHis: 1.057 ± 0.167
5.11AspIle: 5.11 ± 0.352
2.789AspLys: 2.789 ± 0.244
4.296AspLeu: 4.296 ± 0.292
1.299AspMet: 1.299 ± 0.161
2.096AspAsn: 2.096 ± 0.204
2.2AspPro: 2.2 ± 0.226
1.472AspGln: 1.472 ± 0.19
2.841AspArg: 2.841 ± 0.244
2.91AspSer: 2.91 ± 0.278
3.845AspThr: 3.845 ± 0.31
4.867AspVal: 4.867 ± 0.383
0.589AspTrp: 0.589 ± 0.084
2.598AspTyr: 2.598 ± 0.259
0.0AspXaa: 0.0 ± 0.0
Glu
3.741GluAla: 3.741 ± 0.732
1.247GluCys: 1.247 ± 0.213
3.793GluAsp: 3.793 ± 0.259
5.404GluGlu: 5.404 ± 0.609
2.893GluPhe: 2.893 ± 0.225
2.546GluGly: 2.546 ± 0.306
1.524GluHis: 1.524 ± 0.193
4.105GluIle: 4.105 ± 0.277
4.85GluLys: 4.85 ± 0.478
5.266GluLeu: 5.266 ± 0.316
1.784GluMet: 1.784 ± 0.205
3.205GluAsn: 3.205 ± 0.239
1.905GluPro: 1.905 ± 0.218
1.836GluGln: 1.836 ± 0.163
3.845GluArg: 3.845 ± 0.462
3.343GluSer: 3.343 ± 0.332
3.101GluThr: 3.101 ± 0.237
3.516GluVal: 3.516 ± 0.249
0.779GluTrp: 0.779 ± 0.128
2.737GluTyr: 2.737 ± 0.269
0.0GluXaa: 0.0 ± 0.0
Phe
2.425PheAla: 2.425 ± 0.265
0.901PheCys: 0.901 ± 0.149
2.91PheAsp: 2.91 ± 0.276
2.546PheGlu: 2.546 ± 0.211
1.94PhePhe: 1.94 ± 0.246
2.979PheGly: 2.979 ± 0.249
0.987PheHis: 0.987 ± 0.143
2.685PheIle: 2.685 ± 0.247
2.737PheLys: 2.737 ± 0.241
3.291PheLeu: 3.291 ± 0.226
1.438PheMet: 1.438 ± 0.199
2.546PheAsn: 2.546 ± 0.252
1.472PhePro: 1.472 ± 0.174
1.022PheGln: 1.022 ± 0.143
1.905PheArg: 1.905 ± 0.2
3.291PheSer: 3.291 ± 0.288
2.546PheThr: 2.546 ± 0.286
3.638PheVal: 3.638 ± 0.325
0.381PheTrp: 0.381 ± 0.079
1.94PheTyr: 1.94 ± 0.239
0.0PheXaa: 0.0 ± 0.0
Gly
4.486GlyAla: 4.486 ± 0.453
0.728GlyCys: 0.728 ± 0.126
3.776GlyAsp: 3.776 ± 0.327
2.875GlyGlu: 2.875 ± 0.233
2.702GlyPhe: 2.702 ± 0.247
4.781GlyGly: 4.781 ± 0.552
1.559GlyHis: 1.559 ± 0.195
4.105GlyIle: 4.105 ± 0.379
3.447GlyLys: 3.447 ± 0.363
4.642GlyLeu: 4.642 ± 0.272
1.472GlyMet: 1.472 ± 0.158
4.469GlyAsn: 4.469 ± 1.092
1.957GlyPro: 1.957 ± 0.138
1.923GlyGln: 1.923 ± 0.185
3.482GlyArg: 3.482 ± 0.446
4.105GlySer: 4.105 ± 0.454
4.382GlyThr: 4.382 ± 0.704
4.365GlyVal: 4.365 ± 0.386
0.71GlyTrp: 0.71 ± 0.117
2.771GlyTyr: 2.771 ± 0.296
0.0GlyXaa: 0.0 ± 0.0
His
1.351HisAla: 1.351 ± 0.142
0.416HisCys: 0.416 ± 0.086
0.918HisAsp: 0.918 ± 0.144
1.057HisGlu: 1.057 ± 0.145
1.039HisPhe: 1.039 ± 0.136
1.576HisGly: 1.576 ± 0.206
0.398HisHis: 0.398 ± 0.082
1.403HisIle: 1.403 ± 0.166
1.109HisLys: 1.109 ± 0.17
1.42HisLeu: 1.42 ± 0.15
0.554HisMet: 0.554 ± 0.106
1.109HisAsn: 1.109 ± 0.137
1.022HisPro: 1.022 ± 0.169
0.572HisGln: 0.572 ± 0.112
1.264HisArg: 1.264 ± 0.173
0.849HisSer: 0.849 ± 0.119
1.507HisThr: 1.507 ± 0.164
1.853HisVal: 1.853 ± 0.216
0.26HisTrp: 0.26 ± 0.064
0.762HisTyr: 0.762 ± 0.105
0.0HisXaa: 0.0 ± 0.0
Ile
4.192IleAla: 4.192 ± 0.291
0.71IleCys: 0.71 ± 0.124
4.971IleAsp: 4.971 ± 0.378
4.434IleGlu: 4.434 ± 0.27
2.442IlePhe: 2.442 ± 0.238
4.192IleGly: 4.192 ± 0.376
1.576IleHis: 1.576 ± 0.188
3.741IleIle: 3.741 ± 0.307
5.041IleLys: 5.041 ± 0.299
4.434IleLeu: 4.434 ± 0.309
1.698IleMet: 1.698 ± 0.182
3.655IleAsn: 3.655 ± 0.271
3.43IlePro: 3.43 ± 0.287
2.564IleGln: 2.564 ± 0.198
3.464IleArg: 3.464 ± 0.254
3.724IleSer: 3.724 ± 0.307
3.88IleThr: 3.88 ± 0.297
4.608IleVal: 4.608 ± 0.393
0.502IleTrp: 0.502 ± 0.078
2.39IleTyr: 2.39 ± 0.209
0.0IleXaa: 0.0 ± 0.0
Lys
4.226LysAla: 4.226 ± 0.576
1.143LysCys: 1.143 ± 0.156
3.291LysAsp: 3.291 ± 0.272
3.811LysGlu: 3.811 ± 0.348
3.101LysPhe: 3.101 ± 0.251
3.014LysGly: 3.014 ± 0.306
1.524LysHis: 1.524 ± 0.174
4.573LysIle: 4.573 ± 0.328
6.288LysLys: 6.288 ± 0.707
5.785LysLeu: 5.785 ± 0.416
1.94LysMet: 1.94 ± 0.235
4.833LysAsn: 4.833 ± 0.924
2.685LysPro: 2.685 ± 0.264
1.992LysGln: 1.992 ± 0.28
3.828LysArg: 3.828 ± 0.352
4.088LysSer: 4.088 ± 0.36
4.902LysThr: 4.902 ± 0.469
3.949LysVal: 3.949 ± 0.29
0.728LysTrp: 0.728 ± 0.089
3.135LysTyr: 3.135 ± 0.266
0.0LysXaa: 0.0 ± 0.0
Leu
4.33LeuAla: 4.33 ± 0.272
1.091LeuCys: 1.091 ± 0.177
4.677LeuAsp: 4.677 ± 0.268
5.11LeuGlu: 5.11 ± 0.318
2.841LeuPhe: 2.841 ± 0.265
4.417LeuGly: 4.417 ± 0.321
1.542LeuHis: 1.542 ± 0.164
4.556LeuIle: 4.556 ± 0.359
5.422LeuLys: 5.422 ± 0.444
5.855LeuLeu: 5.855 ± 0.374
1.992LeuMet: 1.992 ± 0.206
4.937LeuAsn: 4.937 ± 0.759
3.222LeuPro: 3.222 ± 0.239
2.512LeuGln: 2.512 ± 0.219
4.971LeuArg: 4.971 ± 0.297
5.578LeuSer: 5.578 ± 0.286
5.266LeuThr: 5.266 ± 0.326
5.404LeuVal: 5.404 ± 0.32
0.728LeuTrp: 0.728 ± 0.124
3.274LeuTyr: 3.274 ± 0.276
0.0LeuXaa: 0.0 ± 0.0
Met
1.628MetAla: 1.628 ± 0.22
0.606MetCys: 0.606 ± 0.138
1.368MetAsp: 1.368 ± 0.157
1.576MetGlu: 1.576 ± 0.19
1.161MetPhe: 1.161 ± 0.177
1.403MetGly: 1.403 ± 0.188
0.658MetHis: 0.658 ± 0.108
1.646MetIle: 1.646 ± 0.2
2.39MetLys: 2.39 ± 0.278
1.715MetLeu: 1.715 ± 0.216
0.71MetMet: 0.71 ± 0.155
1.836MetAsn: 1.836 ± 0.268
1.161MetPro: 1.161 ± 0.115
0.589MetGln: 0.589 ± 0.108
1.213MetArg: 1.213 ± 0.164
2.304MetSer: 2.304 ± 0.226
1.594MetThr: 1.594 ± 0.142
1.42MetVal: 1.42 ± 0.163
0.364MetTrp: 0.364 ± 0.084
1.386MetTyr: 1.386 ± 0.166
0.0MetXaa: 0.0 ± 0.0
Asn
5.63AsnAla: 5.63 ± 1.436
0.572AsnCys: 0.572 ± 0.123
2.754AsnAsp: 2.754 ± 0.18
2.702AsnGlu: 2.702 ± 0.306
2.616AsnPhe: 2.616 ± 0.19
3.932AsnGly: 3.932 ± 0.425
1.039AsnHis: 1.039 ± 0.129
4.452AsnIle: 4.452 ± 0.297
3.655AsnLys: 3.655 ± 0.615
4.781AsnLeu: 4.781 ± 0.468
1.472AsnMet: 1.472 ± 0.184
4.538AsnAsn: 4.538 ± 0.769
2.321AsnPro: 2.321 ± 0.25
1.767AsnGln: 1.767 ± 0.218
2.442AsnArg: 2.442 ± 0.447
3.101AsnSer: 3.101 ± 0.309
4.954AsnThr: 4.954 ± 0.705
5.82AsnVal: 5.82 ± 1.305
0.312AsnTrp: 0.312 ± 0.083
2.079AsnTyr: 2.079 ± 0.188
0.0AsnXaa: 0.0 ± 0.0
Pro
2.616ProAla: 2.616 ± 0.38
0.485ProCys: 0.485 ± 0.117
2.148ProAsp: 2.148 ± 0.195
2.737ProGlu: 2.737 ± 0.294
1.749ProPhe: 1.749 ± 0.205
2.477ProGly: 2.477 ± 0.208
0.537ProHis: 0.537 ± 0.111
2.304ProIle: 2.304 ± 0.235
2.841ProLys: 2.841 ± 0.313
2.72ProLeu: 2.72 ± 0.239
1.109ProMet: 1.109 ± 0.152
2.183ProAsn: 2.183 ± 0.215
2.148ProPro: 2.148 ± 0.286
1.49ProGln: 1.49 ± 0.154
2.061ProArg: 2.061 ± 0.225
2.893ProSer: 2.893 ± 0.253
2.72ProThr: 2.72 ± 0.185
2.841ProVal: 2.841 ± 0.248
0.398ProTrp: 0.398 ± 0.099
1.334ProTyr: 1.334 ± 0.157
0.0ProXaa: 0.0 ± 0.0
Gln
1.801GlnAla: 1.801 ± 0.21
0.502GlnCys: 0.502 ± 0.11
1.594GlnAsp: 1.594 ± 0.18
1.94GlnGlu: 1.94 ± 0.209
1.49GlnPhe: 1.49 ± 0.154
1.264GlnGly: 1.264 ± 0.166
0.693GlnHis: 0.693 ± 0.097
2.183GlnIle: 2.183 ± 0.189
2.546GlnLys: 2.546 ± 0.268
2.893GlnLeu: 2.893 ± 0.214
0.918GlnMet: 0.918 ± 0.132
1.767GlnAsn: 1.767 ± 0.226
1.403GlnPro: 1.403 ± 0.141
1.559GlnGln: 1.559 ± 0.16
1.646GlnArg: 1.646 ± 0.207
1.888GlnSer: 1.888 ± 0.175
1.94GlnThr: 1.94 ± 0.174
2.061GlnVal: 2.061 ± 0.214
0.364GlnTrp: 0.364 ± 0.074
1.368GlnTyr: 1.368 ± 0.143
0.0GlnXaa: 0.0 ± 0.0
Arg
3.274ArgAla: 3.274 ± 0.301
0.779ArgCys: 0.779 ± 0.129
2.685ArgAsp: 2.685 ± 0.201
3.932ArgGlu: 3.932 ± 0.451
2.338ArgPhe: 2.338 ± 0.214
2.529ArgGly: 2.529 ± 0.243
1.039ArgHis: 1.039 ± 0.148
3.256ArgIle: 3.256 ± 0.244
3.222ArgLys: 3.222 ± 0.389
4.4ArgLeu: 4.4 ± 0.306
1.628ArgMet: 1.628 ± 0.179
2.841ArgAsn: 2.841 ± 0.33
2.061ArgPro: 2.061 ± 0.281
2.061ArgGln: 2.061 ± 0.238
3.724ArgArg: 3.724 ± 0.536
3.066ArgSer: 3.066 ± 0.234
3.222ArgThr: 3.222 ± 0.301
4.036ArgVal: 4.036 ± 0.382
0.641ArgTrp: 0.641 ± 0.112
1.94ArgTyr: 1.94 ± 0.195
0.0ArgXaa: 0.0 ± 0.0
Ser
3.568SerAla: 3.568 ± 0.268
0.572SerCys: 0.572 ± 0.095
3.811SerAsp: 3.811 ± 0.282
3.655SerGlu: 3.655 ± 0.3
2.598SerPhe: 2.598 ± 0.208
5.37SerGly: 5.37 ± 0.516
1.143SerHis: 1.143 ± 0.153
4.625SerIle: 4.625 ± 0.316
3.828SerLys: 3.828 ± 0.241
4.798SerLeu: 4.798 ± 0.335
1.282SerMet: 1.282 ± 0.142
5.162SerAsn: 5.162 ± 0.9
1.853SerPro: 1.853 ± 0.194
2.269SerGln: 2.269 ± 0.196
2.858SerArg: 2.858 ± 0.236
3.897SerSer: 3.897 ± 0.397
4.521SerThr: 4.521 ± 0.373
3.915SerVal: 3.915 ± 0.349
0.52SerTrp: 0.52 ± 0.098
2.286SerTyr: 2.286 ± 0.192
0.0SerXaa: 0.0 ± 0.0
Thr
4.313ThrAla: 4.313 ± 0.366
0.624ThrCys: 0.624 ± 0.122
3.482ThrAsp: 3.482 ± 0.36
3.516ThrGlu: 3.516 ± 0.303
3.222ThrPhe: 3.222 ± 0.232
5.075ThrGly: 5.075 ± 0.76
1.161ThrHis: 1.161 ± 0.134
4.226ThrIle: 4.226 ± 0.441
4.504ThrLys: 4.504 ± 0.37
5.162ThrLeu: 5.162 ± 0.346
1.576ThrMet: 1.576 ± 0.157
4.278ThrAsn: 4.278 ± 0.583
3.083ThrPro: 3.083 ± 0.219
2.131ThrGln: 2.131 ± 0.198
3.274ThrArg: 3.274 ± 0.252
4.833ThrSer: 4.833 ± 0.535
5.37ThrThr: 5.37 ± 0.579
3.811ThrVal: 3.811 ± 0.345
0.676ThrTrp: 0.676 ± 0.15
2.148ThrTyr: 2.148 ± 0.262
0.0ThrXaa: 0.0 ± 0.0
Val
4.365ValAla: 4.365 ± 0.339
1.334ValCys: 1.334 ± 0.161
4.209ValAsp: 4.209 ± 0.341
4.209ValGlu: 4.209 ± 0.311
3.308ValPhe: 3.308 ± 0.303
4.625ValGly: 4.625 ± 0.589
1.594ValHis: 1.594 ± 0.2
4.4ValIle: 4.4 ± 0.288
4.538ValLys: 4.538 ± 0.322
5.491ValLeu: 5.491 ± 0.304
1.698ValMet: 1.698 ± 0.211
3.672ValAsn: 3.672 ± 0.333
2.875ValPro: 2.875 ± 0.251
2.2ValGln: 2.2 ± 0.199
3.274ValArg: 3.274 ± 0.233
4.729ValSer: 4.729 ± 0.498
4.815ValThr: 4.815 ± 0.627
4.66ValVal: 4.66 ± 0.396
0.797ValTrp: 0.797 ± 0.114
3.326ValTyr: 3.326 ± 0.318
0.0ValXaa: 0.0 ± 0.0
Trp
0.433TrpAla: 0.433 ± 0.089
0.243TrpCys: 0.243 ± 0.077
0.589TrpAsp: 0.589 ± 0.092
0.537TrpGlu: 0.537 ± 0.099
0.45TrpPhe: 0.45 ± 0.081
0.45TrpGly: 0.45 ± 0.081
0.121TrpHis: 0.121 ± 0.046
0.71TrpIle: 0.71 ± 0.116
0.953TrpLys: 0.953 ± 0.161
0.797TrpLeu: 0.797 ± 0.135
0.346TrpMet: 0.346 ± 0.087
0.779TrpAsn: 0.779 ± 0.122
0.398TrpPro: 0.398 ± 0.089
0.277TrpGln: 0.277 ± 0.068
0.381TrpArg: 0.381 ± 0.102
0.71TrpSer: 0.71 ± 0.093
0.589TrpThr: 0.589 ± 0.099
0.797TrpVal: 0.797 ± 0.124
0.104TrpTrp: 0.104 ± 0.034
0.416TrpTyr: 0.416 ± 0.086
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.564TyrAla: 2.564 ± 0.22
0.502TyrCys: 0.502 ± 0.095
3.118TyrAsp: 3.118 ± 0.265
2.512TyrGlu: 2.512 ± 0.257
1.68TyrPhe: 1.68 ± 0.176
2.91TyrGly: 2.91 ± 0.297
0.779TyrHis: 0.779 ± 0.124
2.668TyrIle: 2.668 ± 0.222
2.702TyrLys: 2.702 ± 0.276
3.205TyrLeu: 3.205 ± 0.22
1.42TyrMet: 1.42 ± 0.182
2.131TyrAsn: 2.131 ± 0.232
1.455TyrPro: 1.455 ± 0.174
0.883TyrGln: 0.883 ± 0.12
2.009TyrArg: 2.009 ± 0.205
2.079TyrSer: 2.079 ± 0.199
2.685TyrThr: 2.685 ± 0.282
3.031TyrVal: 3.031 ± 0.232
0.294TyrTrp: 0.294 ± 0.087
1.905TyrTyr: 1.905 ± 0.234
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 203 proteins (57732 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski