Amino acid dipepetide frequency for Yellowstone lake phycodnavirus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.908AlaAla: 9.908 ± 1.386
1.109AlaCys: 1.109 ± 0.141
3.065AlaAsp: 3.065 ± 0.256
3.704AlaGlu: 3.704 ± 0.323
3.365AlaPhe: 3.365 ± 0.262
5.659AlaGly: 5.659 ± 0.4
1.072AlaHis: 1.072 ± 0.12
4.118AlaIle: 4.118 ± 0.263
5.17AlaLys: 5.17 ± 0.54
6.825AlaLeu: 6.825 ± 0.416
2.2AlaMet: 2.2 ± 0.208
4.738AlaAsn: 4.738 ± 0.488
5.49AlaPro: 5.49 ± 0.43
3.271AlaGln: 3.271 ± 0.271
5.866AlaArg: 5.866 ± 0.494
6.581AlaSer: 6.581 ± 0.641
5.659AlaThr: 5.659 ± 0.605
5.509AlaVal: 5.509 ± 0.402
0.865AlaTrp: 0.865 ± 0.14
2.162AlaTyr: 2.162 ± 0.186
0.0AlaXaa: 0.0 ± 0.0
Cys
1.467CysAla: 1.467 ± 0.215
0.508CysCys: 0.508 ± 0.131
1.015CysAsp: 1.015 ± 0.149
0.79CysGlu: 0.79 ± 0.147
0.602CysPhe: 0.602 ± 0.11
1.147CysGly: 1.147 ± 0.196
0.226CysHis: 0.226 ± 0.061
0.714CysIle: 0.714 ± 0.116
0.79CysLys: 0.79 ± 0.175
0.996CysLeu: 0.996 ± 0.123
0.395CysMet: 0.395 ± 0.095
0.677CysAsn: 0.677 ± 0.106
0.959CysPro: 0.959 ± 0.156
0.338CysGln: 0.338 ± 0.081
1.109CysArg: 1.109 ± 0.189
1.072CysSer: 1.072 ± 0.166
0.714CysThr: 0.714 ± 0.117
1.072CysVal: 1.072 ± 0.169
0.226CysTrp: 0.226 ± 0.069
0.602CysTyr: 0.602 ± 0.113
0.0CysXaa: 0.0 ± 0.0
Asp
3.723AspAla: 3.723 ± 0.334
0.602AspCys: 0.602 ± 0.124
2.764AspAsp: 2.764 ± 0.339
3.873AspGlu: 3.873 ± 0.326
2.181AspPhe: 2.181 ± 0.223
3.76AspGly: 3.76 ± 0.323
0.752AspHis: 0.752 ± 0.12
2.764AspIle: 2.764 ± 0.258
2.275AspLys: 2.275 ± 0.185
4.23AspLeu: 4.23 ± 0.37
1.391AspMet: 1.391 ± 0.199
1.297AspAsn: 1.297 ± 0.189
3.403AspPro: 3.403 ± 0.284
1.617AspGln: 1.617 ± 0.17
2.219AspArg: 2.219 ± 0.207
2.237AspSer: 2.237 ± 0.25
2.162AspThr: 2.162 ± 0.199
3.779AspVal: 3.779 ± 0.298
0.696AspTrp: 0.696 ± 0.123
1.429AspTyr: 1.429 ± 0.167
0.0AspXaa: 0.0 ± 0.0
Glu
3.685GluAla: 3.685 ± 0.368
1.015GluCys: 1.015 ± 0.153
2.745GluAsp: 2.745 ± 0.299
3.478GluGlu: 3.478 ± 0.31
2.369GluPhe: 2.369 ± 0.251
2.726GluGly: 2.726 ± 0.29
1.053GluHis: 1.053 ± 0.182
3.572GluIle: 3.572 ± 0.373
3.177GluLys: 3.177 ± 0.316
4.343GluLeu: 4.343 ± 0.286
1.485GluMet: 1.485 ± 0.199
2.35GluAsn: 2.35 ± 0.247
2.576GluPro: 2.576 ± 0.274
1.542GluGln: 1.542 ± 0.188
2.914GluArg: 2.914 ± 0.289
3.065GluSer: 3.065 ± 0.241
2.801GluThr: 2.801 ± 0.292
3.422GluVal: 3.422 ± 0.311
0.846GluTrp: 0.846 ± 0.141
1.598GluTyr: 1.598 ± 0.175
0.0GluXaa: 0.0 ± 0.0
Phe
3.253PheAla: 3.253 ± 0.303
0.771PheCys: 0.771 ± 0.129
2.237PheAsp: 2.237 ± 0.223
1.899PheGlu: 1.899 ± 0.189
1.824PhePhe: 1.824 ± 0.21
2.895PheGly: 2.895 ± 0.318
0.846PheHis: 0.846 ± 0.119
2.106PheIle: 2.106 ± 0.212
2.501PheLys: 2.501 ± 0.263
3.441PheLeu: 3.441 ± 0.266
1.655PheMet: 1.655 ± 0.198
2.707PheAsn: 2.707 ± 0.231
1.767PhePro: 1.767 ± 0.219
1.354PheGln: 1.354 ± 0.152
1.899PheArg: 1.899 ± 0.161
2.839PheSer: 2.839 ± 0.236
2.444PheThr: 2.444 ± 0.264
3.328PheVal: 3.328 ± 0.292
0.639PheTrp: 0.639 ± 0.111
1.316PheTyr: 1.316 ± 0.163
0.0PheXaa: 0.0 ± 0.0
Gly
5.866GlyAla: 5.866 ± 0.463
0.921GlyCys: 0.921 ± 0.173
2.933GlyAsp: 2.933 ± 0.235
2.839GlyGlu: 2.839 ± 0.249
3.046GlyPhe: 3.046 ± 0.324
5.979GlyGly: 5.979 ± 0.593
1.072GlyHis: 1.072 ± 0.159
3.29GlyIle: 3.29 ± 0.36
3.234GlyLys: 3.234 ± 0.243
6.43GlyLeu: 6.43 ± 0.431
1.861GlyMet: 1.861 ± 0.196
4.287GlyAsn: 4.287 ± 0.672
3.873GlyPro: 3.873 ± 0.365
2.501GlyGln: 2.501 ± 0.234
4.23GlyArg: 4.23 ± 0.329
5.49GlySer: 5.49 ± 0.433
6.111GlyThr: 6.111 ± 0.779
4.644GlyVal: 4.644 ± 0.335
0.808GlyTrp: 0.808 ± 0.138
2.237GlyTyr: 2.237 ± 0.215
0.0GlyXaa: 0.0 ± 0.0
His
1.072HisAla: 1.072 ± 0.151
0.338HisCys: 0.338 ± 0.075
0.827HisAsp: 0.827 ± 0.131
0.921HisGlu: 0.921 ± 0.139
0.79HisPhe: 0.79 ± 0.116
1.053HisGly: 1.053 ± 0.188
0.282HisHis: 0.282 ± 0.079
0.94HisIle: 0.94 ± 0.151
1.203HisLys: 1.203 ± 0.182
1.636HisLeu: 1.636 ± 0.217
0.508HisMet: 0.508 ± 0.1
0.62HisAsn: 0.62 ± 0.115
0.771HisPro: 0.771 ± 0.117
0.526HisGln: 0.526 ± 0.106
0.902HisArg: 0.902 ± 0.15
0.752HisSer: 0.752 ± 0.128
0.884HisThr: 0.884 ± 0.13
1.692HisVal: 1.692 ± 0.221
0.47HisTrp: 0.47 ± 0.094
0.696HisTyr: 0.696 ± 0.115
0.0HisXaa: 0.0 ± 0.0
Ile
3.441IleAla: 3.441 ± 0.252
0.677IleCys: 0.677 ± 0.113
3.083IleAsp: 3.083 ± 0.291
3.441IleGlu: 3.441 ± 0.312
2.463IlePhe: 2.463 ± 0.28
3.591IleGly: 3.591 ± 0.443
1.147IleHis: 1.147 ± 0.184
2.501IleIle: 2.501 ± 0.22
3.535IleLys: 3.535 ± 0.348
4.719IleLeu: 4.719 ± 0.367
0.94IleMet: 0.94 ± 0.12
2.632IleAsn: 2.632 ± 0.233
2.632IlePro: 2.632 ± 0.207
2.519IleGln: 2.519 ± 0.218
2.801IleArg: 2.801 ± 0.243
3.723IleSer: 3.723 ± 0.314
3.196IleThr: 3.196 ± 0.283
3.648IleVal: 3.648 ± 0.277
0.902IleTrp: 0.902 ± 0.124
1.73IleTyr: 1.73 ± 0.188
0.0IleXaa: 0.0 ± 0.0
Lys
5.039LysAla: 5.039 ± 0.534
1.335LysCys: 1.335 ± 0.212
2.801LysAsp: 2.801 ± 0.286
3.403LysGlu: 3.403 ± 0.324
2.388LysPhe: 2.388 ± 0.232
3.836LysGly: 3.836 ± 0.33
0.996LysHis: 0.996 ± 0.132
3.742LysIle: 3.742 ± 0.316
4.569LysLys: 4.569 ± 0.403
5.001LysLeu: 5.001 ± 0.411
1.824LysMet: 1.824 ± 0.208
3.723LysAsn: 3.723 ± 0.36
2.538LysPro: 2.538 ± 0.286
1.749LysGln: 1.749 ± 0.223
3.29LysArg: 3.29 ± 0.375
3.441LysSer: 3.441 ± 0.229
4.155LysThr: 4.155 ± 0.337
3.986LysVal: 3.986 ± 0.374
0.545LysTrp: 0.545 ± 0.113
2.313LysTyr: 2.313 ± 0.246
0.0LysXaa: 0.0 ± 0.0
Leu
7.257LeuAla: 7.257 ± 0.39
1.072LeuCys: 1.072 ± 0.17
4.475LeuAsp: 4.475 ± 0.371
3.948LeuGlu: 3.948 ± 0.314
3.497LeuPhe: 3.497 ± 0.265
5.772LeuGly: 5.772 ± 0.345
1.391LeuHis: 1.391 ± 0.184
3.911LeuIle: 3.911 ± 0.273
5.791LeuLys: 5.791 ± 0.434
6.543LeuLeu: 6.543 ± 0.404
2.219LeuMet: 2.219 ± 0.2
4.569LeuAsn: 4.569 ± 0.397
3.817LeuPro: 3.817 ± 0.282
2.933LeuGln: 2.933 ± 0.238
4.23LeuArg: 4.23 ± 0.355
5.133LeuSer: 5.133 ± 0.294
5.058LeuThr: 5.058 ± 0.413
5.697LeuVal: 5.697 ± 0.393
0.921LeuTrp: 0.921 ± 0.13
2.388LeuTyr: 2.388 ± 0.254
0.0LeuXaa: 0.0 ± 0.0
Met
2.482MetAla: 2.482 ± 0.241
0.714MetCys: 0.714 ± 0.132
1.467MetAsp: 1.467 ± 0.19
1.673MetGlu: 1.673 ± 0.226
1.09MetPhe: 1.09 ± 0.147
2.237MetGly: 2.237 ± 0.233
0.338MetHis: 0.338 ± 0.083
1.467MetIle: 1.467 ± 0.16
1.955MetLys: 1.955 ± 0.187
1.316MetLeu: 1.316 ± 0.171
0.808MetMet: 0.808 ± 0.141
2.181MetAsn: 2.181 ± 0.254
0.959MetPro: 0.959 ± 0.115
0.677MetGln: 0.677 ± 0.106
1.937MetArg: 1.937 ± 0.211
1.673MetSer: 1.673 ± 0.179
1.692MetThr: 1.692 ± 0.143
1.467MetVal: 1.467 ± 0.177
0.376MetTrp: 0.376 ± 0.086
1.09MetTyr: 1.09 ± 0.163
0.0MetXaa: 0.0 ± 0.0
Asn
5.358AsnAla: 5.358 ± 0.674
0.526AsnCys: 0.526 ± 0.101
1.429AsnAsp: 1.429 ± 0.164
2.613AsnGlu: 2.613 ± 0.294
2.313AsnPhe: 2.313 ± 0.193
3.478AsnGly: 3.478 ± 0.274
0.545AsnHis: 0.545 ± 0.101
3.215AsnIle: 3.215 ± 0.429
2.651AsnLys: 2.651 ± 0.325
5.358AsnLeu: 5.358 ± 0.503
1.767AsnMet: 1.767 ± 0.208
2.407AsnAsn: 2.407 ± 0.339
2.745AsnPro: 2.745 ± 0.216
1.899AsnGln: 1.899 ± 0.249
2.783AsnArg: 2.783 ± 0.291
3.309AsnSer: 3.309 ± 0.338
3.384AsnThr: 3.384 ± 0.543
6.017AsnVal: 6.017 ± 1.059
0.808AsnTrp: 0.808 ± 0.132
1.824AsnTyr: 1.824 ± 0.226
0.0AsnXaa: 0.0 ± 0.0
Pro
4.907ProAla: 4.907 ± 0.452
0.996ProCys: 0.996 ± 0.166
3.065ProAsp: 3.065 ± 0.243
3.102ProGlu: 3.102 ± 0.304
1.805ProPhe: 1.805 ± 0.189
3.836ProGly: 3.836 ± 0.319
0.771ProHis: 0.771 ± 0.13
2.369ProIle: 2.369 ± 0.171
3.779ProLys: 3.779 ± 0.459
3.177ProLeu: 3.177 ± 0.262
1.485ProMet: 1.485 ± 0.231
1.974ProAsn: 1.974 ± 0.217
3.742ProPro: 3.742 ± 0.44
1.711ProGln: 1.711 ± 0.158
2.67ProArg: 2.67 ± 0.253
3.93ProSer: 3.93 ± 0.297
3.309ProThr: 3.309 ± 0.292
4.569ProVal: 4.569 ± 0.325
0.865ProTrp: 0.865 ± 0.132
1.504ProTyr: 1.504 ± 0.156
0.0ProXaa: 0.0 ± 0.0
Gln
2.707GlnAla: 2.707 ± 0.281
0.432GlnCys: 0.432 ± 0.101
1.692GlnAsp: 1.692 ± 0.21
1.429GlnGlu: 1.429 ± 0.192
1.636GlnPhe: 1.636 ± 0.199
3.121GlnGly: 3.121 ± 0.272
0.451GlnHis: 0.451 ± 0.086
1.918GlnIle: 1.918 ± 0.207
2.576GlnLys: 2.576 ± 0.286
2.388GlnLeu: 2.388 ± 0.269
0.921GlnMet: 0.921 ± 0.157
1.974GlnAsn: 1.974 ± 0.203
2.068GlnPro: 2.068 ± 0.204
1.241GlnGln: 1.241 ± 0.166
2.125GlnArg: 2.125 ± 0.258
1.579GlnSer: 1.579 ± 0.184
2.219GlnThr: 2.219 ± 0.24
2.595GlnVal: 2.595 ± 0.225
0.47GlnTrp: 0.47 ± 0.094
1.41GlnTyr: 1.41 ± 0.173
0.0GlnXaa: 0.0 ± 0.0
Arg
5.434ArgAla: 5.434 ± 0.43
0.752ArgCys: 0.752 ± 0.172
2.783ArgAsp: 2.783 ± 0.287
2.82ArgGlu: 2.82 ± 0.303
2.219ArgPhe: 2.219 ± 0.215
3.309ArgGly: 3.309 ± 0.32
1.523ArgHis: 1.523 ± 0.203
3.309ArgIle: 3.309 ± 0.249
3.911ArgLys: 3.911 ± 0.385
3.948ArgLeu: 3.948 ± 0.308
1.41ArgMet: 1.41 ± 0.174
2.971ArgAsn: 2.971 ± 0.326
3.159ArgPro: 3.159 ± 0.311
1.88ArgGln: 1.88 ± 0.262
4.475ArgArg: 4.475 ± 0.687
3.441ArgSer: 3.441 ± 0.335
3.384ArgThr: 3.384 ± 0.265
4.118ArgVal: 4.118 ± 0.352
0.714ArgTrp: 0.714 ± 0.117
1.655ArgTyr: 1.655 ± 0.153
0.0ArgXaa: 0.0 ± 0.0
Ser
5.17SerAla: 5.17 ± 0.404
0.865SerCys: 0.865 ± 0.158
3.14SerAsp: 3.14 ± 0.271
2.331SerGlu: 2.331 ± 0.236
2.801SerPhe: 2.801 ± 0.273
5.866SerGly: 5.866 ± 0.619
1.09SerHis: 1.09 ± 0.159
3.666SerIle: 3.666 ± 0.338
3.648SerLys: 3.648 ± 0.244
5.02SerLeu: 5.02 ± 0.34
1.749SerMet: 1.749 ± 0.178
5.377SerAsn: 5.377 ± 0.967
3.441SerPro: 3.441 ± 0.356
2.425SerGln: 2.425 ± 0.217
3.234SerArg: 3.234 ± 0.341
4.55SerSer: 4.55 ± 0.397
4.061SerThr: 4.061 ± 0.444
4.343SerVal: 4.343 ± 0.365
0.696SerTrp: 0.696 ± 0.121
1.673SerTyr: 1.673 ± 0.187
0.0SerXaa: 0.0 ± 0.0
Thr
5.979ThrAla: 5.979 ± 0.626
0.996ThrCys: 0.996 ± 0.15
2.519ThrAsp: 2.519 ± 0.24
2.689ThrGlu: 2.689 ± 0.272
2.331ThrPhe: 2.331 ± 0.231
5.64ThrGly: 5.64 ± 0.557
1.034ThrHis: 1.034 ± 0.136
3.403ThrIle: 3.403 ± 0.295
3.666ThrLys: 3.666 ± 0.305
5.904ThrLeu: 5.904 ± 0.567
1.598ThrMet: 1.598 ± 0.233
4.08ThrAsn: 4.08 ± 0.604
3.704ThrPro: 3.704 ± 0.282
2.162ThrGln: 2.162 ± 0.246
3.61ThrArg: 3.61 ± 0.283
4.888ThrSer: 4.888 ± 0.734
5.885ThrThr: 5.885 ± 1.334
3.008ThrVal: 3.008 ± 0.354
0.94ThrTrp: 0.94 ± 0.129
2.143ThrTyr: 2.143 ± 0.272
0.0ThrXaa: 0.0 ± 0.0
Val
5.885ValAla: 5.885 ± 0.399
1.09ValCys: 1.09 ± 0.149
2.858ValAsp: 2.858 ± 0.295
3.459ValGlu: 3.459 ± 0.288
2.914ValPhe: 2.914 ± 0.299
4.738ValGly: 4.738 ± 0.467
1.316ValHis: 1.316 ± 0.169
3.553ValIle: 3.553 ± 0.297
3.535ValLys: 3.535 ± 0.396
5.659ValLeu: 5.659 ± 0.34
2.087ValMet: 2.087 ± 0.178
3.497ValAsn: 3.497 ± 0.28
4.118ValPro: 4.118 ± 0.277
3.008ValGln: 3.008 ± 0.263
4.437ValArg: 4.437 ± 0.362
4.475ValSer: 4.475 ± 0.461
6.035ValThr: 6.035 ± 0.515
4.136ValVal: 4.136 ± 0.308
0.902ValTrp: 0.902 ± 0.121
2.989ValTyr: 2.989 ± 0.277
0.0ValXaa: 0.0 ± 0.0
Trp
1.034TrpAla: 1.034 ± 0.173
0.32TrpCys: 0.32 ± 0.072
0.639TrpAsp: 0.639 ± 0.113
0.639TrpGlu: 0.639 ± 0.11
0.696TrpPhe: 0.696 ± 0.118
0.696TrpGly: 0.696 ± 0.124
0.338TrpHis: 0.338 ± 0.103
0.884TrpIle: 0.884 ± 0.145
0.752TrpLys: 0.752 ± 0.117
0.921TrpLeu: 0.921 ± 0.173
0.301TrpMet: 0.301 ± 0.073
0.658TrpAsn: 0.658 ± 0.117
0.583TrpPro: 0.583 ± 0.115
0.451TrpGln: 0.451 ± 0.09
0.752TrpArg: 0.752 ± 0.124
1.053TrpSer: 1.053 ± 0.162
0.677TrpThr: 0.677 ± 0.12
1.09TrpVal: 1.09 ± 0.142
0.338TrpTrp: 0.338 ± 0.07
0.414TrpTyr: 0.414 ± 0.089
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.67TyrAla: 2.67 ± 0.258
0.376TyrCys: 0.376 ± 0.088
1.617TyrAsp: 1.617 ± 0.171
1.711TyrGlu: 1.711 ± 0.246
1.391TyrPhe: 1.391 ± 0.154
2.313TyrGly: 2.313 ± 0.227
0.639TyrHis: 0.639 ± 0.111
1.899TyrIle: 1.899 ± 0.179
1.955TyrLys: 1.955 ± 0.207
2.764TyrLeu: 2.764 ± 0.279
0.978TyrMet: 0.978 ± 0.136
1.711TyrAsn: 1.711 ± 0.218
1.222TyrPro: 1.222 ± 0.144
1.147TyrGln: 1.147 ± 0.171
1.73TyrArg: 1.73 ± 0.197
1.918TyrSer: 1.918 ± 0.184
2.35TyrThr: 2.35 ± 0.254
2.576TyrVal: 2.576 ± 0.235
0.207TyrTrp: 0.207 ± 0.063
1.26TyrTyr: 1.26 ± 0.185
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 236 proteins (53188 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski