Amino acid dipeptide frequency for Phaeocystis globosa virus 12T

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
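As an illustration of the reasoning above, the following minimal Python sketch counts dipeptides in a set of protein sequences and compares each frequency with the uniform expectation of 2.5‰. It is not the code used to build this page: the function names, the use of overlapping windows within each protein, and the mapping of any non-standard letter to X (Xaa) are assumptions made for the example.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
EXPECTED_UNIFORM = 1000.0 / 400    # 2.5 per mille, i.e. 0.25 %


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs.

    Assumption: pairs are counted with overlapping windows inside each protein
    and never across protein boundaries.
    """
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille, expected=EXPECTED_UNIFORM):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKKLAA", "AKKN"])  # toy sequences, not real data
    for pair, value in sorted(freqs.items()):
        print(pair, round(value, 3), classify(value))
```

With the table values, AlaAla (3.227‰) would be labelled "over" and AlaCys (0.509‰) "under" relative to 2.5‰.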

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
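The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical; the LysLys value is taken from the table on this page, and 2.5‰ is the uniform expectation for 400 dipeptides.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 per mille = 0.1 %)."""
    return value * 0.1


if __name__ == "__main__":
    lys_lys = 10.319  # LysLys frequency from the table, in per mille
    print("fraction:", per_mille_to_fraction(lys_lys))
    print("percent:", per_mille_to_percent(lys_lys))
    # Ratio to the uniform expectation of 2.5 per mille (1/400).
    print("fold over uniform expectation:", lys_lys / 2.5)
```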

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.227AlaAla: 3.227 ± 0.29
0.509AlaCys: 0.509 ± 0.064
2.805AlaAsp: 2.805 ± 0.184
2.747AlaGlu: 2.747 ± 0.189
1.621AlaPhe: 1.621 ± 0.104
2.791AlaGly: 2.791 ± 0.272
0.974AlaHis: 0.974 ± 0.192
3.597AlaIle: 3.597 ± 0.169
3.648AlaLys: 3.648 ± 0.206
3.677AlaLeu: 3.677 ± 0.198
1.243AlaMet: 1.243 ± 0.101
3.321AlaAsn: 3.321 ± 0.222
1.395AlaPro: 1.395 ± 0.108
1.279AlaGln: 1.279 ± 0.109
1.57AlaArg: 1.57 ± 0.12
3.401AlaSer: 3.401 ± 0.302
3.197AlaThr: 3.197 ± 0.226
2.725AlaVal: 2.725 ± 0.171
0.378AlaTrp: 0.378 ± 0.059
1.759AlaTyr: 1.759 ± 0.109
0.0AlaXaa: 0.0 ± 0.0
Cys
0.56CysAla: 0.56 ± 0.076
0.494CysCys: 0.494 ± 0.081
1.134CysAsp: 1.134 ± 0.118
0.836CysGlu: 0.836 ± 0.095
0.632CysPhe: 0.632 ± 0.071
1.076CysGly: 1.076 ± 0.109
0.254CysHis: 0.254 ± 0.045
1.09CysIle: 1.09 ± 0.078
1.7CysLys: 1.7 ± 0.146
1.039CysLeu: 1.039 ± 0.098
0.349CysMet: 0.349 ± 0.048
1.148CysAsn: 1.148 ± 0.105
0.56CysPro: 0.56 ± 0.066
0.283CysGln: 0.283 ± 0.051
0.429CysArg: 0.429 ± 0.062
1.017CysSer: 1.017 ± 0.102
0.581CysThr: 0.581 ± 0.073
0.887CysVal: 0.887 ± 0.081
0.153CysTrp: 0.153 ± 0.03
0.727CysTyr: 0.727 ± 0.08
0.0CysXaa: 0.0 ± 0.0
Asp
3.19AspAla: 3.19 ± 0.201
0.937AspCys: 0.937 ± 0.089
5.181AspAsp: 5.181 ± 0.353
5.058AspGlu: 5.058 ± 0.335
2.885AspPhe: 2.885 ± 0.18
3.735AspGly: 3.735 ± 0.345
0.683AspHis: 0.683 ± 0.08
6.584AspIle: 6.584 ± 0.361
6.119AspLys: 6.119 ± 0.288
5.051AspLeu: 5.051 ± 0.217
1.584AspMet: 1.584 ± 0.135
5.639AspAsn: 5.639 ± 0.275
1.519AspPro: 1.519 ± 0.112
1.141AspGln: 1.141 ± 0.081
1.555AspArg: 1.555 ± 0.111
3.561AspSer: 3.561 ± 0.15
4.244AspThr: 4.244 ± 0.219
3.772AspVal: 3.772 ± 0.283
0.465AspTrp: 0.465 ± 0.067
3.328AspTyr: 3.328 ± 0.168
0.0AspXaa: 0.0 ± 0.0
Glu
2.674GluAla: 2.674 ± 0.16
0.952GluCys: 0.952 ± 0.108
3.851GluAsp: 3.851 ± 0.237
5.138GluGlu: 5.138 ± 0.652
3.052GluPhe: 3.052 ± 0.151
2.107GluGly: 2.107 ± 0.173
1.061GluHis: 1.061 ± 0.108
4.803GluIle: 4.803 ± 0.211
5.864GluLys: 5.864 ± 0.368
5.494GluLeu: 5.494 ± 0.227
1.628GluMet: 1.628 ± 0.136
5.094GluAsn: 5.094 ± 0.223
2.318GluPro: 2.318 ± 0.197
1.926GluGln: 1.926 ± 0.16
2.216GluArg: 2.216 ± 0.144
2.834GluSer: 2.834 ± 0.157
4.222GluThr: 4.222 ± 0.204
2.856GluVal: 2.856 ± 0.178
0.494GluTrp: 0.494 ± 0.061
3.016GluTyr: 3.016 ± 0.194
0.0GluXaa: 0.0 ± 0.0
Phe
1.809PheAla: 1.809 ± 0.158
0.567PheCys: 0.567 ± 0.066
2.929PheAsp: 2.929 ± 0.205
2.078PheGlu: 2.078 ± 0.126
1.751PhePhe: 1.751 ± 0.131
2.195PheGly: 2.195 ± 0.177
0.727PheHis: 0.727 ± 0.076
3.917PheIle: 3.917 ± 0.199
4.04PheLys: 4.04 ± 0.195
3.495PheLeu: 3.495 ± 0.169
1.25PheMet: 1.25 ± 0.103
4.055PheAsn: 4.055 ± 0.266
1.076PhePro: 1.076 ± 0.099
1.119PheGln: 1.119 ± 0.106
1.09PheArg: 1.09 ± 0.09
3.009PheSer: 3.009 ± 0.186
2.761PheThr: 2.761 ± 0.181
2.66PheVal: 2.66 ± 0.15
0.356PheTrp: 0.356 ± 0.054
2.187PheTyr: 2.187 ± 0.141
0.0PheXaa: 0.0 ± 0.0
Gly
2.398GlyAla: 2.398 ± 0.236
0.785GlyCys: 0.785 ± 0.082
3.604GlyAsp: 3.604 ± 0.413
2.791GlyGlu: 2.791 ± 0.148
2.245GlyPhe: 2.245 ± 0.188
4.469GlyGly: 4.469 ± 0.9
0.85GlyHis: 0.85 ± 0.123
4.186GlyIle: 4.186 ± 0.284
4.273GlyLys: 4.273 ± 0.212
4.04GlyLeu: 4.04 ± 0.247
1.003GlyMet: 1.003 ± 0.08
3.982GlyAsn: 3.982 ± 0.267
1.068GlyPro: 1.068 ± 0.094
1.301GlyGln: 1.301 ± 0.156
1.439GlyArg: 1.439 ± 0.151
3.837GlySer: 3.837 ± 0.287
2.979GlyThr: 2.979 ± 0.327
3.067GlyVal: 3.067 ± 0.167
0.342GlyTrp: 0.342 ± 0.049
2.057GlyTyr: 2.057 ± 0.151
0.0GlyXaa: 0.0 ± 0.0
His
0.77HisAla: 0.77 ± 0.079
0.327HisCys: 0.327 ± 0.053
0.937HisAsp: 0.937 ± 0.08
0.836HisGlu: 0.836 ± 0.084
0.763HisPhe: 0.763 ± 0.072
0.879HisGly: 0.879 ± 0.091
0.538HisHis: 0.538 ± 0.075
1.722HisIle: 1.722 ± 0.128
1.759HisLys: 1.759 ± 0.144
1.693HisLeu: 1.693 ± 0.212
0.581HisMet: 0.581 ± 0.068
1.759HisAsn: 1.759 ± 0.141
0.669HisPro: 0.669 ± 0.072
0.567HisGln: 0.567 ± 0.075
0.61HisArg: 0.61 ± 0.06
1.308HisSer: 1.308 ± 0.118
1.134HisThr: 1.134 ± 0.123
0.923HisVal: 0.923 ± 0.122
0.189HisTrp: 0.189 ± 0.036
0.894HisTyr: 0.894 ± 0.101
0.0HisXaa: 0.0 ± 0.0
Ile
3.285IleAla: 3.285 ± 0.176
1.308IleCys: 1.308 ± 0.125
6.598IleAsp: 6.598 ± 0.261
5.298IleGlu: 5.298 ± 0.228
3.633IlePhe: 3.633 ± 0.203
3.808IleGly: 3.808 ± 0.396
1.809IleHis: 1.809 ± 0.129
7.267IleIle: 7.267 ± 0.289
7.906IleLys: 7.906 ± 0.359
6.562IleLeu: 6.562 ± 0.276
2.275IleMet: 2.275 ± 0.138
7.812IleAsn: 7.812 ± 0.287
2.907IlePro: 2.907 ± 0.156
2.674IleGln: 2.674 ± 0.141
2.107IleArg: 2.107 ± 0.145
6.112IleSer: 6.112 ± 0.33
5.356IleThr: 5.356 ± 0.254
4.389IleVal: 4.389 ± 0.178
0.589IleTrp: 0.589 ± 0.072
4.309IleTyr: 4.309 ± 0.183
0.0IleXaa: 0.0 ± 0.0
Lys
3.822LysAla: 3.822 ± 0.226
1.468LysCys: 1.468 ± 0.138
5.334LysAsp: 5.334 ± 0.29
5.443LysGlu: 5.443 ± 0.322
3.735LysPhe: 3.735 ± 0.196
3.11LysGly: 3.11 ± 0.171
2.187LysHis: 2.187 ± 0.145
7.391LysIle: 7.391 ± 0.314
10.319LysLys: 10.319 ± 0.527
8.35LysLeu: 8.35 ± 0.345
2.238LysMet: 2.238 ± 0.14
7.732LysAsn: 7.732 ± 0.372
2.929LysPro: 2.929 ± 0.189
3.277LysGln: 3.277 ± 0.197
2.514LysArg: 2.514 ± 0.169
5.61LysSer: 5.61 ± 0.308
6.039LysThr: 6.039 ± 0.262
4.04LysVal: 4.04 ± 0.212
0.69LysTrp: 0.69 ± 0.065
5.276LysTyr: 5.276 ± 0.316
0.0LysXaa: 0.0 ± 0.0
Leu
3.619LeuAla: 3.619 ± 0.205
1.264LeuCys: 1.264 ± 0.105
5.152LeuAsp: 5.152 ± 0.188
5.101LeuGlu: 5.101 ± 0.202
4.019LeuPhe: 4.019 ± 0.199
3.561LeuGly: 3.561 ± 0.224
1.519LeuHis: 1.519 ± 0.113
7.209LeuIle: 7.209 ± 0.229
7.165LeuLys: 7.165 ± 0.368
6.627LeuLeu: 6.627 ± 0.298
2.238LeuMet: 2.238 ± 0.154
7.013LeuAsn: 7.013 ± 0.296
2.921LeuPro: 2.921 ± 0.145
2.354LeuGln: 2.354 ± 0.135
2.362LeuArg: 2.362 ± 0.147
5.799LeuSer: 5.799 ± 0.273
4.658LeuThr: 4.658 ± 0.187
4.048LeuVal: 4.048 ± 0.208
0.567LeuTrp: 0.567 ± 0.073
3.99LeuTyr: 3.99 ± 0.205
0.0LeuXaa: 0.0 ± 0.0
Met
1.279MetAla: 1.279 ± 0.123
0.392MetCys: 0.392 ± 0.06
1.802MetAsp: 1.802 ± 0.137
1.853MetGlu: 1.853 ± 0.156
1.097MetPhe: 1.097 ± 0.108
1.482MetGly: 1.482 ± 0.117
0.247MetHis: 0.247 ± 0.051
1.744MetIle: 1.744 ± 0.125
2.1MetLys: 2.1 ± 0.122
1.904MetLeu: 1.904 ± 0.121
0.836MetMet: 0.836 ± 0.087
1.933MetAsn: 1.933 ± 0.146
0.748MetPro: 0.748 ± 0.088
0.589MetGln: 0.589 ± 0.072
1.025MetArg: 1.025 ± 0.096
2.144MetSer: 2.144 ± 0.156
1.403MetThr: 1.403 ± 0.113
1.279MetVal: 1.279 ± 0.1
0.269MetTrp: 0.269 ± 0.056
1.141MetTyr: 1.141 ± 0.099
0.0MetXaa: 0.0 ± 0.0
Asn
3.357AsnAla: 3.357 ± 0.223
1.054AsnCys: 1.054 ± 0.089
4.694AsnAsp: 4.694 ± 0.185
4.585AsnGlu: 4.585 ± 0.246
3.263AsnPhe: 3.263 ± 0.193
3.968AsnGly: 3.968 ± 0.337
1.519AsnHis: 1.519 ± 0.155
9.062AsnIle: 9.062 ± 0.397
8.568AsnLys: 8.568 ± 0.388
7.114AsnLeu: 7.114 ± 0.294
2.398AsnMet: 2.398 ± 0.162
8.829AsnAsn: 8.829 ± 0.353
2.711AsnPro: 2.711 ± 0.141
2.231AsnGln: 2.231 ± 0.144
2.275AsnArg: 2.275 ± 0.117
5.363AsnSer: 5.363 ± 0.284
5.719AsnThr: 5.719 ± 0.274
4.171AsnVal: 4.171 ± 0.197
0.632AsnTrp: 0.632 ± 0.077
4.506AsnTyr: 4.506 ± 0.285
0.0AsnXaa: 0.0 ± 0.0
Pro
1.679ProAla: 1.679 ± 0.119
0.436ProCys: 0.436 ± 0.057
2.369ProAsp: 2.369 ± 0.129
2.478ProGlu: 2.478 ± 0.201
1.308ProPhe: 1.308 ± 0.093
1.388ProGly: 1.388 ± 0.1
0.763ProHis: 0.763 ± 0.092
2.565ProIle: 2.565 ± 0.13
2.573ProLys: 2.573 ± 0.153
2.216ProLeu: 2.216 ± 0.137
0.618ProMet: 0.618 ± 0.073
2.093ProAsn: 2.093 ± 0.133
1.279ProPro: 1.279 ± 0.133
0.879ProGln: 0.879 ± 0.075
0.901ProArg: 0.901 ± 0.094
2.151ProSer: 2.151 ± 0.145
2.078ProThr: 2.078 ± 0.142
2.18ProVal: 2.18 ± 0.143
0.225ProTrp: 0.225 ± 0.037
1.541ProTyr: 1.541 ± 0.119
0.0ProXaa: 0.0 ± 0.0
Gln
1.286GlnAla: 1.286 ± 0.12
0.334GlnCys: 0.334 ± 0.048
1.439GlnAsp: 1.439 ± 0.093
1.875GlnGlu: 1.875 ± 0.14
1.432GlnPhe: 1.432 ± 0.115
1.112GlnGly: 1.112 ± 0.12
0.661GlnHis: 0.661 ± 0.077
2.369GlnIle: 2.369 ± 0.14
2.362GlnLys: 2.362 ± 0.147
2.434GlnLeu: 2.434 ± 0.157
0.814GlnMet: 0.814 ± 0.09
2.391GlnAsn: 2.391 ± 0.161
0.901GlnPro: 0.901 ± 0.08
1.17GlnGln: 1.17 ± 0.09
0.908GlnArg: 0.908 ± 0.106
1.773GlnSer: 1.773 ± 0.124
1.977GlnThr: 1.977 ± 0.16
1.352GlnVal: 1.352 ± 0.113
0.182GlnTrp: 0.182 ± 0.036
1.373GlnTyr: 1.373 ± 0.104
0.0GlnXaa: 0.0 ± 0.0
Arg
1.366ArgAla: 1.366 ± 0.109
0.603ArgCys: 0.603 ± 0.072
1.686ArgAsp: 1.686 ± 0.125
2.129ArgGlu: 2.129 ± 0.139
1.308ArgPhe: 1.308 ± 0.097
1.548ArgGly: 1.548 ± 0.108
0.669ArgHis: 0.669 ± 0.076
2.282ArgIle: 2.282 ± 0.133
2.638ArgLys: 2.638 ± 0.194
2.449ArgLeu: 2.449 ± 0.139
0.77ArgMet: 0.77 ± 0.075
2.166ArgAsn: 2.166 ± 0.143
1.257ArgPro: 1.257 ± 0.106
0.974ArgGln: 0.974 ± 0.095
1.323ArgArg: 1.323 ± 0.133
1.744ArgSer: 1.744 ± 0.127
1.788ArgThr: 1.788 ± 0.125
1.679ArgVal: 1.679 ± 0.158
0.349ArgTrp: 0.349 ± 0.058
1.294ArgTyr: 1.294 ± 0.114
0.0ArgXaa: 0.0 ± 0.0
Ser
3.394SerAla: 3.394 ± 0.217
1.01SerCys: 1.01 ± 0.102
4.28SerAsp: 4.28 ± 0.246
2.878SerGlu: 2.878 ± 0.162
3.081SerPhe: 3.081 ± 0.223
4.331SerGly: 4.331 ± 0.365
1.453SerHis: 1.453 ± 0.106
6.104SerIle: 6.104 ± 0.273
5.305SerLys: 5.305 ± 0.237
5.428SerLeu: 5.428 ± 0.223
1.635SerMet: 1.635 ± 0.142
5.988SerAsn: 5.988 ± 0.338
1.693SerPro: 1.693 ± 0.13
1.977SerGln: 1.977 ± 0.141
2.362SerArg: 2.362 ± 0.139
4.985SerSer: 4.985 ± 0.267
3.619SerThr: 3.619 ± 0.227
3.946SerVal: 3.946 ± 0.207
0.472SerTrp: 0.472 ± 0.064
2.841SerTyr: 2.841 ± 0.18
0.0SerXaa: 0.0 ± 0.0
Thr
2.943ThrAla: 2.943 ± 0.232
0.756ThrCys: 0.756 ± 0.081
4.593ThrAsp: 4.593 ± 0.214
3.626ThrGlu: 3.626 ± 0.188
2.354ThrPhe: 2.354 ± 0.143
3.633ThrGly: 3.633 ± 0.368
1.403ThrHis: 1.403 ± 0.116
4.796ThrIle: 4.796 ± 0.238
5.545ThrLys: 5.545 ± 0.249
4.789ThrLeu: 4.789 ± 0.203
1.192ThrMet: 1.192 ± 0.101
5.581ThrAsn: 5.581 ± 0.235
2.609ThrPro: 2.609 ± 0.187
1.933ThrGln: 1.933 ± 0.128
2.086ThrArg: 2.086 ± 0.113
4.106ThrSer: 4.106 ± 0.252
4.891ThrThr: 4.891 ± 0.362
3.11ThrVal: 3.11 ± 0.176
0.472ThrTrp: 0.472 ± 0.078
2.529ThrTyr: 2.529 ± 0.137
0.0ThrXaa: 0.0 ± 0.0
Val
2.929ValAla: 2.929 ± 0.228
0.879ValCys: 0.879 ± 0.091
4.099ValAsp: 4.099 ± 0.353
3.626ValGlu: 3.626 ± 0.229
2.449ValPhe: 2.449 ± 0.137
2.907ValGly: 2.907 ± 0.287
0.632ValHis: 0.632 ± 0.058
4.447ValIle: 4.447 ± 0.202
4.426ValLys: 4.426 ± 0.172
4.237ValLeu: 4.237 ± 0.208
1.112ValMet: 1.112 ± 0.086
3.946ValAsn: 3.946 ± 0.22
1.809ValPro: 1.809 ± 0.113
1.155ValGln: 1.155 ± 0.112
1.657ValArg: 1.657 ± 0.118
4.338ValSer: 4.338 ± 0.287
2.682ValThr: 2.682 ± 0.182
3.147ValVal: 3.147 ± 0.226
0.385ValTrp: 0.385 ± 0.062
2.413ValTyr: 2.413 ± 0.124
0.0ValXaa: 0.0 ± 0.0
Trp
0.356TrpAla: 0.356 ± 0.062
0.174TrpCys: 0.174 ± 0.04
0.349TrpAsp: 0.349 ± 0.057
0.56TrpGlu: 0.56 ± 0.068
0.385TrpPhe: 0.385 ± 0.058
0.465TrpGly: 0.465 ± 0.056
0.182TrpHis: 0.182 ± 0.038
0.421TrpIle: 0.421 ± 0.072
0.61TrpLys: 0.61 ± 0.074
0.56TrpLeu: 0.56 ± 0.064
0.327TrpMet: 0.327 ± 0.057
0.632TrpAsn: 0.632 ± 0.073
0.145TrpPro: 0.145 ± 0.037
0.153TrpGln: 0.153 ± 0.03
0.276TrpArg: 0.276 ± 0.037
0.501TrpSer: 0.501 ± 0.059
0.56TrpThr: 0.56 ± 0.08
0.596TrpVal: 0.596 ± 0.113
0.036TrpTrp: 0.036 ± 0.016
0.334TrpTyr: 0.334 ± 0.054
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.918TyrAla: 1.918 ± 0.119
0.698TyrCys: 0.698 ± 0.075
3.401TyrAsp: 3.401 ± 0.175
2.667TyrGlu: 2.667 ± 0.164
2.057TyrPhe: 2.057 ± 0.146
2.151TyrGly: 2.151 ± 0.114
0.727TyrHis: 0.727 ± 0.074
4.447TyrIle: 4.447 ± 0.203
4.585TyrLys: 4.585 ± 0.259
4.026TyrLeu: 4.026 ± 0.219
1.17TyrMet: 1.17 ± 0.095
4.905TyrAsn: 4.905 ± 0.252
1.206TyrPro: 1.206 ± 0.104
1.163TyrGln: 1.163 ± 0.103
1.315TyrArg: 1.315 ± 0.109
3.19TyrSer: 3.19 ± 0.18
3.059TyrThr: 3.059 ± 0.155
2.449TyrVal: 2.449 ± 0.127
0.378TyrTrp: 0.378 ± 0.058
2.703TyrTyr: 2.703 ± 0.174
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 439 proteins (137610 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
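A protein-level bootstrap of this kind can be sketched as follows. This is a hedged illustration, not the procedure actually used for this page: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the reported error is the standard deviation across replicates. The function names and the fixed seed are hypothetical.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency (per mille) of one dipeptide over all adjacent residue pairs."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Standard deviation of the dipeptide frequency across bootstrap replicates.

    Assumption: each replicate draws len(sequences) whole proteins with
    replacement, so the protein (not the residue) is the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    proteins = ["MKKLAA", "AKKN", "MAAKL", "KKKA"]  # toy sequences, not real data
    print(pair_per_mille(proteins, "KK"), "+/-", bootstrap_error(proteins, "KK"))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error also reflects variation in composition between proteins.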

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski