Amino acid dipeptide frequency for Chrysochromulina ericina virus (CeV01)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
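The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own code: the function names, the one-letter alphabet, the mapping of every non-standard residue to X (Xaa), and the choice to count overlapping dipeptides within each protein (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter

# Assumption: one-letter codes for the 20 standard residues.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_counts(proteins):
    """Count overlapping dipeptides inside each protein sequence."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            # Assumption: anything outside the standard alphabet becomes X (Xaa).
            first = first if first in STANDARD else "X"
            second = second if second in STANDARD else "X"
            counts[first + second] += 1
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille of all counted dipeptides."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dipeptide: 1000.0 * n / total for dipeptide, n in counts.items()}


# Uniform expectation for the 400 standard dipeptides, in per mille.
expected_per_mille = 1000.0 / (len(STANDARD) ** 2)

# Toy input (hypothetical sequences, not taken from the proteome).
toy_proteins = ["MKKLN", "MNIIK"]
toy_freqs = dipeptide_per_mille(toy_proteins)
```

A dipeptide whose value in the table exceeds expected_per_mille would be shown as overrepresented, and one below it as underrepresented, relative to this uniform expectation.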

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
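As a small worked example of the unit conversion, the sketch below turns two values from the table (AsnIle and TrpTrp) into plain fractions and compares them with the uniform expectation of 1/400. The helper names are hypothetical and chosen for this illustration only.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def fold_vs_uniform(value_per_mille, n_dipeptides=400):
    """Ratio of an observed per mille value to the uniform expectation."""
    expected_per_mille = 1000.0 / n_dipeptides  # 2.5 per mille for 400 dipeptides
    return value_per_mille / expected_per_mille


# Values copied from the table, in per mille.
asn_ile = 12.204
trp_trp = 0.126

asn_ile_fraction = per_mille_to_fraction(asn_ile)  # about 0.012204
asn_ile_fold = fold_vs_uniform(asn_ile)            # above 1: overrepresented
trp_trp_fold = fold_vs_uniform(trp_trp)            # below 1: underrepresented
```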

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.907AlaAla: 1.907 ± 0.162
0.622AlaCys: 0.622 ± 0.063
1.572AlaAsp: 1.572 ± 0.117
1.851AlaGlu: 1.851 ± 0.129
1.509AlaPhe: 1.509 ± 0.111
2.047AlaGly: 2.047 ± 0.134
0.573AlaHis: 0.573 ± 0.065
3.116AlaIle: 3.116 ± 0.169
2.815AlaLys: 2.815 ± 0.172
3.325AlaLeu: 3.325 ± 0.163
0.817AlaMet: 0.817 ± 0.07
2.578AlaAsn: 2.578 ± 0.13
1.411AlaPro: 1.411 ± 0.118
1.823AlaGln: 1.823 ± 0.325
1.236AlaArg: 1.236 ± 0.084
2.403AlaSer: 2.403 ± 0.16
2.263AlaThr: 2.263 ± 0.267
1.579AlaVal: 1.579 ± 0.104
0.265AlaTrp: 0.265 ± 0.044
1.523AlaTyr: 1.523 ± 0.124
0.0AlaXaa: 0.0 ± 0.0
Cys
0.573CysAla: 0.573 ± 0.073
0.468CysCys: 0.468 ± 0.058
0.887CysAsp: 0.887 ± 0.085
0.831CysGlu: 0.831 ± 0.077
0.817CysPhe: 0.817 ± 0.075
1.076CysGly: 1.076 ± 0.083
0.258CysHis: 0.258 ± 0.042
1.411CysIle: 1.411 ± 0.109
1.837CysLys: 1.837 ± 0.137
1.383CysLeu: 1.383 ± 0.103
0.286CysMet: 0.286 ± 0.057
1.607CysAsn: 1.607 ± 0.129
0.72CysPro: 0.72 ± 0.098
0.545CysGln: 0.545 ± 0.061
0.58CysArg: 0.58 ± 0.063
1.16CysSer: 1.16 ± 0.085
0.866CysThr: 0.866 ± 0.082
0.699CysVal: 0.699 ± 0.072
0.105CysTrp: 0.105 ± 0.031
0.922CysTyr: 0.922 ± 0.078
0.0CysXaa: 0.0 ± 0.0
Asp
1.97AspAla: 1.97 ± 0.125
0.768AspCys: 0.768 ± 0.076
3.339AspAsp: 3.339 ± 0.222
3.828AspGlu: 3.828 ± 0.229
2.55AspPhe: 2.55 ± 0.173
2.48AspGly: 2.48 ± 0.155
0.845AspHis: 0.845 ± 0.102
7.076AspIle: 7.076 ± 0.257
5.917AspLys: 5.917 ± 0.226
4.771AspLeu: 4.771 ± 0.174
0.985AspMet: 0.985 ± 0.075
5.742AspAsn: 5.742 ± 0.241
1.753AspPro: 1.753 ± 0.119
1.362AspGln: 1.362 ± 0.086
1.628AspArg: 1.628 ± 0.102
3.556AspSer: 3.556 ± 0.215
2.71AspThr: 2.71 ± 0.131
2.096AspVal: 2.096 ± 0.115
0.552AspTrp: 0.552 ± 0.064
3.011AspTyr: 3.011 ± 0.152
0.0AspXaa: 0.0 ± 0.0
Glu
1.865GluAla: 1.865 ± 0.131
0.908GluCys: 0.908 ± 0.094
3.318GluAsp: 3.318 ± 0.241
4.317GluGlu: 4.317 ± 0.426
2.787GluPhe: 2.787 ± 0.146
1.977GluGly: 1.977 ± 0.124
0.88GluHis: 0.88 ± 0.081
5.588GluIle: 5.588 ± 0.209
6.357GluLys: 6.357 ± 0.238
5.784GluLeu: 5.784 ± 0.236
1.278GluMet: 1.278 ± 0.118
5.637GluAsn: 5.637 ± 0.219
1.69GluPro: 1.69 ± 0.142
2.34GluGln: 2.34 ± 0.245
2.326GluArg: 2.326 ± 0.134
3.039GluSer: 3.039 ± 0.167
3.206GluThr: 3.206 ± 0.146
2.382GluVal: 2.382 ± 0.148
0.824GluTrp: 0.824 ± 0.094
3.262GluTyr: 3.262 ± 0.183
0.0GluXaa: 0.0 ± 0.0
Phe
1.593PheAla: 1.593 ± 0.115
0.88PheCys: 0.88 ± 0.078
2.682PheAsp: 2.682 ± 0.138
2.11PheGlu: 2.11 ± 0.12
2.061PhePhe: 2.061 ± 0.151
2.019PheGly: 2.019 ± 0.131
0.671PheHis: 0.671 ± 0.069
4.883PheIle: 4.883 ± 0.202
3.667PheLys: 3.667 ± 0.161
3.667PheLeu: 3.667 ± 0.162
0.964PheMet: 0.964 ± 0.082
4.443PheAsn: 4.443 ± 0.158
1.376PhePro: 1.376 ± 0.106
1.264PheGln: 1.264 ± 0.116
1.271PheArg: 1.271 ± 0.095
3.234PheSer: 3.234 ± 0.149
2.599PheThr: 2.599 ± 0.139
2.061PheVal: 2.061 ± 0.112
0.405PheTrp: 0.405 ± 0.058
2.103PheTyr: 2.103 ± 0.144
0.0PheXaa: 0.0 ± 0.0
Gly
2.473GlyAla: 2.473 ± 0.527
1.006GlyCys: 1.006 ± 0.097
2.508GlyAsp: 2.508 ± 0.155
2.438GlyGlu: 2.438 ± 0.153
2.159GlyPhe: 2.159 ± 0.115
3.598GlyGly: 3.598 ± 0.257
0.922GlyHis: 0.922 ± 0.107
4.094GlyIle: 4.094 ± 0.192
3.646GlyLys: 3.646 ± 0.182
3.646GlyLeu: 3.646 ± 0.144
1.097GlyMet: 1.097 ± 0.113
3.15GlyAsn: 3.15 ± 0.163
1.25GlyPro: 1.25 ± 0.105
1.278GlyGln: 1.278 ± 0.104
1.628GlyArg: 1.628 ± 0.141
3.276GlySer: 3.276 ± 0.222
2.473GlyThr: 2.473 ± 0.149
2.319GlyVal: 2.319 ± 0.115
0.524GlyTrp: 0.524 ± 0.062
2.152GlyTyr: 2.152 ± 0.131
0.0GlyXaa: 0.0 ± 0.0
His
0.517HisAla: 0.517 ± 0.057
0.398HisCys: 0.398 ± 0.054
0.796HisAsp: 0.796 ± 0.068
0.775HisGlu: 0.775 ± 0.072
0.713HisPhe: 0.713 ± 0.079
0.796HisGly: 0.796 ± 0.081
0.349HisHis: 0.349 ± 0.045
1.837HisIle: 1.837 ± 0.123
1.83HisLys: 1.83 ± 0.165
1.264HisLeu: 1.264 ± 0.1
0.44HisMet: 0.44 ± 0.052
1.684HisAsn: 1.684 ± 0.122
0.81HisPro: 0.81 ± 0.09
0.51HisGln: 0.51 ± 0.051
0.566HisArg: 0.566 ± 0.055
0.992HisSer: 0.992 ± 0.081
1.083HisThr: 1.083 ± 0.105
0.58HisVal: 0.58 ± 0.073
0.196HisTrp: 0.196 ± 0.04
0.887HisTyr: 0.887 ± 0.08
0.0HisXaa: 0.0 ± 0.0
Ile
3.164IleAla: 3.164 ± 0.168
1.697IleCys: 1.697 ± 0.112
6.692IleAsp: 6.692 ± 0.248
6.098IleGlu: 6.098 ± 0.22
4.317IlePhe: 4.317 ± 0.196
3.695IleGly: 3.695 ± 0.167
1.614IleHis: 1.614 ± 0.136
10.136IleIle: 10.136 ± 0.413
9.284IleLys: 9.284 ± 0.285
9.067IleLeu: 9.067 ± 0.266
1.949IleMet: 1.949 ± 0.141
9.675IleAsn: 9.675 ± 0.345
3.835IlePro: 3.835 ± 0.169
2.773IleGln: 2.773 ± 0.14
2.648IleArg: 2.648 ± 0.149
7.16IleSer: 7.16 ± 0.27
5.051IleThr: 5.051 ± 0.21
4.233IleVal: 4.233 ± 0.193
0.789IleTrp: 0.789 ± 0.074
5.141IleTyr: 5.141 ± 0.233
0.0IleXaa: 0.0 ± 0.0
Lys
2.661LysAla: 2.661 ± 0.164
1.593LysCys: 1.593 ± 0.113
5.449LysAsp: 5.449 ± 0.236
5.707LysGlu: 5.707 ± 0.332
3.849LysPhe: 3.849 ± 0.178
3.269LysGly: 3.269 ± 0.176
1.767LysHis: 1.767 ± 0.112
9.375LysIle: 9.375 ± 0.314
10.325LysLys: 10.325 ± 0.449
8.131LysLeu: 8.131 ± 0.248
1.935LysMet: 1.935 ± 0.127
9.368LysAsn: 9.368 ± 0.347
2.955LysPro: 2.955 ± 0.183
3.444LysGln: 3.444 ± 0.175
3.625LysArg: 3.625 ± 0.182
5.742LysSer: 5.742 ± 0.231
5.358LysThr: 5.358 ± 0.203
3.702LysVal: 3.702 ± 0.206
0.922LysTrp: 0.922 ± 0.073
5.784LysTyr: 5.784 ± 0.265
0.0LysXaa: 0.0 ± 0.0
Leu
3.353LeuAla: 3.353 ± 0.167
1.306LeuCys: 1.306 ± 0.098
5.588LeuAsp: 5.588 ± 0.204
5.896LeuGlu: 5.896 ± 0.256
4.45LeuPhe: 4.45 ± 0.174
3.898LeuGly: 3.898 ± 0.187
1.46LeuHis: 1.46 ± 0.106
7.747LeuIle: 7.747 ± 0.274
7.551LeuLys: 7.551 ± 0.226
8.55LeuLeu: 8.55 ± 0.275
1.313LeuMet: 1.313 ± 0.108
7.069LeuAsn: 7.069 ± 0.257
3.514LeuPro: 3.514 ± 0.182
3.067LeuGln: 3.067 ± 0.177
2.62LeuArg: 2.62 ± 0.137
6.504LeuSer: 6.504 ± 0.253
4.736LeuThr: 4.736 ± 0.169
4.555LeuVal: 4.555 ± 0.199
0.685LeuTrp: 0.685 ± 0.081
4.436LeuTyr: 4.436 ± 0.187
0.0LeuXaa: 0.0 ± 0.0
Met
0.859MetAla: 0.859 ± 0.08
0.279MetCys: 0.279 ± 0.04
1.034MetAsp: 1.034 ± 0.075
1.222MetGlu: 1.222 ± 0.1
0.873MetPhe: 0.873 ± 0.094
0.936MetGly: 0.936 ± 0.094
0.398MetHis: 0.398 ± 0.064
1.67MetIle: 1.67 ± 0.101
1.746MetLys: 1.746 ± 0.106
1.635MetLeu: 1.635 ± 0.112
0.398MetMet: 0.398 ± 0.063
1.697MetAsn: 1.697 ± 0.118
1.111MetPro: 1.111 ± 0.145
0.713MetGln: 0.713 ± 0.073
0.796MetArg: 0.796 ± 0.081
1.781MetSer: 1.781 ± 0.118
1.362MetThr: 1.362 ± 0.109
0.74MetVal: 0.74 ± 0.087
0.258MetTrp: 0.258 ± 0.046
0.713MetTyr: 0.713 ± 0.082
0.0MetXaa: 0.0 ± 0.0
Asn
2.424AsnAla: 2.424 ± 0.141
1.327AsnCys: 1.327 ± 0.111
4.114AsnAsp: 4.114 ± 0.192
4.666AsnGlu: 4.666 ± 0.205
3.779AsnPhe: 3.779 ± 0.176
3.276AsnGly: 3.276 ± 0.201
1.383AsnHis: 1.383 ± 0.108
12.204AsnIle: 12.204 ± 0.387
9.305AsnLys: 9.305 ± 0.295
8.012AsnLeu: 8.012 ± 0.284
2.047AsnMet: 2.047 ± 0.127
10.66AsnAsn: 10.66 ± 0.407
3.304AsnPro: 3.304 ± 0.177
2.822AsnGln: 2.822 ± 0.142
3.032AsnArg: 3.032 ± 0.15
6.266AsnSer: 6.266 ± 0.231
4.869AsnThr: 4.869 ± 0.197
3.143AsnVal: 3.143 ± 0.161
0.706AsnTrp: 0.706 ± 0.069
4.715AsnTyr: 4.715 ± 0.217
0.0AsnXaa: 0.0 ± 0.0
Pro
1.348ProAla: 1.348 ± 0.139
0.461ProCys: 0.461 ± 0.058
2.131ProAsp: 2.131 ± 0.147
2.291ProGlu: 2.291 ± 0.233
1.593ProPhe: 1.593 ± 0.112
1.809ProGly: 1.809 ± 0.154
0.671ProHis: 0.671 ± 0.076
3.283ProIle: 3.283 ± 0.15
3.13ProLys: 3.13 ± 0.197
2.885ProLeu: 2.885 ± 0.163
0.671ProMet: 0.671 ± 0.069
3.004ProAsn: 3.004 ± 0.152
1.928ProPro: 1.928 ± 0.211
1.271ProGln: 1.271 ± 0.127
1.348ProArg: 1.348 ± 0.144
2.508ProSer: 2.508 ± 0.182
2.089ProThr: 2.089 ± 0.135
1.809ProVal: 1.809 ± 0.123
0.412ProTrp: 0.412 ± 0.056
1.767ProTyr: 1.767 ± 0.119
0.0ProXaa: 0.0 ± 0.0
Gln
1.306GlnAla: 1.306 ± 0.115
0.622GlnCys: 0.622 ± 0.068
1.725GlnAsp: 1.725 ± 0.115
2.291GlnGlu: 2.291 ± 0.147
1.516GlnPhe: 1.516 ± 0.099
1.809GlnGly: 1.809 ± 0.405
0.587GlnHis: 0.587 ± 0.067
2.878GlnIle: 2.878 ± 0.173
3.004GlnLys: 3.004 ± 0.145
3.451GlnLeu: 3.451 ± 0.159
0.726GlnMet: 0.726 ± 0.08
2.668GlnAsn: 2.668 ± 0.134
1.593GlnPro: 1.593 ± 0.224
1.851GlnGln: 1.851 ± 0.18
1.062GlnArg: 1.062 ± 0.074
2.207GlnSer: 2.207 ± 0.137
1.53GlnThr: 1.53 ± 0.128
1.383GlnVal: 1.383 ± 0.146
0.328GlnTrp: 0.328 ± 0.048
1.781GlnTyr: 1.781 ± 0.108
0.0GlnXaa: 0.0 ± 0.0
Arg
1.174ArgAla: 1.174 ± 0.095
0.545ArgCys: 0.545 ± 0.061
2.04ArgAsp: 2.04 ± 0.119
3.199ArgGlu: 3.199 ± 0.198
1.334ArgPhe: 1.334 ± 0.097
1.753ArgGly: 1.753 ± 0.149
0.733ArgHis: 0.733 ± 0.071
2.438ArgIle: 2.438 ± 0.14
3.22ArgLys: 3.22 ± 0.17
2.892ArgLeu: 2.892 ± 0.142
0.733ArgMet: 0.733 ± 0.069
2.487ArgAsn: 2.487 ± 0.118
1.369ArgPro: 1.369 ± 0.107
1.446ArgGln: 1.446 ± 0.119
1.837ArgArg: 1.837 ± 0.166
2.054ArgSer: 2.054 ± 0.146
1.467ArgThr: 1.467 ± 0.119
1.725ArgVal: 1.725 ± 0.134
0.461ArgTrp: 0.461 ± 0.06
1.656ArgTyr: 1.656 ± 0.121
0.0ArgXaa: 0.0 ± 0.0
Ser
2.326SerAla: 2.326 ± 0.171
1.153SerCys: 1.153 ± 0.094
4.394SerAsp: 4.394 ± 0.194
3.856SerGlu: 3.856 ± 0.209
2.536SerPhe: 2.536 ± 0.129
3.388SerGly: 3.388 ± 0.175
1.083SerHis: 1.083 ± 0.094
6.056SerIle: 6.056 ± 0.224
6.629SerLys: 6.629 ± 0.221
5.784SerLeu: 5.784 ± 0.214
1.264SerMet: 1.264 ± 0.101
6.189SerAsn: 6.189 ± 0.224
2.089SerPro: 2.089 ± 0.17
2.787SerGln: 2.787 ± 0.186
2.613SerArg: 2.613 ± 0.175
4.799SerSer: 4.799 ± 0.232
3.863SerThr: 3.863 ± 0.184
2.759SerVal: 2.759 ± 0.176
0.594SerTrp: 0.594 ± 0.066
2.843SerTyr: 2.843 ± 0.156
0.0SerXaa: 0.0 ± 0.0
Thr
1.991ThrAla: 1.991 ± 0.129
0.936ThrCys: 0.936 ± 0.089
2.962ThrAsp: 2.962 ± 0.138
2.899ThrGlu: 2.899 ± 0.154
2.354ThrPhe: 2.354 ± 0.139
3.067ThrGly: 3.067 ± 0.362
0.985ThrHis: 0.985 ± 0.089
5.246ThrIle: 5.246 ± 0.199
4.806ThrLys: 4.806 ± 0.196
4.967ThrLeu: 4.967 ± 0.189
1.118ThrMet: 1.118 ± 0.091
4.666ThrAsn: 4.666 ± 0.191
2.242ThrPro: 2.242 ± 0.159
1.907ThrGln: 1.907 ± 0.119
2.277ThrArg: 2.277 ± 0.137
3.744ThrSer: 3.744 ± 0.177
3.297ThrThr: 3.297 ± 0.179
2.34ThrVal: 2.34 ± 0.144
0.503ThrTrp: 0.503 ± 0.066
2.438ThrTyr: 2.438 ± 0.131
0.0ThrXaa: 0.0 ± 0.0
Val
1.795ValAla: 1.795 ± 0.117
0.824ValCys: 0.824 ± 0.078
2.522ValAsp: 2.522 ± 0.154
2.466ValGlu: 2.466 ± 0.148
1.76ValPhe: 1.76 ± 0.1
2.012ValGly: 2.012 ± 0.135
0.789ValHis: 0.789 ± 0.084
3.821ValIle: 3.821 ± 0.191
3.996ValLys: 3.996 ± 0.222
3.688ValLeu: 3.688 ± 0.182
0.859ValMet: 0.859 ± 0.074
3.493ValAsn: 3.493 ± 0.138
1.781ValPro: 1.781 ± 0.161
1.32ValGln: 1.32 ± 0.089
1.383ValArg: 1.383 ± 0.124
2.948ValSer: 2.948 ± 0.154
2.515ValThr: 2.515 ± 0.126
2.396ValVal: 2.396 ± 0.198
0.447ValTrp: 0.447 ± 0.064
1.998ValTyr: 1.998 ± 0.115
0.0ValXaa: 0.0 ± 0.0
Trp
0.489TrpAla: 0.489 ± 0.068
0.224TrpCys: 0.224 ± 0.039
0.461TrpAsp: 0.461 ± 0.051
0.601TrpGlu: 0.601 ± 0.066
0.573TrpPhe: 0.573 ± 0.061
0.566TrpGly: 0.566 ± 0.063
0.196TrpHis: 0.196 ± 0.03
0.775TrpIle: 0.775 ± 0.069
0.915TrpLys: 0.915 ± 0.079
0.692TrpLeu: 0.692 ± 0.073
0.238TrpMet: 0.238 ± 0.04
0.838TrpAsn: 0.838 ± 0.096
0.217TrpPro: 0.217 ± 0.04
0.335TrpGln: 0.335 ± 0.052
0.335TrpArg: 0.335 ± 0.05
0.664TrpSer: 0.664 ± 0.062
0.524TrpThr: 0.524 ± 0.068
0.426TrpVal: 0.426 ± 0.052
0.126TrpTrp: 0.126 ± 0.037
0.384TrpTyr: 0.384 ± 0.052
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.446TyrAla: 1.446 ± 0.097
1.013TyrCys: 1.013 ± 0.093
2.724TyrAsp: 2.724 ± 0.16
2.424TyrGlu: 2.424 ± 0.15
2.431TyrPhe: 2.431 ± 0.152
2.166TyrGly: 2.166 ± 0.135
0.859TyrHis: 0.859 ± 0.082
5.386TyrIle: 5.386 ± 0.227
4.981TyrLys: 4.981 ± 0.198
4.701TyrLeu: 4.701 ± 0.183
1.097TyrMet: 1.097 ± 0.079
5.4TyrAsn: 5.4 ± 0.235
1.46TyrPro: 1.46 ± 0.108
1.46TyrGln: 1.46 ± 0.108
1.753TyrArg: 1.753 ± 0.11
2.969TyrSer: 2.969 ± 0.164
2.885TyrThr: 2.885 ± 0.147
1.886TyrVal: 1.886 ± 0.111
0.475TyrTrp: 0.475 ± 0.06
2.906TyrTyr: 2.906 ± 0.187
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 512 proteins (143154 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
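A protein-level bootstrap of this kind can be sketched as follows. The note states only that 100 resamples were drawn at the protein level; the rest is assumed for illustration: proteins are resampled with replacement, the statistic is recomputed on each resample, and the error is reported as the sample standard deviation of the resampled values. The function names are hypothetical.

```python
import random
from statistics import stdev


def dipeptide_frequency(proteins, dipeptide):
    """Per mille frequency of one dipeptide (one-letter codes, e.g. "KK")."""
    hits = 0
    total = 0
    for seq in proteins:
        pairs = [a + b for a, b in zip(seq, seq[1:])]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error by resampling whole proteins with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome contains, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(resample, dipeptide))
    # Assumption: the reported error is the standard deviation of the estimates.
    return stdev(estimates)


# Toy input (hypothetical sequences, not taken from the proteome).
toy_proteins = ["MKKLN", "MNIIK", "MKKKK", "MLLNI"]
kk_error = bootstrap_error(toy_proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the estimated error reflects variation between proteins in the proteome.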

The above dipeptide statistics (among other stats for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski