Amino acid dipeptide frequency for Tetraselmis virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
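As an illustration of how such frequencies can be obtained, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter codes, the choice to count overlapping pairs, and the choice never to count a pair spanning two proteins are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Hypothetical helper: counts overlapping adjacent pairs within each
    protein; pairs never span the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: every non-standard or unknown residue is mapped to X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real Tetraselmis virus 1 sequences.
freqs = dipeptide_permille(["MAAS", "AAK"])

# Uniform expectation for 400 equally likely dipeptides: 1/400 = 2.5 per mille.
uniform_expectation = 1000.0 / 400
```

In the toy input there are five pairs (MA, AA, AS, AA, AK), so `freqs["AA"]` is 2/5, i.e. 400‰, far above the uniform expectation of 2.5‰.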

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
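The unit conversion can be sketched in a few lines of Python. The three values are taken from the table below; the helper `describe` is hypothetical, and it assumes that a dipeptide counts as over- or underrepresented simply by comparison with the uniform expectation of 1/400. The exact rule used for colouring on the page may differ.

```python
# Uniform expectation for 400 equally likely dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_PERMILLE = 1000.0 / 400


def describe(permille):
    """Convert a per mille value to (fraction, percent, status)."""
    fraction = permille * 1e-3   # per mille -> fraction
    percent = permille / 10.0    # per mille -> percent
    if permille > UNIFORM_PERMILLE:
        status = "over"
    elif permille < UNIFORM_PERMILLE:
        status = "under"
    else:
        status = "as expected"
    return fraction, percent, status


# Values copied from the table (in per mille).
examples = {"AlaAla": 3.662, "AlaCys": 0.994, "SerSer": 8.17}
for name, value in examples.items():
    fraction, percent, status = describe(value)
    print(f"{name}: {value} per mille = {percent:.4f} % ({status})")
```

Under this assumption AlaAla and SerSer come out as overrepresented and AlaCys as underrepresented.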

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.662 ± 0.222
AlaCys: 0.994 ± 0.097
AlaAsp: 3.187 ± 0.207
AlaGlu: 3.178 ± 0.146
AlaPhe: 2.496 ± 0.135
AlaGly: 3.067 ± 0.22
AlaHis: 0.758 ± 0.068
AlaIle: 3.61 ± 0.139
AlaLys: 3.23 ± 0.171
AlaLeu: 4.152 ± 0.161
AlaMet: 1.531 ± 0.097
AlaAsn: 2.89 ± 0.177
AlaPro: 2.122 ± 0.136
AlaGln: 1.43 ± 0.09
AlaArg: 1.915 ± 0.111
AlaSer: 4.598 ± 0.2
AlaThr: 3.922 ± 0.421
AlaVal: 4.022 ± 0.177
AlaTrp: 0.883 ± 0.147
AlaTyr: 1.814 ± 0.092
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.835 ± 0.078
CysCys: 0.461 ± 0.053
CysAsp: 1.32 ± 0.115
CysGlu: 0.965 ± 0.103
CysPhe: 0.706 ± 0.06
CysGly: 1.013 ± 0.091
CysHis: 0.317 ± 0.049
CysIle: 1.099 ± 0.088
CysLys: 1.435 ± 0.108
CysLeu: 1.406 ± 0.106
CysMet: 0.49 ± 0.063
CysAsn: 1.046 ± 0.094
CysPro: 0.811 ± 0.09
CysGln: 0.394 ± 0.054
CysArg: 0.883 ± 0.066
CysSer: 1.344 ± 0.112
CysThr: 0.811 ± 0.081
CysVal: 1.118 ± 0.096
CysTrp: 0.192 ± 0.031
CysTyr: 0.768 ± 0.073
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.195 ± 0.296
AspCys: 0.965 ± 0.084
AspAsp: 5.309 ± 0.468
AspGlu: 4.051 ± 0.192
AspPhe: 2.866 ± 0.125
AspGly: 4.43 ± 0.511
AspHis: 1.133 ± 0.109
AspIle: 5.17 ± 0.16
AspLys: 4.056 ± 0.232
AspLeu: 5.189 ± 0.157
AspMet: 1.8 ± 0.112
AspAsn: 4.79 ± 0.25
AspPro: 3.0 ± 0.143
AspGln: 1.69 ± 0.092
AspArg: 2.414 ± 0.142
AspSer: 5.342 ± 0.21
AspThr: 4.613 ± 0.168
AspVal: 4.819 ± 0.214
AspTrp: 0.715 ± 0.05
AspTyr: 2.702 ± 0.115
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.765 ± 0.156
GluCys: 1.066 ± 0.096
GluAsp: 4.507 ± 0.199
GluGlu: 4.67 ± 0.341
GluPhe: 2.17 ± 0.115
GluGly: 2.731 ± 0.152
GluHis: 1.301 ± 0.088
GluIle: 4.656 ± 0.189
GluLys: 4.363 ± 0.309
GluLeu: 4.752 ± 0.176
GluMet: 1.858 ± 0.142
GluAsn: 3.662 ± 0.145
GluPro: 1.949 ± 0.162
GluGln: 1.757 ± 0.106
GluArg: 2.338 ± 0.128
GluSer: 4.79 ± 0.154
GluThr: 3.523 ± 0.198
GluVal: 2.904 ± 0.175
GluTrp: 0.835 ± 0.07
GluTyr: 2.4 ± 0.122
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.882 ± 0.093
PheCys: 0.749 ± 0.067
PheAsp: 3.077 ± 0.147
PheGlu: 2.357 ± 0.122
PhePhe: 1.738 ± 0.106
PheGly: 2.054 ± 0.099
PheHis: 0.696 ± 0.06
PheIle: 2.573 ± 0.11
PheLys: 2.333 ± 0.131
PheLeu: 3.221 ± 0.158
PheMet: 1.162 ± 0.081
PheAsn: 2.462 ± 0.12
PhePro: 1.584 ± 0.086
PheGln: 1.176 ± 0.078
PheArg: 1.757 ± 0.104
PheSer: 3.859 ± 0.234
PheThr: 2.808 ± 0.166
PheVal: 2.981 ± 0.129
PheTrp: 0.365 ± 0.04
PheTyr: 1.666 ± 0.09
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.779 ± 0.161
GlyCys: 0.902 ± 0.083
GlyAsp: 3.907 ± 0.335
GlyGlu: 2.707 ± 0.163
GlyPhe: 2.41 ± 0.132
GlyGly: 4.603 ± 0.615
GlyHis: 0.83 ± 0.065
GlyIle: 4.042 ± 0.186
GlyLys: 3.379 ± 0.193
GlyLeu: 4.114 ± 0.164
GlyMet: 1.32 ± 0.09
GlyAsn: 4.166 ± 0.295
GlyPro: 1.387 ± 0.087
GlyGln: 1.248 ± 0.111
GlyArg: 1.987 ± 0.175
GlySer: 5.294 ± 0.317
GlyThr: 3.994 ± 0.301
GlyVal: 3.446 ± 0.164
GlyTrp: 0.778 ± 0.088
GlyTyr: 2.198 ± 0.112
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.027 ± 0.08
HisCys: 0.384 ± 0.046
HisAsp: 1.152 ± 0.088
HisGlu: 1.037 ± 0.075
HisPhe: 0.778 ± 0.071
HisGly: 0.936 ± 0.089
HisHis: 0.514 ± 0.058
HisIle: 1.397 ± 0.101
HisLys: 1.248 ± 0.107
HisLeu: 1.709 ± 0.136
HisMet: 0.504 ± 0.059
HisAsn: 0.888 ± 0.078
HisPro: 1.114 ± 0.09
HisGln: 0.586 ± 0.07
HisArg: 0.97 ± 0.092
HisSer: 1.33 ± 0.084
HisThr: 1.152 ± 0.088
HisVal: 1.349 ± 0.096
HisTrp: 0.178 ± 0.027
HisTyr: 0.806 ± 0.07
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.509 ± 0.162
IleCys: 1.258 ± 0.102
IleAsp: 5.587 ± 0.207
IleGlu: 4.094 ± 0.169
IlePhe: 2.462 ± 0.122
IleGly: 3.312 ± 0.153
IleHis: 1.363 ± 0.093
IleIle: 4.8 ± 0.226
IleLys: 4.603 ± 0.181
IleLeu: 5.414 ± 0.225
IleMet: 1.91 ± 0.148
IleAsn: 4.253 ± 0.147
IlePro: 3.317 ± 0.154
IleGln: 2.309 ± 0.111
IleArg: 3.427 ± 0.153
IleSer: 5.827 ± 0.167
IleThr: 4.891 ± 0.236
IleVal: 4.555 ± 0.162
IleTrp: 0.667 ± 0.058
IleTyr: 2.741 ± 0.234
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.374 ± 0.201
LysCys: 1.243 ± 0.099
LysAsp: 4.368 ± 0.221
LysGlu: 4.565 ± 0.298
LysPhe: 1.92 ± 0.099
LysGly: 2.971 ± 0.17
LysHis: 1.666 ± 0.128
LysIle: 4.949 ± 0.257
LysLys: 6.346 ± 0.406
LysLeu: 4.968 ± 0.271
LysMet: 2.006 ± 0.141
LysAsn: 4.666 ± 0.271
LysPro: 2.611 ± 0.168
LysGln: 2.29 ± 0.129
LysArg: 3.379 ± 0.213
LysSer: 4.776 ± 0.168
LysThr: 4.469 ± 0.196
LysVal: 3.12 ± 0.198
LysTrp: 0.821 ± 0.078
LysTyr: 2.818 ± 0.17
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.854 ± 0.163
LeuCys: 1.378 ± 0.103
LeuAsp: 4.954 ± 0.169
LeuGlu: 4.531 ± 0.229
LeuPhe: 3.206 ± 0.154
LeuGly: 3.696 ± 0.146
LeuHis: 1.507 ± 0.104
LeuIle: 4.738 ± 0.209
LeuLys: 5.698 ± 0.283
LeuLeu: 6.178 ± 0.211
LeuMet: 2.054 ± 0.103
LeuAsn: 4.44 ± 0.186
LeuPro: 2.962 ± 0.144
LeuGln: 2.957 ± 0.147
LeuArg: 3.523 ± 0.148
LeuSer: 7.181 ± 0.207
LeuThr: 4.982 ± 0.21
LeuVal: 4.661 ± 0.168
LeuTrp: 0.797 ± 0.061
LeuTyr: 3.12 ± 0.133
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.858 ± 0.212
MetCys: 0.422 ± 0.058
MetAsp: 1.373 ± 0.1
MetGlu: 1.349 ± 0.092
MetPhe: 1.238 ± 0.094
MetGly: 1.205 ± 0.097
MetHis: 0.547 ± 0.063
MetIle: 1.814 ± 0.108
MetLys: 2.15 ± 0.114
MetLeu: 2.059 ± 0.117
MetMet: 0.965 ± 0.079
MetAsn: 1.766 ± 0.098
MetPro: 0.883 ± 0.082
MetGln: 0.85 ± 0.071
MetArg: 1.344 ± 0.084
MetSer: 2.227 ± 0.103
MetThr: 1.757 ± 0.093
MetVal: 1.522 ± 0.084
MetTrp: 0.283 ± 0.036
MetTyr: 1.133 ± 0.087
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.178 ± 0.139
AsnCys: 0.95 ± 0.085
AsnAsp: 4.387 ± 0.177
AsnGlu: 3.408 ± 0.13
AsnPhe: 1.978 ± 0.093
AsnGly: 3.485 ± 0.181
AsnHis: 1.099 ± 0.104
AsnIle: 4.968 ± 0.208
AsnLys: 4.09 ± 0.235
AsnLeu: 4.339 ± 0.164
AsnMet: 1.594 ± 0.09
AsnAsn: 4.162 ± 0.181
AsnPro: 2.928 ± 0.134
AsnGln: 1.901 ± 0.11
AsnArg: 2.477 ± 0.137
AsnSer: 4.944 ± 0.212
AsnThr: 4.747 ± 0.259
AsnVal: 4.099 ± 0.186
AsnTrp: 0.614 ± 0.068
AsnTyr: 2.218 ± 0.118
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.126 ± 0.125
ProCys: 0.667 ± 0.067
ProAsp: 3.427 ± 0.14
ProGlu: 2.981 ± 0.179
ProPhe: 1.488 ± 0.093
ProGly: 2.098 ± 0.166
ProHis: 0.864 ± 0.071
ProIle: 2.506 ± 0.12
ProLys: 2.549 ± 0.157
ProLeu: 2.77 ± 0.14
ProMet: 0.811 ± 0.07
ProAsn: 1.963 ± 0.12
ProPro: 3.211 ± 0.459
ProGln: 1.018 ± 0.076
ProArg: 1.642 ± 0.142
ProSer: 3.888 ± 0.255
ProThr: 2.774 ± 0.318
ProVal: 3.61 ± 0.155
ProTrp: 0.427 ± 0.045
ProTyr: 1.56 ± 0.1
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.44 ± 0.087
GlnCys: 0.466 ± 0.06
GlnAsp: 1.858 ± 0.12
GlnGlu: 1.944 ± 0.165
GlnPhe: 1.181 ± 0.088
GlnGly: 1.637 ± 0.116
GlnHis: 0.773 ± 0.08
GlnIle: 2.232 ± 0.112
GlnLys: 1.867 ± 0.141
GlnLeu: 2.294 ± 0.11
GlnMet: 0.869 ± 0.077
GlnAsn: 1.766 ± 0.092
GlnPro: 1.219 ± 0.093
GlnGln: 1.258 ± 0.146
GlnArg: 1.454 ± 0.129
GlnSer: 2.141 ± 0.104
GlnThr: 1.685 ± 0.087
GlnVal: 1.805 ± 0.106
GlnTrp: 0.346 ± 0.036
GlnTyr: 1.411 ± 0.091
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.098 ± 0.13
ArgCys: 0.864 ± 0.073
ArgAsp: 2.578 ± 0.156
ArgGlu: 2.261 ± 0.167
ArgPhe: 2.011 ± 0.12
ArgGly: 2.083 ± 0.159
ArgHis: 0.902 ± 0.083
ArgIle: 3.418 ± 0.157
ArgLys: 3.101 ± 0.208
ArgLeu: 3.696 ± 0.19
ArgMet: 1.306 ± 0.074
ArgAsn: 2.798 ± 0.147
ArgPro: 1.589 ± 0.134
ArgGln: 1.229 ± 0.083
ArgArg: 2.251 ± 0.155
ArgSer: 2.88 ± 0.137
ArgThr: 2.242 ± 0.119
ArgVal: 2.333 ± 0.127
ArgTrp: 0.562 ± 0.06
ArgTyr: 1.915 ± 0.099
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.488 ± 0.354
SerCys: 1.368 ± 0.114
SerAsp: 5.87 ± 0.277
SerGlu: 5.098 ± 0.238
SerPhe: 3.917 ± 0.215
SerGly: 5.899 ± 0.373
SerHis: 1.459 ± 0.109
SerIle: 5.592 ± 0.182
SerLys: 5.41 ± 0.279
SerLeu: 6.346 ± 0.183
SerMet: 1.91 ± 0.099
SerAsn: 4.982 ± 0.249
SerPro: 3.274 ± 0.218
SerGln: 2.222 ± 0.097
SerArg: 3.062 ± 0.173
SerSer: 8.17 ± 0.501
SerThr: 5.266 ± 0.359
SerVal: 6.106 ± 0.242
SerTrp: 1.027 ± 0.08
SerTyr: 3.149 ± 0.147
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.133 ± 0.45
ThrCys: 0.907 ± 0.087
ThrAsp: 4.536 ± 0.265
ThrGlu: 3.792 ± 0.192
ThrPhe: 2.746 ± 0.15
ThrGly: 4.258 ± 0.229
ThrHis: 1.027 ± 0.078
ThrIle: 4.44 ± 0.245
ThrLys: 3.984 ± 0.155
ThrLeu: 4.699 ± 0.196
ThrMet: 1.651 ± 0.141
ThrAsn: 3.725 ± 0.194
ThrPro: 3.581 ± 0.378
ThrGln: 1.92 ± 0.176
ThrArg: 2.462 ± 0.116
ThrSer: 5.933 ± 0.521
ThrThr: 4.934 ± 0.49
ThrVal: 4.992 ± 0.244
ThrTrp: 0.61 ± 0.059
ThrTyr: 2.477 ± 0.214
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.701 ± 0.19
ValCys: 1.363 ± 0.098
ValAsp: 4.416 ± 0.203
ValGlu: 3.259 ± 0.157
ValPhe: 3.038 ± 0.141
ValGly: 3.149 ± 0.161
ValHis: 1.286 ± 0.103
ValIle: 4.474 ± 0.178
ValLys: 3.739 ± 0.155
ValLeu: 5.29 ± 0.196
ValMet: 1.723 ± 0.091
ValAsn: 3.72 ± 0.21
ValPro: 2.99 ± 0.112
ValGln: 1.978 ± 0.115
ValArg: 2.544 ± 0.126
ValSer: 6.014 ± 0.318
ValThr: 4.718 ± 0.415
ValVal: 4.795 ± 0.189
ValTrp: 0.706 ± 0.056
ValTyr: 2.99 ± 0.194
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.446 ± 0.052
TrpCys: 0.298 ± 0.045
TrpAsp: 0.533 ± 0.048
TrpGlu: 0.523 ± 0.053
TrpPhe: 0.494 ± 0.051
TrpGly: 0.547 ± 0.057
TrpHis: 0.154 ± 0.027
TrpIle: 1.195 ± 0.189
TrpLys: 0.955 ± 0.079
TrpLeu: 0.984 ± 0.071
TrpMet: 0.374 ± 0.037
TrpAsn: 0.821 ± 0.071
TrpPro: 0.355 ± 0.044
TrpGln: 0.274 ± 0.037
TrpArg: 0.614 ± 0.061
TrpSer: 0.816 ± 0.066
TrpThr: 0.715 ± 0.06
TrpVal: 0.61 ± 0.062
TrpTrp: 0.139 ± 0.026
TrpTyr: 0.49 ± 0.052
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.112 ± 0.146
TyrCys: 0.715 ± 0.065
TyrAsp: 2.904 ± 0.154
TyrGlu: 2.227 ± 0.135
TyrPhe: 1.656 ± 0.093
TyrGly: 2.318 ± 0.113
TyrHis: 0.859 ± 0.071
TyrIle: 2.63 ± 0.147
TyrLys: 2.789 ± 0.149
TyrLeu: 2.837 ± 0.123
TyrMet: 0.902 ± 0.06
TyrAsn: 2.597 ± 0.124
TyrPro: 1.493 ± 0.087
TyrGln: 1.162 ± 0.076
TyrArg: 1.67 ± 0.097
TyrSer: 3.259 ± 0.149
TyrThr: 2.803 ± 0.215
TyrVal: 3.048 ± 0.237
TyrTrp: 0.394 ± 0.05
TyrTyr: 2.318 ± 0.276
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 653 proteins (208336 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
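A protein-level bootstrap can be sketched as follows. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the helper names `pair_permille` and `bootstrap_error` are hypothetical, the "×100" is read as 100 resampling rounds, and the sample standard deviation across rounds is taken as the reported error.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy sequences, not real Tetraselmis virus 1 proteins.
toy = ["MAAS", "AAK", "MKLLS", "MSSAA"]
err = bootstrap_error(toy, "AA")
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the error reflects variation between proteins.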

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski