Amino acid dipepetide frequency for Cedratvirus lausannensis

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.537AlaAla: 4.537 ± 0.352
1.344AlaCys: 1.344 ± 0.101
2.423AlaAsp: 2.423 ± 0.13
3.162AlaGlu: 3.162 ± 0.158
2.43AlaPhe: 2.43 ± 0.122
2.953AlaGly: 2.953 ± 0.198
1.073AlaHis: 1.073 ± 0.085
3.13AlaIle: 3.13 ± 0.146
3.067AlaLys: 3.067 ± 0.146
6.266AlaLeu: 6.266 ± 0.235
1.092AlaMet: 1.092 ± 0.072
2.65AlaAsn: 2.65 ± 0.158
1.742AlaPro: 1.742 ± 0.12
2.291AlaGln: 2.291 ± 0.17
3.881AlaArg: 3.881 ± 0.207
4.152AlaSer: 4.152 ± 0.21
3.174AlaThr: 3.174 ± 0.339
3.004AlaVal: 3.004 ± 0.137
0.663AlaTrp: 0.663 ± 0.073
2.613AlaTyr: 2.613 ± 0.119
0.0AlaXaa: 0.0 ± 0.0
Cys
1.313CysAla: 1.313 ± 0.115
0.473CysCys: 0.473 ± 0.052
1.149CysAsp: 1.149 ± 0.1
1.054CysGlu: 1.054 ± 0.094
1.13CysPhe: 1.13 ± 0.08
0.877CysGly: 0.877 ± 0.097
0.404CysHis: 0.404 ± 0.055
1.066CysIle: 1.066 ± 0.082
1.666CysLys: 1.666 ± 0.134
2.379CysLeu: 2.379 ± 0.131
0.574CysMet: 0.574 ± 0.058
0.89CysAsn: 0.89 ± 0.085
2.499CysPro: 2.499 ± 0.24
0.581CysGln: 0.581 ± 0.066
1.224CysArg: 1.224 ± 0.104
3.048CysSer: 3.048 ± 0.191
1.212CysThr: 1.212 ± 0.111
1.414CysVal: 1.414 ± 0.102
0.189CysTrp: 0.189 ± 0.045
1.066CysTyr: 1.066 ± 0.091
0.0CysXaa: 0.0 ± 0.0
Asp
2.398AspAla: 2.398 ± 0.121
1.174AspCys: 1.174 ± 0.103
2.783AspAsp: 2.783 ± 0.154
4.285AspGlu: 4.285 ± 0.178
2.821AspPhe: 2.821 ± 0.129
2.575AspGly: 2.575 ± 0.171
0.846AspHis: 0.846 ± 0.083
3.603AspIle: 3.603 ± 0.147
3.988AspLys: 3.988 ± 0.175
6.222AspLeu: 6.222 ± 0.221
1.249AspMet: 1.249 ± 0.091
2.316AspAsn: 2.316 ± 0.133
1.571AspPro: 1.571 ± 0.102
1.256AspGln: 1.256 ± 0.084
2.455AspArg: 2.455 ± 0.115
2.524AspSer: 2.524 ± 0.107
2.24AspThr: 2.24 ± 0.137
3.446AspVal: 3.446 ± 0.148
0.846AspTrp: 0.846 ± 0.082
2.979AspTyr: 2.979 ± 0.144
0.0AspXaa: 0.0 ± 0.0
Glu
3.616GluAla: 3.616 ± 0.17
0.978GluCys: 0.978 ± 0.086
4.525GluAsp: 4.525 ± 0.178
9.087GluGlu: 9.087 ± 0.32
3.035GluPhe: 3.035 ± 0.129
4.48GluGly: 4.48 ± 0.168
1.451GluHis: 1.451 ± 0.094
4.758GluIle: 4.758 ± 0.182
6.424GluLys: 6.424 ± 0.314
5.888GluLeu: 5.888 ± 0.212
1.811GluMet: 1.811 ± 0.104
3.881GluAsn: 3.881 ± 0.176
1.729GluPro: 1.729 ± 0.12
2.865GluGln: 2.865 ± 0.153
5.086GluArg: 5.086 ± 0.209
3.458GluSer: 3.458 ± 0.16
3.635GluThr: 3.635 ± 0.165
5.225GluVal: 5.225 ± 0.228
1.161GluTrp: 1.161 ± 0.102
2.865GluTyr: 2.865 ± 0.154
0.0GluXaa: 0.0 ± 0.0
Phe
3.162PheAla: 3.162 ± 0.151
1.092PheCys: 1.092 ± 0.099
1.899PheAsp: 1.899 ± 0.128
1.937PheGlu: 1.937 ± 0.128
2.486PhePhe: 2.486 ± 0.152
1.994PheGly: 1.994 ± 0.112
0.675PheHis: 0.675 ± 0.053
2.745PheIle: 2.745 ± 0.126
1.149PheLys: 1.149 ± 0.084
4.834PheLeu: 4.834 ± 0.211
0.966PheMet: 0.966 ± 0.084
1.862PheAsn: 1.862 ± 0.121
2.221PhePro: 2.221 ± 0.109
0.984PheGln: 0.984 ± 0.075
2.505PheArg: 2.505 ± 0.122
4.758PheSer: 4.758 ± 0.193
3.187PheThr: 3.187 ± 0.153
3.168PheVal: 3.168 ± 0.146
0.871PheTrp: 0.871 ± 0.071
2.707PheTyr: 2.707 ± 0.143
0.0PheXaa: 0.0 ± 0.0
Gly
4.146GlyAla: 4.146 ± 0.623
2.537GlyCys: 2.537 ± 0.247
2.852GlyAsp: 2.852 ± 0.133
4.663GlyGlu: 4.663 ± 0.275
2.379GlyPhe: 2.379 ± 0.12
3.496GlyGly: 3.496 ± 0.199
2.057GlyHis: 2.057 ± 0.182
2.953GlyIle: 2.953 ± 0.15
3.799GlyLys: 3.799 ± 0.243
4.6GlyLeu: 4.6 ± 0.198
1.022GlyMet: 1.022 ± 0.09
3.849GlyAsn: 3.849 ± 0.329
1.679GlyPro: 1.679 ± 0.134
1.975GlyGln: 1.975 ± 0.131
2.745GlyArg: 2.745 ± 0.133
3.742GlySer: 3.742 ± 0.173
2.562GlyThr: 2.562 ± 0.142
5.036GlyVal: 5.036 ± 0.508
0.625GlyTrp: 0.625 ± 0.068
3.364GlyTyr: 3.364 ± 0.151
0.0GlyXaa: 0.0 ± 0.0
His
1.01HisAla: 1.01 ± 0.091
0.549HisCys: 0.549 ± 0.065
1.073HisAsp: 1.073 ± 0.08
1.275HisGlu: 1.275 ± 0.089
0.587HisPhe: 0.587 ± 0.059
1.117HisGly: 1.117 ± 0.09
0.543HisHis: 0.543 ± 0.098
1.243HisIle: 1.243 ± 0.09
1.079HisLys: 1.079 ± 0.103
3.042HisLeu: 3.042 ± 0.19
0.442HisMet: 0.442 ± 0.053
0.808HisAsn: 0.808 ± 0.076
0.959HisPro: 0.959 ± 0.084
0.549HisGln: 0.549 ± 0.072
1.022HisArg: 1.022 ± 0.088
1.111HisSer: 1.111 ± 0.08
0.77HisThr: 0.77 ± 0.07
1.483HisVal: 1.483 ± 0.1
0.196HisTrp: 0.196 ± 0.04
0.827HisTyr: 0.827 ± 0.077
0.0HisXaa: 0.0 ± 0.0
Ile
3.124IleAla: 3.124 ± 0.13
1.281IleCys: 1.281 ± 0.085
2.972IleAsp: 2.972 ± 0.139
3.502IleGlu: 3.502 ± 0.174
3.016IlePhe: 3.016 ± 0.13
2.537IleGly: 2.537 ± 0.155
1.161IleHis: 1.161 ± 0.091
3.332IleIle: 3.332 ± 0.177
3.748IleLys: 3.748 ± 0.169
6.595IleLeu: 6.595 ± 0.214
1.079IleMet: 1.079 ± 0.086
2.436IleAsn: 2.436 ± 0.128
2.638IlePro: 2.638 ± 0.121
1.445IleGln: 1.445 ± 0.093
2.953IleArg: 2.953 ± 0.133
4.19IleSer: 4.19 ± 0.18
2.72IleThr: 2.72 ± 0.162
3.376IleVal: 3.376 ± 0.159
0.505IleTrp: 0.505 ± 0.055
2.657IleTyr: 2.657 ± 0.119
0.0IleXaa: 0.0 ± 0.0
Lys
3.042LysAla: 3.042 ± 0.154
0.713LysCys: 0.713 ± 0.079
4.108LysAsp: 4.108 ± 0.242
5.97LysGlu: 5.97 ± 0.188
2.089LysPhe: 2.089 ± 0.12
3.976LysGly: 3.976 ± 0.194
1.546LysHis: 1.546 ± 0.105
4.077LysIle: 4.077 ± 0.16
5.067LysLys: 5.067 ± 0.306
5.351LysLeu: 5.351 ± 0.24
1.104LysMet: 1.104 ± 0.091
2.846LysAsn: 2.846 ± 0.135
2.24LysPro: 2.24 ± 0.118
1.988LysGln: 1.988 ± 0.131
3.477LysArg: 3.477 ± 0.165
3.3LysSer: 3.3 ± 0.146
2.808LysThr: 2.808 ± 0.153
4.487LysVal: 4.487 ± 0.201
1.142LysTrp: 1.142 ± 0.132
2.265LysTyr: 2.265 ± 0.133
0.0LysXaa: 0.0 ± 0.0
Leu
5.982LeuAla: 5.982 ± 0.205
2.846LeuCys: 2.846 ± 0.156
5.856LeuAsp: 5.856 ± 0.196
9.258LeuGlu: 9.258 ± 0.294
4.746LeuPhe: 4.746 ± 0.189
5.276LeuGly: 5.276 ± 0.187
2.152LeuHis: 2.152 ± 0.124
4.581LeuIle: 4.581 ± 0.164
5.244LeuLys: 5.244 ± 0.217
11.687LeuLeu: 11.687 ± 0.34
1.609LeuMet: 1.609 ± 0.085
3.698LeuAsn: 3.698 ± 0.167
5.692LeuPro: 5.692 ± 0.21
5.698LeuGln: 5.698 ± 0.238
5.175LeuArg: 5.175 ± 0.181
9.314LeuSer: 9.314 ± 0.287
4.998LeuThr: 4.998 ± 0.188
6.796LeuVal: 6.796 ± 0.224
1.123LeuTrp: 1.123 ± 0.094
5.074LeuTyr: 5.074 ± 0.222
0.0LeuXaa: 0.0 ± 0.0
Met
1.18MetAla: 1.18 ± 0.084
0.416MetCys: 0.416 ± 0.047
1.395MetAsp: 1.395 ± 0.101
1.893MetGlu: 1.893 ± 0.123
0.852MetPhe: 0.852 ± 0.076
0.972MetGly: 0.972 ± 0.073
0.568MetHis: 0.568 ± 0.054
0.947MetIle: 0.947 ± 0.078
1.066MetLys: 1.066 ± 0.083
1.982MetLeu: 1.982 ± 0.109
0.385MetMet: 0.385 ± 0.054
0.783MetAsn: 0.783 ± 0.074
0.492MetPro: 0.492 ± 0.058
1.325MetGln: 1.325 ± 0.088
0.959MetArg: 0.959 ± 0.079
1.622MetSer: 1.622 ± 0.129
0.612MetThr: 0.612 ± 0.063
1.3MetVal: 1.3 ± 0.092
0.259MetTrp: 0.259 ± 0.036
0.732MetTyr: 0.732 ± 0.063
0.0MetXaa: 0.0 ± 0.0
Asn
2.013AsnAla: 2.013 ± 0.112
0.618AsnCys: 0.618 ± 0.078
1.237AsnAsp: 1.237 ± 0.074
1.969AsnGlu: 1.969 ± 0.134
2.448AsnPhe: 2.448 ± 0.127
3.521AsnGly: 3.521 ± 0.308
0.808AsnHis: 0.808 ± 0.062
3.155AsnIle: 3.155 ± 0.126
3.263AsnLys: 3.263 ± 0.154
5.9AsnLeu: 5.9 ± 0.188
1.13AsnMet: 1.13 ± 0.09
2.019AsnAsn: 2.019 ± 0.119
2.114AsnPro: 2.114 ± 0.134
1.634AsnGln: 1.634 ± 0.115
2.202AsnArg: 2.202 ± 0.126
2.657AsnSer: 2.657 ± 0.14
2.871AsnThr: 2.871 ± 0.316
2.644AsnVal: 2.644 ± 0.139
0.454AsnTrp: 0.454 ± 0.05
2.158AsnTyr: 2.158 ± 0.11
0.0AsnXaa: 0.0 ± 0.0
Pro
1.483ProAla: 1.483 ± 0.1
1.066ProCys: 1.066 ± 0.101
2.032ProAsp: 2.032 ± 0.117
4.058ProGlu: 4.058 ± 0.154
1.912ProPhe: 1.912 ± 0.133
2.423ProGly: 2.423 ± 0.154
0.713ProHis: 0.713 ± 0.072
1.603ProIle: 1.603 ± 0.1
2.12ProLys: 2.12 ± 0.116
4.947ProLeu: 4.947 ± 0.191
0.511ProMet: 0.511 ± 0.059
1.615ProAsn: 1.615 ± 0.119
2.082ProPro: 2.082 ± 0.177
1.799ProGln: 1.799 ± 0.15
2.12ProArg: 2.12 ± 0.163
3.49ProSer: 3.49 ± 0.152
1.679ProThr: 1.679 ± 0.118
2.934ProVal: 2.934 ± 0.157
1.559ProTrp: 1.559 ± 0.188
1.729ProTyr: 1.729 ± 0.115
0.0ProXaa: 0.0 ± 0.0
Gln
2.297GlnAla: 2.297 ± 0.125
0.536GlnCys: 0.536 ± 0.063
2.057GlnAsp: 2.057 ± 0.114
3.168GlnGlu: 3.168 ± 0.163
0.719GlnPhe: 0.719 ± 0.053
3.988GlnGly: 3.988 ± 0.283
0.536GlnHis: 0.536 ± 0.062
1.754GlnIle: 1.754 ± 0.088
1.641GlnLys: 1.641 ± 0.113
2.455GlnLeu: 2.455 ± 0.116
0.656GlnMet: 0.656 ± 0.081
1.445GlnAsn: 1.445 ± 0.144
1.281GlnPro: 1.281 ± 0.101
0.883GlnGln: 0.883 ± 0.113
2.146GlnArg: 2.146 ± 0.123
1.761GlnSer: 1.761 ± 0.106
2.133GlnThr: 2.133 ± 0.151
3.092GlnVal: 3.092 ± 0.166
1.66GlnTrp: 1.66 ± 0.215
0.776GlnTyr: 0.776 ± 0.072
0.0GlnXaa: 0.0 ± 0.0
Arg
3.067ArgAla: 3.067 ± 0.183
1.066ArgCys: 1.066 ± 0.108
3.225ArgAsp: 3.225 ± 0.148
5.768ArgGlu: 5.768 ± 0.223
2.234ArgPhe: 2.234 ± 0.115
3.376ArgGly: 3.376 ± 0.204
0.738ArgHis: 0.738 ± 0.068
3.124ArgIle: 3.124 ± 0.179
3.919ArgLys: 3.919 ± 0.176
4.581ArgLeu: 4.581 ± 0.202
1.199ArgMet: 1.199 ± 0.099
2.644ArgAsn: 2.644 ± 0.131
1.735ArgPro: 1.735 ± 0.112
1.912ArgGln: 1.912 ± 0.105
3.187ArgArg: 3.187 ± 0.177
3.376ArgSer: 3.376 ± 0.145
2.259ArgThr: 2.259 ± 0.141
3.326ArgVal: 3.326 ± 0.145
0.606ArgTrp: 0.606 ± 0.064
2.139ArgTyr: 2.139 ± 0.134
0.0ArgXaa: 0.0 ± 0.0
Ser
3.704SerAla: 3.704 ± 0.154
1.685SerCys: 1.685 ± 0.141
3.092SerAsp: 3.092 ± 0.156
4.039SerGlu: 4.039 ± 0.168
4.266SerPhe: 4.266 ± 0.167
4.38SerGly: 4.38 ± 0.163
1.155SerHis: 1.155 ± 0.084
3.717SerIle: 3.717 ± 0.188
3.995SerLys: 3.995 ± 0.177
9.276SerLeu: 9.276 ± 0.311
1.401SerMet: 1.401 ± 0.091
2.96SerAsn: 2.96 ± 0.129
3.54SerPro: 3.54 ± 0.243
2.24SerGln: 2.24 ± 0.127
3.446SerArg: 3.446 ± 0.18
7.977SerSer: 7.977 ± 0.646
3.654SerThr: 3.654 ± 0.151
4.297SerVal: 4.297 ± 0.169
1.161SerTrp: 1.161 ± 0.085
3.477SerTyr: 3.477 ± 0.167
0.0SerXaa: 0.0 ± 0.0
Thr
1.937ThrAla: 1.937 ± 0.119
2.082ThrCys: 2.082 ± 0.155
2.026ThrAsp: 2.026 ± 0.118
2.859ThrGlu: 2.859 ± 0.126
2.442ThrPhe: 2.442 ± 0.145
6.778ThrGly: 6.778 ± 1.309
0.688ThrHis: 0.688 ± 0.079
2.808ThrIle: 2.808 ± 0.144
2.657ThrLys: 2.657 ± 0.134
5.364ThrLeu: 5.364 ± 0.182
0.814ThrMet: 0.814 ± 0.079
1.937ThrAsn: 1.937 ± 0.121
2.411ThrPro: 2.411 ± 0.119
1.369ThrGln: 1.369 ± 0.107
2.562ThrArg: 2.562 ± 0.146
3.647ThrSer: 3.647 ± 0.162
2.341ThrThr: 2.341 ± 0.178
2.518ThrVal: 2.518 ± 0.148
0.827ThrTrp: 0.827 ± 0.059
2.089ThrTyr: 2.089 ± 0.11
0.0ThrXaa: 0.0 ± 0.0
Val
3.389ValAla: 3.389 ± 0.17
2.575ValCys: 2.575 ± 0.196
3.376ValAsp: 3.376 ± 0.151
4.663ValGlu: 4.663 ± 0.183
2.531ValPhe: 2.531 ± 0.13
2.745ValGly: 2.745 ± 0.165
1.306ValHis: 1.306 ± 0.094
3.187ValIle: 3.187 ± 0.157
3.938ValLys: 3.938 ± 0.191
7.705ValLeu: 7.705 ± 0.214
1.268ValMet: 1.268 ± 0.096
2.732ValAsn: 2.732 ± 0.158
2.827ValPro: 2.827 ± 0.149
2.259ValGln: 2.259 ± 0.124
3.275ValArg: 3.275 ± 0.167
4.55ValSer: 4.55 ± 0.176
4.241ValThr: 4.241 ± 0.488
3.685ValVal: 3.685 ± 0.191
0.751ValTrp: 0.751 ± 0.061
3.376ValTyr: 3.376 ± 0.125
0.0ValXaa: 0.0 ± 0.0
Trp
1.811TrpAla: 1.811 ± 0.189
0.435TrpCys: 0.435 ± 0.07
1.338TrpAsp: 1.338 ± 0.139
0.688TrpGlu: 0.688 ± 0.064
0.6TrpPhe: 0.6 ± 0.062
0.511TrpGly: 0.511 ± 0.055
0.227TrpHis: 0.227 ± 0.039
0.745TrpIle: 0.745 ± 0.063
0.972TrpLys: 0.972 ± 0.078
2.108TrpLeu: 2.108 ± 0.174
0.391TrpMet: 0.391 ± 0.05
1.142TrpAsn: 1.142 ± 0.133
0.233TrpPro: 0.233 ± 0.044
0.334TrpGln: 0.334 ± 0.044
0.637TrpArg: 0.637 ± 0.065
0.947TrpSer: 0.947 ± 0.086
0.644TrpThr: 0.644 ± 0.064
0.726TrpVal: 0.726 ± 0.069
0.107TrpTrp: 0.107 ± 0.028
0.486TrpTyr: 0.486 ± 0.06
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.436TyrAla: 2.436 ± 0.137
0.738TyrCys: 0.738 ± 0.079
2.158TyrAsp: 2.158 ± 0.118
2.663TyrGlu: 2.663 ± 0.155
2.48TyrPhe: 2.48 ± 0.131
2.108TyrGly: 2.108 ± 0.106
1.035TyrHis: 1.035 ± 0.072
2.739TyrIle: 2.739 ± 0.136
2.796TyrLys: 2.796 ± 0.148
5.629TyrLeu: 5.629 ± 0.225
1.029TyrMet: 1.029 ± 0.082
2.303TyrAsn: 2.303 ± 0.135
2.19TyrPro: 2.19 ± 0.13
1.597TyrGln: 1.597 ± 0.097
2.423TyrArg: 2.423 ± 0.116
3.837TyrSer: 3.837 ± 0.176
2.417TyrThr: 2.417 ± 0.155
2.423TyrVal: 2.423 ± 0.124
0.379TyrTrp: 0.379 ± 0.045
2.613TyrTyr: 2.613 ± 0.163
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 643 proteins (158466 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski