Amino acid dipepetide frequency for Paramecium bursaria Chlorella virus AR158 (PBCV-AR158)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.278AlaAla: 3.278 ± 0.232
0.922AlaCys: 0.922 ± 0.1
2.26AlaAsp: 2.26 ± 0.142
2.431AlaGlu: 2.431 ± 0.209
2.567AlaPhe: 2.567 ± 0.142
3.018AlaGly: 3.018 ± 0.315
1.024AlaHis: 1.024 ± 0.102
3.947AlaIle: 3.947 ± 0.191
3.694AlaLys: 3.694 ± 0.284
4.008AlaLeu: 4.008 ± 0.223
1.53AlaMet: 1.53 ± 0.138
3.107AlaAsn: 3.107 ± 0.276
3.018AlaPro: 3.018 ± 0.408
1.338AlaGln: 1.338 ± 0.092
2.963AlaArg: 2.963 ± 0.192
4.186AlaSer: 4.186 ± 0.213
3.216AlaThr: 3.216 ± 0.273
3.305AlaVal: 3.305 ± 0.203
0.423AlaTrp: 0.423 ± 0.047
1.707AlaTyr: 1.707 ± 0.129
0.0AlaXaa: 0.0 ± 0.0
Cys
0.854CysAla: 0.854 ± 0.085
0.512CysCys: 0.512 ± 0.063
1.263CysAsp: 1.263 ± 0.106
0.772CysGlu: 0.772 ± 0.074
1.338CysPhe: 1.338 ± 0.092
1.407CysGly: 1.407 ± 0.141
0.615CysHis: 0.615 ± 0.067
1.673CysIle: 1.673 ± 0.117
1.065CysLys: 1.065 ± 0.152
1.652CysLeu: 1.652 ± 0.12
0.526CysMet: 0.526 ± 0.069
0.826CysAsn: 0.826 ± 0.105
1.017CysPro: 1.017 ± 0.109
0.492CysGln: 0.492 ± 0.081
1.291CysArg: 1.291 ± 0.109
2.021CysSer: 2.021 ± 0.202
0.901CysThr: 0.901 ± 0.102
1.823CysVal: 1.823 ± 0.132
0.198CysTrp: 0.198 ± 0.046
0.526CysTyr: 0.526 ± 0.066
0.0CysXaa: 0.0 ± 0.0
Asp
3.257AspAla: 3.257 ± 0.185
0.86AspCys: 0.86 ± 0.076
4.377AspAsp: 4.377 ± 0.272
3.865AspGlu: 3.865 ± 0.208
3.011AspPhe: 3.011 ± 0.145
3.53AspGly: 3.53 ± 0.205
1.099AspHis: 1.099 ± 0.084
4.896AspIle: 4.896 ± 0.199
3.161AspLys: 3.161 ± 0.143
3.653AspLeu: 3.653 ± 0.186
1.57AspMet: 1.57 ± 0.119
2.479AspAsn: 2.479 ± 0.155
2.274AspPro: 2.274 ± 0.154
1.038AspGln: 1.038 ± 0.095
2.363AspArg: 2.363 ± 0.125
3.127AspSer: 3.127 ± 0.136
3.387AspThr: 3.387 ± 0.158
4.493AspVal: 4.493 ± 0.194
0.512AspTrp: 0.512 ± 0.067
1.857AspTyr: 1.857 ± 0.133
0.0AspXaa: 0.0 ± 0.0
Glu
2.349GluAla: 2.349 ± 0.215
1.045GluCys: 1.045 ± 0.092
2.567GluAsp: 2.567 ± 0.166
3.011GluGlu: 3.011 ± 0.206
2.718GluPhe: 2.718 ± 0.143
1.509GluGly: 1.509 ± 0.101
1.632GluHis: 1.632 ± 0.106
4.049GluIle: 4.049 ± 0.198
3.953GluLys: 3.953 ± 0.231
4.008GluLeu: 4.008 ± 0.193
1.359GluMet: 1.359 ± 0.102
3.202GluAsn: 3.202 ± 0.153
1.755GluPro: 1.755 ± 0.13
1.523GluGln: 1.523 ± 0.11
3.066GluArg: 3.066 ± 0.151
3.202GluSer: 3.202 ± 0.157
3.23GluThr: 3.23 ± 0.162
2.431GluVal: 2.431 ± 0.225
0.676GluTrp: 0.676 ± 0.085
2.69GluTyr: 2.69 ± 0.161
0.0GluXaa: 0.0 ± 0.0
Phe
3.1PheAla: 3.1 ± 0.171
1.181PheCys: 1.181 ± 0.104
3.51PheAsp: 3.51 ± 0.168
3.161PheGlu: 3.161 ± 0.156
3.578PhePhe: 3.578 ± 0.222
3.619PheGly: 3.619 ± 0.209
1.263PheHis: 1.263 ± 0.097
3.981PheIle: 3.981 ± 0.204
2.663PheLys: 2.663 ± 0.15
4.527PheLeu: 4.527 ± 0.226
1.372PheMet: 1.372 ± 0.102
1.891PheAsn: 1.891 ± 0.137
2.82PhePro: 2.82 ± 0.312
1.386PheGln: 1.386 ± 0.102
3.025PheArg: 3.025 ± 0.22
4.677PheSer: 4.677 ± 0.205
2.998PheThr: 2.998 ± 0.166
5.073PheVal: 5.073 ± 0.221
0.601PheTrp: 0.601 ± 0.079
1.728PheTyr: 1.728 ± 0.105
0.007PheXaa: 0.007 ± 0.007
Gly
2.963GlyAla: 2.963 ± 0.259
1.277GlyCys: 1.277 ± 0.179
2.595GlyAsp: 2.595 ± 0.124
2.315GlyGlu: 2.315 ± 0.152
3.264GlyPhe: 3.264 ± 0.222
3.51GlyGly: 3.51 ± 0.255
1.133GlyHis: 1.133 ± 0.109
3.94GlyIle: 3.94 ± 0.226
3.913GlyLys: 3.913 ± 0.262
3.469GlyLeu: 3.469 ± 0.21
1.058GlyMet: 1.058 ± 0.088
4.329GlyAsn: 4.329 ± 0.837
1.413GlyPro: 1.413 ± 0.131
1.577GlyGln: 1.577 ± 0.157
2.806GlyArg: 2.806 ± 0.173
4.213GlySer: 4.213 ± 0.279
3.23GlyThr: 3.23 ± 0.221
3.749GlyVal: 3.749 ± 0.3
0.58GlyTrp: 0.58 ± 0.079
2.274GlyTyr: 2.274 ± 0.196
0.0GlyXaa: 0.0 ± 0.0
His
1.147HisAla: 1.147 ± 0.097
0.417HisCys: 0.417 ± 0.062
1.577HisAsp: 1.577 ± 0.107
1.4HisGlu: 1.4 ± 0.116
1.284HisPhe: 1.284 ± 0.155
1.222HisGly: 1.222 ± 0.095
1.079HisHis: 1.079 ± 0.105
2.048HisIle: 2.048 ± 0.124
1.413HisLys: 1.413 ± 0.092
2.103HisLeu: 2.103 ± 0.122
0.574HisMet: 0.574 ± 0.062
0.963HisAsn: 0.963 ± 0.091
0.997HisPro: 0.997 ± 0.092
0.587HisGln: 0.587 ± 0.059
1.871HisArg: 1.871 ± 0.122
1.407HisSer: 1.407 ± 0.096
1.379HisThr: 1.379 ± 0.136
2.021HisVal: 2.021 ± 0.125
0.348HisTrp: 0.348 ± 0.053
0.744HisTyr: 0.744 ± 0.074
0.0HisXaa: 0.0 ± 0.0
Ile
4.206IleAla: 4.206 ± 0.193
1.618IleCys: 1.618 ± 0.12
4.705IleAsp: 4.705 ± 0.176
3.919IleGlu: 3.919 ± 0.175
4.561IlePhe: 4.561 ± 0.238
4.288IleGly: 4.288 ± 0.302
1.967IleHis: 1.967 ± 0.124
5.934IleIle: 5.934 ± 0.251
3.899IleLys: 3.899 ± 0.193
6.692IleLeu: 6.692 ± 0.202
2.021IleMet: 2.021 ± 0.108
3.353IleAsn: 3.353 ± 0.19
3.585IlePro: 3.585 ± 0.151
2.062IleGln: 2.062 ± 0.115
4.568IleArg: 4.568 ± 0.183
6.651IleSer: 6.651 ± 0.261
4.37IleThr: 4.37 ± 0.199
5.579IleVal: 5.579 ± 0.248
0.676IleTrp: 0.676 ± 0.08
2.513IleTyr: 2.513 ± 0.117
0.007IleXaa: 0.007 ± 0.007
Lys
2.752LysAla: 2.752 ± 0.242
1.468LysCys: 1.468 ± 0.181
3.202LysAsp: 3.202 ± 0.17
3.482LysGlu: 3.482 ± 0.216
3.093LysPhe: 3.093 ± 0.152
2.472LysGly: 2.472 ± 0.172
1.878LysHis: 1.878 ± 0.118
5.353LysIle: 5.353 ± 0.244
6.507LysLys: 6.507 ± 0.348
5.155LysLeu: 5.155 ± 0.197
2.301LysMet: 2.301 ± 0.154
4.991LysAsn: 4.991 ± 0.22
3.824LysPro: 3.824 ± 0.569
2.124LysGln: 2.124 ± 0.183
3.455LysArg: 3.455 ± 0.188
4.507LysSer: 4.507 ± 0.207
4.513LysThr: 4.513 ± 0.221
2.95LysVal: 2.95 ± 0.227
0.683LysTrp: 0.683 ± 0.082
3.161LysTyr: 3.161 ± 0.143
0.007LysXaa: 0.007 ± 0.007
Leu
3.824LeuAla: 3.824 ± 0.205
1.57LeuCys: 1.57 ± 0.107
4.049LeuAsp: 4.049 ± 0.176
3.605LeuGlu: 3.605 ± 0.164
4.377LeuPhe: 4.377 ± 0.233
4.001LeuGly: 4.001 ± 0.352
1.878LeuHis: 1.878 ± 0.144
5.005LeuIle: 5.005 ± 0.211
4.923LeuLys: 4.923 ± 0.236
6.412LeuLeu: 6.412 ± 0.277
2.274LeuMet: 2.274 ± 0.137
3.79LeuAsn: 3.79 ± 0.19
3.844LeuPro: 3.844 ± 0.21
2.185LeuGln: 2.185 ± 0.133
4.609LeuArg: 4.609 ± 0.194
6.377LeuSer: 6.377 ± 0.248
4.24LeuThr: 4.24 ± 0.199
4.944LeuVal: 4.944 ± 0.256
0.799LeuTrp: 0.799 ± 0.076
3.025LeuTyr: 3.025 ± 0.168
0.0LeuXaa: 0.0 ± 0.0
Met
1.174MetAla: 1.174 ± 0.102
0.635MetCys: 0.635 ± 0.082
1.004MetAsp: 1.004 ± 0.077
1.065MetGlu: 1.065 ± 0.091
2.096MetPhe: 2.096 ± 0.225
1.065MetGly: 1.065 ± 0.093
0.498MetHis: 0.498 ± 0.063
2.431MetIle: 2.431 ± 0.144
2.444MetLys: 2.444 ± 0.148
2.165MetLeu: 2.165 ± 0.123
1.099MetMet: 1.099 ± 0.108
2.096MetAsn: 2.096 ± 0.129
1.079MetPro: 1.079 ± 0.12
0.519MetGln: 0.519 ± 0.071
1.468MetArg: 1.468 ± 0.098
2.765MetSer: 2.765 ± 0.164
2.383MetThr: 2.383 ± 0.144
1.256MetVal: 1.256 ± 0.107
0.362MetTrp: 0.362 ± 0.059
1.202MetTyr: 1.202 ± 0.098
0.0MetXaa: 0.0 ± 0.0
Asn
3.107AsnAla: 3.107 ± 0.195
0.792AsnCys: 0.792 ± 0.09
3.175AsnAsp: 3.175 ± 0.18
2.485AsnGlu: 2.485 ± 0.146
2.567AsnPhe: 2.567 ± 0.141
3.489AsnGly: 3.489 ± 0.31
1.188AsnHis: 1.188 ± 0.098
5.428AsnIle: 5.428 ± 0.4
3.407AsnLys: 3.407 ± 0.202
3.626AsnLeu: 3.626 ± 0.147
1.652AsnMet: 1.652 ± 0.122
2.977AsnAsn: 2.977 ± 0.21
2.349AsnPro: 2.349 ± 0.198
1.106AsnGln: 1.106 ± 0.094
2.574AsnArg: 2.574 ± 0.15
3.346AsnSer: 3.346 ± 0.155
3.994AsnThr: 3.994 ± 0.296
5.333AsnVal: 5.333 ± 0.57
0.492AsnTrp: 0.492 ± 0.068
1.611AsnTyr: 1.611 ± 0.123
0.007AsnXaa: 0.007 ± 0.007
Pro
3.066ProAla: 3.066 ± 0.4
0.867ProCys: 0.867 ± 0.084
2.526ProAsp: 2.526 ± 0.177
2.765ProGlu: 2.765 ± 0.2
2.192ProPhe: 2.192 ± 0.173
2.533ProGly: 2.533 ± 0.179
0.826ProHis: 0.826 ± 0.076
2.861ProIle: 2.861 ± 0.132
4.008ProLys: 4.008 ± 0.525
3.066ProLeu: 3.066 ± 0.171
1.393ProMet: 1.393 ± 0.208
2.158ProAsn: 2.158 ± 0.154
2.117ProPro: 2.117 ± 0.159
1.168ProGln: 1.168 ± 0.129
2.322ProArg: 2.322 ± 0.15
4.111ProSer: 4.111 ± 0.3
2.718ProThr: 2.718 ± 0.176
2.936ProVal: 2.936 ± 0.22
0.362ProTrp: 0.362 ± 0.059
1.27ProTyr: 1.27 ± 0.096
0.0ProXaa: 0.0 ± 0.0
Gln
1.086GlnAla: 1.086 ± 0.092
0.642GlnCys: 0.642 ± 0.109
1.222GlnAsp: 1.222 ± 0.106
0.935GlnGlu: 0.935 ± 0.096
1.297GlnPhe: 1.297 ± 0.113
1.031GlnGly: 1.031 ± 0.115
0.799GlnHis: 0.799 ± 0.076
1.912GlnIle: 1.912 ± 0.123
2.322GlnLys: 2.322 ± 0.182
2.089GlnLeu: 2.089 ± 0.137
0.874GlnMet: 0.874 ± 0.1
1.509GlnAsn: 1.509 ± 0.103
1.045GlnPro: 1.045 ± 0.122
1.195GlnGln: 1.195 ± 0.15
1.823GlnArg: 1.823 ± 0.154
1.878GlnSer: 1.878 ± 0.125
1.707GlnThr: 1.707 ± 0.106
1.461GlnVal: 1.461 ± 0.13
0.369GlnTrp: 0.369 ± 0.046
1.243GlnTyr: 1.243 ± 0.099
0.0GlnXaa: 0.0 ± 0.0
Arg
2.267ArgAla: 2.267 ± 0.138
1.482ArgCys: 1.482 ± 0.141
3.066ArgAsp: 3.066 ± 0.144
2.786ArgGlu: 2.786 ± 0.141
2.745ArgPhe: 2.745 ± 0.181
2.608ArgGly: 2.608 ± 0.16
1.605ArgHis: 1.605 ± 0.132
4.295ArgIle: 4.295 ± 0.164
3.974ArgLys: 3.974 ± 0.185
3.81ArgLeu: 3.81 ± 0.199
1.837ArgMet: 1.837 ± 0.129
3.052ArgAsn: 3.052 ± 0.175
2.158ArgPro: 2.158 ± 0.119
1.666ArgGln: 1.666 ± 0.116
4.329ArgArg: 4.329 ± 0.303
4.527ArgSer: 4.527 ± 0.235
3.387ArgThr: 3.387 ± 0.154
3.892ArgVal: 3.892 ± 0.307
0.881ArgTrp: 0.881 ± 0.074
2.335ArgTyr: 2.335 ± 0.134
0.0ArgXaa: 0.0 ± 0.0
Ser
4.192SerAla: 4.192 ± 0.309
1.734SerCys: 1.734 ± 0.122
3.68SerAsp: 3.68 ± 0.185
3.557SerGlu: 3.557 ± 0.181
4.5SerPhe: 4.5 ± 0.221
5.012SerGly: 5.012 ± 0.316
1.885SerHis: 1.885 ± 0.167
5.811SerIle: 5.811 ± 0.233
4.944SerLys: 4.944 ± 0.262
5.763SerLeu: 5.763 ± 0.211
2.26SerMet: 2.26 ± 0.137
3.906SerAsn: 3.906 ± 0.244
3.592SerPro: 3.592 ± 0.249
2.048SerGln: 2.048 ± 0.162
4.964SerArg: 4.964 ± 0.247
7.395SerSer: 7.395 ± 0.35
4.868SerThr: 4.868 ± 0.284
5.749SerVal: 5.749 ± 0.223
0.799SerTrp: 0.799 ± 0.1
2.622SerTyr: 2.622 ± 0.132
0.007SerXaa: 0.007 ± 0.006
Thr
3.039ThrAla: 3.039 ± 0.275
1.27ThrCys: 1.27 ± 0.103
2.663ThrAsp: 2.663 ± 0.14
2.793ThrGlu: 2.793 ± 0.147
3.947ThrPhe: 3.947 ± 0.229
3.523ThrGly: 3.523 ± 0.308
1.543ThrHis: 1.543 ± 0.15
4.916ThrIle: 4.916 ± 0.245
4.165ThrLys: 4.165 ± 0.209
4.787ThrLeu: 4.787 ± 0.198
1.652ThrMet: 1.652 ± 0.118
3.592ThrAsn: 3.592 ± 0.22
3.462ThrPro: 3.462 ± 0.265
1.632ThrGln: 1.632 ± 0.125
3.243ThrArg: 3.243 ± 0.163
5.497ThrSer: 5.497 ± 0.242
3.988ThrThr: 3.988 ± 0.207
2.793ThrVal: 2.793 ± 0.163
0.737ThrTrp: 0.737 ± 0.074
1.96ThrTyr: 1.96 ± 0.126
0.0ThrXaa: 0.0 ± 0.0
Val
3.523ValAla: 3.523 ± 0.247
1.379ValCys: 1.379 ± 0.113
4.124ValAsp: 4.124 ± 0.172
3.264ValGlu: 3.264 ± 0.185
3.953ValPhe: 3.953 ± 0.202
3.339ValGly: 3.339 ± 0.3
1.577ValHis: 1.577 ± 0.098
4.944ValIle: 4.944 ± 0.223
4.008ValLys: 4.008 ± 0.187
5.422ValLeu: 5.422 ± 0.32
2.13ValMet: 2.13 ± 0.179
3.346ValAsn: 3.346 ± 0.221
3.243ValPro: 3.243 ± 0.238
1.673ValGln: 1.673 ± 0.123
3.557ValArg: 3.557 ± 0.235
5.995ValSer: 5.995 ± 0.355
3.687ValThr: 3.687 ± 0.22
4.609ValVal: 4.609 ± 0.208
0.594ValTrp: 0.594 ± 0.073
2.922ValTyr: 2.922 ± 0.153
0.0ValXaa: 0.0 ± 0.0
Trp
0.485TrpAla: 0.485 ± 0.063
0.417TrpCys: 0.417 ± 0.079
0.608TrpAsp: 0.608 ± 0.087
0.389TrpGlu: 0.389 ± 0.059
0.813TrpPhe: 0.813 ± 0.081
0.485TrpGly: 0.485 ± 0.053
0.164TrpHis: 0.164 ± 0.041
0.84TrpIle: 0.84 ± 0.086
1.004TrpLys: 1.004 ± 0.104
0.683TrpLeu: 0.683 ± 0.074
0.294TrpMet: 0.294 ± 0.042
1.004TrpAsn: 1.004 ± 0.106
0.273TrpPro: 0.273 ± 0.044
0.219TrpGln: 0.219 ± 0.04
0.41TrpArg: 0.41 ± 0.065
0.799TrpSer: 0.799 ± 0.079
0.574TrpThr: 0.574 ± 0.059
0.546TrpVal: 0.546 ± 0.068
0.219TrpTrp: 0.219 ± 0.042
0.389TrpTyr: 0.389 ± 0.05
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.274TyrAla: 2.274 ± 0.138
0.683TyrCys: 0.683 ± 0.09
2.636TyrAsp: 2.636 ± 0.151
1.973TyrGlu: 1.973 ± 0.133
2.26TyrPhe: 2.26 ± 0.135
1.946TyrGly: 1.946 ± 0.116
0.929TyrHis: 0.929 ± 0.08
2.902TyrIle: 2.902 ± 0.141
2.41TyrLys: 2.41 ± 0.148
2.636TyrLeu: 2.636 ± 0.152
1.017TyrMet: 1.017 ± 0.082
2.137TyrAsn: 2.137 ± 0.119
1.4TyrPro: 1.4 ± 0.108
0.901TyrGln: 0.901 ± 0.088
1.994TyrArg: 1.994 ± 0.118
2.608TyrSer: 2.608 ± 0.125
2.424TyrThr: 2.424 ± 0.122
2.349TyrVal: 2.349 ± 0.143
0.3TyrTrp: 0.3 ± 0.045
1.25TyrTyr: 1.25 ± 0.091
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.007XaaAsp: 0.007 ± 0.007
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.007XaaIle: 0.007 ± 0.006
0.0XaaLys: 0.0 ± 0.0
0.007XaaLeu: 0.007 ± 0.007
0.007XaaMet: 0.007 ± 0.007
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.007XaaThr: 0.007 ± 0.007
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 814 proteins (146454 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski