Amino acid dipeptide frequency for Paramecium bursaria Chlorella virus 1 (PBCV-1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
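
As a rough illustration of what such a calculation can look like, the sketch below counts overlapping residue pairs in a list of protein sequences and reports each pair in per mille. The function names are hypothetical, and normalising by the total number of dipeptides is an assumption; the database may use a different denominator.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the same order as the table header.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def to_three(residue):
    """Map a one-letter code to three letters; anything else becomes Xaa."""
    return ONE_TO_THREE.get(residue, "Xaa")


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs.

    Assumption: frequencies are normalised by the total number of
    overlapping dipeptides found in the input sequences.
    """
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # zip(seq, seq[1:]) yields every overlapping pair of neighbours.
        for first, second in zip(seq, seq[1:]):
            counts[to_three(first) + to_three(second)] += 1
    total = sum(counts.values())
    names = list(ONE_TO_THREE.values()) + ["Xaa"]
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(names, repeat=2)
    }
```

Calling `dipeptide_frequencies(["AAC", "CA"])` counts three dipeptides (AlaAla, AlaCys, CysAla), so each of them gets one third of 1000‰ and all other pairs get 0.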

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.
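
The uniform baseline and the over/under comparison can be sketched as follows. This is only an illustration: the function names are hypothetical, and the strict comparison against the uniform expectation is an assumption, since the exact rule used for colouring the table is not stated here.

```python
def expected_per_mille(n_residue_types=20):
    """Uniform expectation for a single dipeptide, in per mille.

    With 20 residue types there are 20 * 20 = 400 dipeptides, so each is
    expected at 1000 / 400 = 2.5 per mille (0.25%).
    """
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_per_mille, n_residue_types=20):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a plain greater-than / less-than comparison, ignoring
    the reported error.
    """
    expected = expected_per_mille(n_residue_types)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"
```

With the values from the table, `classify(7.255)` (LysLys) gives "over" and `classify(0.161)` (TrpHis) gives "under".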

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.478 ± 0.22
AlaCys: 0.869 ± 0.084
AlaAsp: 2.535 ± 0.206
AlaGlu: 2.572 ± 0.158
AlaPhe: 2.645 ± 0.155
AlaGly: 3.171 ± 0.329
AlaHis: 0.891 ± 0.083
AlaIle: 4.055 ± 0.162
AlaLys: 3.77 ± 0.193
AlaLeu: 4.318 ± 0.182
AlaMet: 1.512 ± 0.121
AlaAsn: 3.668 ± 0.541
AlaPro: 3.354 ± 0.419
AlaGln: 1.432 ± 0.128
AlaArg: 2.411 ± 0.166
AlaSer: 3.836 ± 0.212
AlaThr: 3.485 ± 0.209
AlaVal: 3.938 ± 0.21
AlaTrp: 0.526 ± 0.058
AlaTyr: 1.607 ± 0.112
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.877 ± 0.099
CysCys: 0.416 ± 0.067
CysAsp: 1.067 ± 0.103
CysGlu: 0.68 ± 0.087
CysPhe: 1.191 ± 0.092
CysGly: 1.439 ± 0.195
CysHis: 0.533 ± 0.07
CysIle: 1.403 ± 0.113
CysLys: 1.191 ± 0.125
CysLeu: 1.564 ± 0.137
CysMet: 0.482 ± 0.052
CysAsn: 0.928 ± 0.085
CysPro: 1.001 ± 0.116
CysGln: 0.416 ± 0.062
CysArg: 0.994 ± 0.102
CysSer: 1.454 ± 0.123
CysThr: 0.964 ± 0.083
CysVal: 1.688 ± 0.137
CysTrp: 0.212 ± 0.043
CysTyr: 0.57 ± 0.065
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.032 ± 0.161
AspCys: 0.701 ± 0.072
AspAsp: 3.558 ± 0.262
AspGlu: 3.266 ± 0.181
AspPhe: 2.703 ± 0.152
AspGly: 3.083 ± 0.183
AspHis: 0.789 ± 0.068
AspIle: 4.698 ± 0.238
AspLys: 2.952 ± 0.174
AspLeu: 3.456 ± 0.189
AspMet: 1.425 ± 0.106
AspAsn: 2.762 ± 0.164
AspPro: 2.017 ± 0.152
AspGln: 0.986 ± 0.102
AspArg: 1.936 ± 0.118
AspSer: 2.755 ± 0.152
AspThr: 3.434 ± 0.142
AspVal: 4.084 ± 0.179
AspTrp: 0.599 ± 0.076
AspTyr: 2.06 ± 0.131
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.177 ± 0.12
GluCys: 0.943 ± 0.097
GluAsp: 2.725 ± 0.165
GluGlu: 3.149 ± 0.24
GluPhe: 2.47 ± 0.153
GluGly: 1.856 ± 0.131
GluHis: 1.359 ± 0.12
GluIle: 4.106 ± 0.199
GluLys: 4.179 ± 0.262
GluLeu: 3.916 ± 0.185
GluMet: 1.381 ± 0.104
GluAsn: 3.142 ± 0.164
GluPro: 1.907 ± 0.156
GluGln: 1.593 ± 0.115
GluArg: 2.762 ± 0.176
GluSer: 2.952 ± 0.164
GluThr: 3.595 ± 0.184
GluVal: 2.718 ± 0.186
GluTrp: 0.541 ± 0.068
GluTyr: 2.499 ± 0.18
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.441 ± 0.223
PheCys: 0.928 ± 0.097
PheAsp: 2.893 ± 0.167
PheGlu: 2.828 ± 0.175
PhePhe: 3.485 ± 0.224
PheGly: 3.551 ± 0.244
PheHis: 1.315 ± 0.102
PheIle: 3.785 ± 0.201
PheLys: 2.923 ± 0.137
PheLeu: 4.369 ± 0.226
PheMet: 1.308 ± 0.112
PheAsn: 2.565 ± 0.173
PhePro: 2.718 ± 0.217
PheGln: 1.235 ± 0.098
PheArg: 2.725 ± 0.194
PheSer: 4.296 ± 0.226
PheThr: 3.573 ± 0.202
PheVal: 4.691 ± 0.194
PheTrp: 0.621 ± 0.068
PheTyr: 1.724 ± 0.115
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.988 ± 0.238
GlyCys: 1.111 ± 0.11
GlyAsp: 2.996 ± 0.15
GlyGlu: 2.674 ± 0.203
GlyPhe: 3.215 ± 0.232
GlyGly: 3.96 ± 0.282
GlyHis: 1.03 ± 0.102
GlyIle: 3.595 ± 0.179
GlyLys: 4.632 ± 0.259
GlyLeu: 3.712 ± 0.227
GlyMet: 1.213 ± 0.104
GlyAsn: 4.851 ± 0.912
GlyPro: 1.622 ± 0.163
GlyGln: 1.388 ± 0.122
GlyArg: 2.477 ± 0.162
GlySer: 4.026 ± 0.303
GlyThr: 3.259 ± 0.24
GlyVal: 3.953 ± 0.168
GlyTrp: 0.701 ± 0.081
GlyTyr: 2.272 ± 0.173
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.059 ± 0.078
HisCys: 0.511 ± 0.065
HisAsp: 1.081 ± 0.098
HisGlu: 0.986 ± 0.094
HisPhe: 1.038 ± 0.092
HisGly: 1.103 ± 0.108
HisHis: 0.906 ± 0.084
HisIle: 1.702 ± 0.111
HisLys: 1.235 ± 0.111
HisLeu: 2.06 ± 0.15
HisMet: 0.482 ± 0.061
HisAsn: 0.957 ± 0.088
HisPro: 0.957 ± 0.102
HisGln: 0.716 ± 0.071
HisArg: 1.352 ± 0.12
HisSer: 1.33 ± 0.093
HisThr: 1.476 ± 0.122
HisVal: 1.542 ± 0.104
HisTrp: 0.314 ± 0.048
HisTyr: 0.753 ± 0.085
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.311 ± 0.249
IleCys: 1.607 ± 0.114
IleAsp: 3.858 ± 0.197
IleGlu: 3.368 ± 0.17
IlePhe: 4.135 ± 0.204
IleGly: 3.653 ± 0.259
IleHis: 1.549 ± 0.105
IleIle: 5.736 ± 0.247
IleLys: 4.567 ± 0.208
IleLeu: 6.386 ± 0.295
IleMet: 1.929 ± 0.117
IleAsn: 3.894 ± 0.162
IlePro: 3.712 ± 0.178
IleGln: 2.009 ± 0.108
IleArg: 3.653 ± 0.185
IleSer: 5.736 ± 0.281
IleThr: 4.83 ± 0.299
IleVal: 5.29 ± 0.204
IleTrp: 0.745 ± 0.079
IleTyr: 2.667 ± 0.145
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.835 ± 0.223
LysCys: 1.556 ± 0.159
LysAsp: 3.39 ± 0.189
LysGlu: 3.792 ± 0.225
LysPhe: 3.419 ± 0.176
LysGly: 3.156 ± 0.208
LysHis: 1.775 ± 0.123
LysIle: 5.399 ± 0.234
LysLys: 7.255 ± 0.392
LysLeu: 5.385 ± 0.234
LysMet: 2.272 ± 0.128
LysAsn: 4.72 ± 0.252
LysPro: 4.691 ± 0.492
LysGln: 2.302 ± 0.201
LysArg: 3.456 ± 0.202
LysSer: 4.304 ± 0.216
LysThr: 4.866 ± 0.259
LysVal: 3.88 ± 0.273
LysTrp: 0.723 ± 0.067
LysTyr: 3.441 ± 0.184
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.953 ± 0.201
LeuCys: 1.381 ± 0.11
LeuAsp: 3.653 ± 0.189
LeuGlu: 3.872 ± 0.239
LeuPhe: 4.727 ± 0.219
LeuGly: 4.033 ± 0.244
LeuHis: 1.856 ± 0.152
LeuIle: 4.881 ± 0.2
LeuLys: 5.721 ± 0.27
LeuLeu: 6.737 ± 0.27
LeuMet: 2.155 ± 0.124
LeuAsn: 4.114 ± 0.193
LeuPro: 4.26 ± 0.209
LeuGln: 2.47 ± 0.144
LeuArg: 4.428 ± 0.209
LeuSer: 5.787 ± 0.258
LeuThr: 4.903 ± 0.231
LeuVal: 5.122 ± 0.179
LeuTrp: 0.833 ± 0.082
LeuTyr: 3.069 ± 0.164
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.308 ± 0.106
MetCys: 0.672 ± 0.071
MetAsp: 1.023 ± 0.095
MetGlu: 1.191 ± 0.084
MetPhe: 1.797 ± 0.139
MetGly: 1.242 ± 0.108
MetHis: 0.38 ± 0.061
MetIle: 1.951 ± 0.133
MetLys: 2.316 ± 0.137
MetLeu: 2.199 ± 0.129
MetMet: 0.994 ± 0.101
MetAsn: 1.651 ± 0.105
MetPro: 0.935 ± 0.081
MetGln: 0.577 ± 0.081
MetArg: 1.242 ± 0.114
MetSer: 2.952 ± 0.149
MetThr: 2.185 ± 0.139
MetVal: 1.593 ± 0.121
MetTrp: 0.395 ± 0.052
MetTyr: 1.125 ± 0.078
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.354 ± 0.271
AsnCys: 0.906 ± 0.083
AsnAsp: 2.842 ± 0.16
AsnGlu: 2.535 ± 0.157
AsnPhe: 2.908 ± 0.166
AsnGly: 3.624 ± 0.325
AsnHis: 1.206 ± 0.13
AsnIle: 5.882 ± 0.475
AsnLys: 3.661 ± 0.229
AsnLeu: 4.377 ± 0.316
AsnMet: 1.651 ± 0.107
AsnAsn: 3.193 ± 0.288
AsnPro: 2.345 ± 0.137
AsnGln: 1.198 ± 0.09
AsnArg: 2.243 ± 0.148
AsnSer: 3.931 ± 0.255
AsnThr: 4.252 ± 0.436
AsnVal: 5.794 ± 0.675
AsnTrp: 0.519 ± 0.08
AsnTyr: 1.702 ± 0.177
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.427 ± 0.393
ProCys: 0.694 ± 0.089
ProAsp: 2.25 ± 0.141
ProGlu: 3.127 ± 0.234
ProPhe: 2.097 ± 0.146
ProGly: 2.681 ± 0.194
ProHis: 0.789 ± 0.084
ProIle: 2.594 ± 0.179
ProLys: 4.493 ± 0.448
ProLeu: 3.31 ± 0.177
ProMet: 1.293 ± 0.094
ProAsn: 2.177 ± 0.154
ProPro: 2.594 ± 0.227
ProGln: 1.403 ± 0.153
ProArg: 2.499 ± 0.159
ProSer: 3.77 ± 0.276
ProThr: 3.215 ± 0.177
ProVal: 3.573 ± 0.23
ProTrp: 0.373 ± 0.053
ProTyr: 1.534 ± 0.114
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.235 ± 0.118
GlnCys: 0.526 ± 0.087
GlnAsp: 1.213 ± 0.096
GlnGlu: 1.206 ± 0.103
GlnPhe: 1.14 ± 0.089
GlnGly: 1.257 ± 0.127
GlnHis: 0.694 ± 0.077
GlnIle: 1.673 ± 0.111
GlnLys: 2.141 ± 0.167
GlnLeu: 2.25 ± 0.14
GlnMet: 0.862 ± 0.086
GlnAsn: 1.52 ± 0.126
GlnPro: 1.235 ± 0.125
GlnGln: 1.242 ± 0.167
GlnArg: 1.819 ± 0.166
GlnSer: 1.812 ± 0.126
GlnThr: 2.09 ± 0.141
GlnVal: 1.732 ± 0.177
GlnTrp: 0.373 ± 0.052
GlnTyr: 1.301 ± 0.097
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.418 ± 0.18
ArgCys: 1.118 ± 0.126
ArgAsp: 2.367 ± 0.146
ArgGlu: 2.389 ± 0.177
ArgPhe: 2.455 ± 0.169
ArgGly: 2.747 ± 0.217
ArgHis: 1.352 ± 0.115
ArgIle: 3.398 ± 0.161
ArgLys: 3.573 ± 0.205
ArgLeu: 3.675 ± 0.183
ArgMet: 1.688 ± 0.117
ArgAsn: 2.893 ± 0.183
ArgPro: 2.25 ± 0.17
ArgGln: 1.352 ± 0.131
ArgArg: 2.923 ± 0.218
ArgSer: 3.595 ± 0.207
ArgThr: 2.923 ± 0.143
ArgVal: 3.456 ± 0.198
ArgTrp: 0.65 ± 0.074
ArgTyr: 2.082 ± 0.12
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.092 ± 0.242
SerCys: 1.534 ± 0.106
SerAsp: 3.134 ± 0.16
SerGlu: 3.032 ± 0.162
SerPhe: 4.398 ± 0.208
SerGly: 4.749 ± 0.376
SerHis: 1.439 ± 0.109
SerIle: 5.071 ± 0.224
SerLys: 5.027 ± 0.238
SerLeu: 5.867 ± 0.196
SerMet: 2.06 ± 0.132
SerAsn: 4.362 ± 0.484
SerPro: 3.5 ± 0.305
SerGln: 1.929 ± 0.142
SerArg: 3.807 ± 0.192
SerSer: 6.758 ± 0.465
SerThr: 4.713 ± 0.224
SerVal: 5.575 ± 0.247
SerTrp: 0.891 ± 0.081
SerTyr: 2.484 ± 0.147
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.785 ± 0.437
ThrCys: 1.096 ± 0.113
ThrAsp: 2.901 ± 0.135
ThrGlu: 3.164 ± 0.166
ThrPhe: 3.989 ± 0.227
ThrGly: 4.143 ± 0.319
ThrHis: 1.257 ± 0.108
ThrIle: 4.99 ± 0.233
ThrLys: 4.559 ± 0.238
ThrLeu: 5.085 ± 0.214
ThrMet: 1.71 ± 0.117
ThrAsn: 3.354 ± 0.222
ThrPro: 4.099 ± 0.235
ThrGln: 1.666 ± 0.126
ThrArg: 3.288 ± 0.158
ThrSer: 5.568 ± 0.349
ThrThr: 4.647 ± 0.357
ThrVal: 3.587 ± 0.19
ThrTrp: 0.68 ± 0.08
ThrTyr: 2.082 ± 0.14
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.398 ± 0.202
ValCys: 1.344 ± 0.146
ValAsp: 3.682 ± 0.175
ValGlu: 3.485 ± 0.175
ValPhe: 4.194 ± 0.202
ValGly: 3.463 ± 0.355
ValHis: 1.542 ± 0.114
ValIle: 5.253 ± 0.199
ValLys: 4.866 ± 0.236
ValLeu: 5.882 ± 0.248
ValMet: 2.068 ± 0.126
ValAsn: 4.084 ± 0.21
ValPro: 3.105 ± 0.203
ValGln: 2.17 ± 0.197
ValArg: 3.134 ± 0.146
ValSer: 5.977 ± 0.312
ValThr: 4.143 ± 0.328
ValVal: 5.473 ± 0.245
ValTrp: 0.723 ± 0.111
ValTyr: 2.93 ± 0.156
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.519 ± 0.06
TrpCys: 0.424 ± 0.073
TrpAsp: 0.636 ± 0.073
TrpGlu: 0.68 ± 0.099
TrpPhe: 0.789 ± 0.078
TrpGly: 0.628 ± 0.073
TrpHis: 0.161 ± 0.036
TrpIle: 0.621 ± 0.074
TrpLys: 0.76 ± 0.078
TrpLeu: 0.855 ± 0.091
TrpMet: 0.307 ± 0.045
TrpAsn: 0.84 ± 0.1
TrpPro: 0.263 ± 0.049
TrpGln: 0.263 ± 0.036
TrpArg: 0.453 ± 0.06
TrpSer: 0.891 ± 0.09
TrpThr: 0.533 ± 0.056
TrpVal: 0.672 ± 0.076
TrpTrp: 0.212 ± 0.048
TrpTyr: 0.46 ± 0.056
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.389 ± 0.165
TyrCys: 0.731 ± 0.071
TyrAsp: 2.492 ± 0.149
TyrGlu: 1.973 ± 0.125
TyrPhe: 2.177 ± 0.126
TyrGly: 2.133 ± 0.12
TyrHis: 0.738 ± 0.074
TyrIle: 2.842 ± 0.157
TyrLys: 2.798 ± 0.162
TyrLeu: 2.557 ± 0.163
TyrMet: 0.928 ± 0.082
TyrAsn: 2.448 ± 0.134
TyrPro: 1.33 ± 0.108
TyrGln: 0.972 ± 0.096
TyrArg: 1.702 ± 0.114
TyrSer: 2.696 ± 0.156
TyrThr: 2.565 ± 0.156
TyrVal: 2.557 ± 0.159
TyrTrp: 0.336 ± 0.044
TyrTyr: 1.483 ± 0.11
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 794 proteins (136866 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
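
For readers unfamiliar with protein-level bootstrapping, a minimal sketch is given below: whole proteins are resampled with replacement, the frequency is recomputed for each resampled proteome, and the spread of those values is reported. The function names, the use of one-letter codes, the denominator, and the choice of the population standard deviation as the error measure are all assumptions, not the database's documented procedure.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping two-residue windows in one protein (one-letter codes)."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread (per mille) of one dipeptide's frequency over resampled proteomes.

    Assumptions: proteins are resampled with replacement as whole units,
    frequencies are normalised by the number of dipeptides in each
    resample, and the error is the population standard deviation.
    """
    rng = random.Random(seed)
    # Count once per protein so each bootstrap round only sums counters.
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[dipeptide] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the residues of a protein together, so the estimate reflects variation between proteins.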

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski