Amino acid dipepetide frequency for Cyprinid herpesvirus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.906AlaAla: 9.906 ± 0.542
1.602AlaCys: 1.602 ± 0.165
3.929AlaAsp: 3.929 ± 0.226
4.781AlaGlu: 4.781 ± 0.315
2.442AlaPhe: 2.442 ± 0.158
3.548AlaGly: 3.548 ± 0.207
1.717AlaHis: 1.717 ± 0.175
3.141AlaIle: 3.141 ± 0.211
3.777AlaLys: 3.777 ± 0.292
6.651AlaLeu: 6.651 ± 0.314
2.225AlaMet: 2.225 ± 0.164
2.683AlaAsn: 2.683 ± 0.192
4.057AlaPro: 4.057 ± 0.317
2.683AlaGln: 2.683 ± 0.231
3.637AlaArg: 3.637 ± 0.236
6.447AlaSer: 6.447 ± 0.384
5.621AlaThr: 5.621 ± 0.425
5.519AlaVal: 5.519 ± 0.249
0.852AlaTrp: 0.852 ± 0.119
2.136AlaTyr: 2.136 ± 0.143
0.0AlaXaa: 0.0 ± 0.0
Cys
1.628CysAla: 1.628 ± 0.166
0.877CysCys: 0.877 ± 0.114
1.17CysAsp: 1.17 ± 0.165
1.335CysGlu: 1.335 ± 0.144
0.839CysPhe: 0.839 ± 0.111
1.348CysGly: 1.348 ± 0.127
0.42CysHis: 0.42 ± 0.074
1.144CysIle: 1.144 ± 0.146
0.839CysLys: 0.839 ± 0.124
2.111CysLeu: 2.111 ± 0.212
0.687CysMet: 0.687 ± 0.099
0.852CysAsn: 0.852 ± 0.094
1.412CysPro: 1.412 ± 0.199
0.496CysGln: 0.496 ± 0.079
0.979CysArg: 0.979 ± 0.111
1.704CysSer: 1.704 ± 0.184
1.373CysThr: 1.373 ± 0.125
1.806CysVal: 1.806 ± 0.169
0.331CysTrp: 0.331 ± 0.078
0.725CysTyr: 0.725 ± 0.104
0.0CysXaa: 0.0 ± 0.0
Asp
4.451AspAla: 4.451 ± 0.224
1.03AspCys: 1.03 ± 0.118
6.18AspAsp: 6.18 ± 0.414
4.858AspGlu: 4.858 ± 0.379
2.416AspPhe: 2.416 ± 0.158
3.09AspGly: 3.09 ± 0.207
1.208AspHis: 1.208 ± 0.139
2.34AspIle: 2.34 ± 0.202
2.34AspLys: 2.34 ± 0.23
5.341AspLeu: 5.341 ± 0.301
1.704AspMet: 1.704 ± 0.153
2.225AspAsn: 2.225 ± 0.161
3.065AspPro: 3.065 ± 0.187
2.098AspGln: 2.098 ± 0.129
2.938AspArg: 2.938 ± 0.209
4.349AspSer: 4.349 ± 0.336
3.561AspThr: 3.561 ± 0.209
4.565AspVal: 4.565 ± 0.235
0.75AspTrp: 0.75 ± 0.098
2.111AspTyr: 2.111 ± 0.153
0.0AspXaa: 0.0 ± 0.0
Glu
4.692GluAla: 4.692 ± 0.354
1.246GluCys: 1.246 ± 0.153
4.387GluAsp: 4.387 ± 0.277
5.697GluGlu: 5.697 ± 0.54
1.933GluPhe: 1.933 ± 0.158
2.289GluGly: 2.289 ± 0.175
1.246GluHis: 1.246 ± 0.131
2.365GluIle: 2.365 ± 0.179
3.065GluLys: 3.065 ± 0.27
5.608GluLeu: 5.608 ± 0.348
1.64GluMet: 1.64 ± 0.147
2.098GluAsn: 2.098 ± 0.191
3.51GluPro: 3.51 ± 0.311
2.353GluGln: 2.353 ± 0.206
3.408GluArg: 3.408 ± 0.232
4.781GluSer: 4.781 ± 0.285
4.26GluThr: 4.26 ± 0.258
2.632GluVal: 2.632 ± 0.192
0.966GluTrp: 0.966 ± 0.134
2.047GluTyr: 2.047 ± 0.151
0.0GluXaa: 0.0 ± 0.0
Phe
2.353PheAla: 2.353 ± 0.197
0.954PheCys: 0.954 ± 0.114
2.403PheAsp: 2.403 ± 0.17
2.022PheGlu: 2.022 ± 0.166
1.564PhePhe: 1.564 ± 0.185
2.06PheGly: 2.06 ± 0.162
0.661PheHis: 0.661 ± 0.102
1.412PheIle: 1.412 ± 0.163
2.251PheLys: 2.251 ± 0.207
2.81PheLeu: 2.81 ± 0.186
1.17PheMet: 1.17 ± 0.099
1.946PheAsn: 1.946 ± 0.163
0.954PhePro: 0.954 ± 0.115
1.246PheGln: 1.246 ± 0.117
1.971PheArg: 1.971 ± 0.151
2.925PheSer: 2.925 ± 0.198
2.238PheThr: 2.238 ± 0.184
2.836PheVal: 2.836 ± 0.211
0.509PheTrp: 0.509 ± 0.083
1.31PheTyr: 1.31 ± 0.138
0.0PheXaa: 0.0 ± 0.0
Gly
3.866GlyAla: 3.866 ± 0.241
1.081GlyCys: 1.081 ± 0.136
2.543GlyAsp: 2.543 ± 0.176
2.785GlyGlu: 2.785 ± 0.188
1.895GlyPhe: 1.895 ± 0.161
4.553GlyGly: 4.553 ± 0.401
1.284GlyHis: 1.284 ± 0.154
2.124GlyIle: 2.124 ± 0.187
2.035GlyLys: 2.035 ± 0.161
4.629GlyLeu: 4.629 ± 0.248
1.272GlyMet: 1.272 ± 0.167
2.035GlyAsn: 2.035 ± 0.222
2.696GlyPro: 2.696 ± 0.249
1.462GlyGln: 1.462 ± 0.154
2.785GlyArg: 2.785 ± 0.208
4.209GlySer: 4.209 ± 0.257
3.637GlyThr: 3.637 ± 0.243
3.612GlyVal: 3.612 ± 0.212
0.827GlyTrp: 0.827 ± 0.105
1.958GlyTyr: 1.958 ± 0.149
0.0GlyXaa: 0.0 ± 0.0
His
1.564HisAla: 1.564 ± 0.137
0.712HisCys: 0.712 ± 0.099
1.055HisAsp: 1.055 ± 0.13
1.068HisGlu: 1.068 ± 0.125
0.852HisPhe: 0.852 ± 0.127
1.043HisGly: 1.043 ± 0.107
0.865HisHis: 0.865 ± 0.133
1.03HisIle: 1.03 ± 0.131
1.157HisLys: 1.157 ± 0.118
1.92HisLeu: 1.92 ± 0.171
0.585HisMet: 0.585 ± 0.079
0.903HisAsn: 0.903 ± 0.093
1.017HisPro: 1.017 ± 0.129
0.928HisGln: 0.928 ± 0.133
1.412HisArg: 1.412 ± 0.157
1.462HisSer: 1.462 ± 0.128
1.488HisThr: 1.488 ± 0.139
1.551HisVal: 1.551 ± 0.168
0.28HisTrp: 0.28 ± 0.062
0.839HisTyr: 0.839 ± 0.096
0.0HisXaa: 0.0 ± 0.0
Ile
2.556IleAla: 2.556 ± 0.177
0.572IleCys: 0.572 ± 0.089
2.594IleAsp: 2.594 ± 0.172
2.505IleGlu: 2.505 ± 0.19
1.501IlePhe: 1.501 ± 0.141
1.958IleGly: 1.958 ± 0.166
0.877IleHis: 0.877 ± 0.091
1.958IleIle: 1.958 ± 0.167
3.09IleLys: 3.09 ± 0.2
3.561IleLeu: 3.561 ± 0.234
1.234IleMet: 1.234 ± 0.116
2.314IleAsn: 2.314 ± 0.17
2.225IlePro: 2.225 ± 0.163
1.615IleGln: 1.615 ± 0.133
2.302IleArg: 2.302 ± 0.151
2.887IleSer: 2.887 ± 0.207
2.632IleThr: 2.632 ± 0.196
3.217IleVal: 3.217 ± 0.195
0.521IleTrp: 0.521 ± 0.079
1.323IleTyr: 1.323 ± 0.122
0.0IleXaa: 0.0 ± 0.0
Lys
3.917LysAla: 3.917 ± 0.33
1.119LysCys: 1.119 ± 0.13
2.785LysAsp: 2.785 ± 0.212
2.874LysGlu: 2.874 ± 0.268
1.462LysPhe: 1.462 ± 0.131
1.653LysGly: 1.653 ± 0.168
1.361LysHis: 1.361 ± 0.151
2.187LysIle: 2.187 ± 0.177
4.667LysLys: 4.667 ± 0.542
5.455LysLeu: 5.455 ± 0.236
1.246LysMet: 1.246 ± 0.116
2.264LysAsn: 2.264 ± 0.147
3.294LysPro: 3.294 ± 0.304
2.289LysGln: 2.289 ± 0.181
4.438LysArg: 4.438 ± 0.273
3.942LysSer: 3.942 ± 0.382
4.273LysThr: 4.273 ± 0.295
2.594LysVal: 2.594 ± 0.221
0.598LysTrp: 0.598 ± 0.103
1.246LysTyr: 1.246 ± 0.15
0.0LysXaa: 0.0 ± 0.0
Leu
5.926LeuAla: 5.926 ± 0.265
2.289LeuCys: 2.289 ± 0.188
5.862LeuAsp: 5.862 ± 0.339
5.29LeuGlu: 5.29 ± 0.435
3.433LeuPhe: 3.433 ± 0.186
4.54LeuGly: 4.54 ± 0.278
1.602LeuHis: 1.602 ± 0.138
3.51LeuIle: 3.51 ± 0.219
5.455LeuLys: 5.455 ± 0.278
8.113LeuLeu: 8.113 ± 0.393
3.065LeuMet: 3.065 ± 0.193
3.815LeuAsn: 3.815 ± 0.24
4.209LeuPro: 4.209 ± 0.259
3.192LeuGln: 3.192 ± 0.256
4.692LeuArg: 4.692 ± 0.245
6.447LeuSer: 6.447 ± 0.302
5.481LeuThr: 5.481 ± 0.28
5.506LeuVal: 5.506 ± 0.282
1.005LeuTrp: 1.005 ± 0.112
2.925LeuTyr: 2.925 ± 0.23
0.0LeuXaa: 0.0 ± 0.0
Met
2.543MetAla: 2.543 ± 0.183
0.776MetCys: 0.776 ± 0.097
1.755MetAsp: 1.755 ± 0.14
1.284MetGlu: 1.284 ± 0.132
1.081MetPhe: 1.081 ± 0.111
1.729MetGly: 1.729 ± 0.139
0.598MetHis: 0.598 ± 0.097
1.094MetIle: 1.094 ± 0.131
1.31MetLys: 1.31 ± 0.212
2.353MetLeu: 2.353 ± 0.182
1.17MetMet: 1.17 ± 0.163
1.017MetAsn: 1.017 ± 0.121
1.259MetPro: 1.259 ± 0.147
0.763MetGln: 0.763 ± 0.111
1.551MetArg: 1.551 ± 0.161
2.365MetSer: 2.365 ± 0.18
1.755MetThr: 1.755 ± 0.142
2.327MetVal: 2.327 ± 0.17
0.394MetTrp: 0.394 ± 0.069
0.966MetTyr: 0.966 ± 0.117
0.0MetXaa: 0.0 ± 0.0
Asn
3.192AsnAla: 3.192 ± 0.243
0.687AsnCys: 0.687 ± 0.091
1.768AsnAsp: 1.768 ± 0.206
1.831AsnGlu: 1.831 ± 0.124
1.437AsnPhe: 1.437 ± 0.135
2.607AsnGly: 2.607 ± 0.197
1.043AsnHis: 1.043 ± 0.132
2.136AsnIle: 2.136 ± 0.187
2.416AsnLys: 2.416 ± 0.192
3.383AsnLeu: 3.383 ± 0.203
1.221AsnMet: 1.221 ± 0.123
2.62AsnAsn: 2.62 ± 0.186
2.327AsnPro: 2.327 ± 0.219
1.323AsnGln: 1.323 ± 0.114
2.48AsnArg: 2.48 ± 0.167
2.683AsnSer: 2.683 ± 0.224
3.052AsnThr: 3.052 ± 0.218
3.243AsnVal: 3.243 ± 0.188
0.471AsnTrp: 0.471 ± 0.064
1.144AsnTyr: 1.144 ± 0.135
0.0AsnXaa: 0.0 ± 0.0
Pro
3.879ProAla: 3.879 ± 0.275
0.903ProCys: 0.903 ± 0.131
2.976ProAsp: 2.976 ± 0.224
3.281ProGlu: 3.281 ± 0.253
1.844ProPhe: 1.844 ± 0.176
2.518ProGly: 2.518 ± 0.172
0.916ProHis: 0.916 ± 0.106
2.175ProIle: 2.175 ± 0.2
2.683ProLys: 2.683 ± 0.205
3.993ProLeu: 3.993 ± 0.262
1.183ProMet: 1.183 ± 0.129
1.857ProAsn: 1.857 ± 0.162
4.591ProPro: 4.591 ± 0.411
2.111ProGln: 2.111 ± 0.239
2.518ProArg: 2.518 ± 0.214
5.265ProSer: 5.265 ± 0.37
4.858ProThr: 4.858 ± 0.469
4.324ProVal: 4.324 ± 0.285
0.458ProTrp: 0.458 ± 0.066
1.869ProTyr: 1.869 ± 0.157
0.0ProXaa: 0.0 ± 0.0
Gln
2.696GlnAla: 2.696 ± 0.192
0.496GlnCys: 0.496 ± 0.071
1.615GlnAsp: 1.615 ± 0.17
2.225GlnGlu: 2.225 ± 0.241
1.157GlnPhe: 1.157 ± 0.16
1.45GlnGly: 1.45 ± 0.134
0.801GlnHis: 0.801 ± 0.113
1.78GlnIle: 1.78 ± 0.159
1.717GlnLys: 1.717 ± 0.17
3.472GlnLeu: 3.472 ± 0.249
0.928GlnMet: 0.928 ± 0.152
1.323GlnAsn: 1.323 ± 0.15
2.696GlnPro: 2.696 ± 0.295
5.227GlnGln: 5.227 ± 0.756
2.518GlnArg: 2.518 ± 0.205
2.747GlnSer: 2.747 ± 0.221
2.543GlnThr: 2.543 ± 0.215
2.149GlnVal: 2.149 ± 0.201
0.61GlnTrp: 0.61 ± 0.104
1.272GlnTyr: 1.272 ± 0.13
0.0GlnXaa: 0.0 ± 0.0
Arg
3.993ArgAla: 3.993 ± 0.24
1.323ArgCys: 1.323 ± 0.153
3.357ArgAsp: 3.357 ± 0.266
3.281ArgGlu: 3.281 ± 0.218
2.111ArgPhe: 2.111 ± 0.186
3.014ArgGly: 3.014 ± 0.218
1.399ArgHis: 1.399 ± 0.122
2.213ArgIle: 2.213 ± 0.173
3.294ArgLys: 3.294 ± 0.239
5.468ArgLeu: 5.468 ± 0.304
1.335ArgMet: 1.335 ± 0.147
2.187ArgAsn: 2.187 ± 0.218
2.836ArgPro: 2.836 ± 0.2
2.098ArgGln: 2.098 ± 0.187
4.324ArgArg: 4.324 ± 0.343
4.209ArgSer: 4.209 ± 0.28
3.383ArgThr: 3.383 ± 0.187
3.701ArgVal: 3.701 ± 0.236
0.725ArgTrp: 0.725 ± 0.129
1.806ArgTyr: 1.806 ± 0.133
0.0ArgXaa: 0.0 ± 0.0
Ser
5.735SerAla: 5.735 ± 0.306
1.539SerCys: 1.539 ± 0.149
5.544SerAsp: 5.544 ± 0.379
5.392SerGlu: 5.392 ± 0.298
2.645SerPhe: 2.645 ± 0.19
4.018SerGly: 4.018 ± 0.257
1.373SerHis: 1.373 ± 0.154
3.128SerIle: 3.128 ± 0.192
4.298SerLys: 4.298 ± 0.355
5.837SerLeu: 5.837 ± 0.289
2.149SerMet: 2.149 ± 0.215
3.459SerAsn: 3.459 ± 0.266
4.006SerPro: 4.006 ± 0.486
2.81SerGln: 2.81 ± 0.326
4.476SerArg: 4.476 ± 0.262
9.537SerSer: 9.537 ± 0.785
6.168SerThr: 6.168 ± 0.472
5.939SerVal: 5.939 ± 0.235
0.89SerTrp: 0.89 ± 0.098
2.187SerTyr: 2.187 ± 0.185
0.0SerXaa: 0.0 ± 0.0
Thr
5.812ThrAla: 5.812 ± 0.251
1.297ThrCys: 1.297 ± 0.158
3.904ThrAsp: 3.904 ± 0.217
3.828ThrGlu: 3.828 ± 0.304
2.416ThrPhe: 2.416 ± 0.179
3.739ThrGly: 3.739 ± 0.293
1.361ThrHis: 1.361 ± 0.11
2.963ThrIle: 2.963 ± 0.219
3.433ThrLys: 3.433 ± 0.249
5.875ThrLeu: 5.875 ± 0.322
1.997ThrMet: 1.997 ± 0.205
2.581ThrAsn: 2.581 ± 0.203
4.069ThrPro: 4.069 ± 0.405
2.391ThrGln: 2.391 ± 0.185
3.306ThrArg: 3.306 ± 0.23
5.837ThrSer: 5.837 ± 0.376
9.69ThrThr: 9.69 ± 2.238
6.562ThrVal: 6.562 ± 0.396
0.941ThrTrp: 0.941 ± 0.107
2.047ThrTyr: 2.047 ± 0.186
0.0ThrXaa: 0.0 ± 0.0
Val
5.481ValAla: 5.481 ± 0.327
2.213ValCys: 2.213 ± 0.191
4.222ValAsp: 4.222 ± 0.254
3.891ValGlu: 3.891 ± 0.23
2.67ValPhe: 2.67 ± 0.203
3.497ValGly: 3.497 ± 0.27
1.793ValHis: 1.793 ± 0.158
2.785ValIle: 2.785 ± 0.193
3.408ValLys: 3.408 ± 0.236
6.346ValLeu: 6.346 ± 0.291
1.704ValMet: 1.704 ± 0.14
2.836ValAsn: 2.836 ± 0.208
3.879ValPro: 3.879 ± 0.265
2.709ValGln: 2.709 ± 0.204
3.726ValArg: 3.726 ± 0.329
5.786ValSer: 5.786 ± 0.269
4.476ValThr: 4.476 ± 0.311
5.214ValVal: 5.214 ± 0.323
0.979ValTrp: 0.979 ± 0.116
2.734ValTyr: 2.734 ± 0.209
0.0ValXaa: 0.0 ± 0.0
Trp
0.763TrpAla: 0.763 ± 0.109
0.56TrpCys: 0.56 ± 0.091
0.598TrpAsp: 0.598 ± 0.087
0.547TrpGlu: 0.547 ± 0.082
0.509TrpPhe: 0.509 ± 0.084
0.649TrpGly: 0.649 ± 0.096
0.292TrpHis: 0.292 ± 0.064
0.699TrpIle: 0.699 ± 0.08
0.598TrpLys: 0.598 ± 0.077
1.234TrpLeu: 1.234 ± 0.144
0.331TrpMet: 0.331 ± 0.067
0.496TrpAsn: 0.496 ± 0.1
0.623TrpPro: 0.623 ± 0.103
0.292TrpGln: 0.292 ± 0.06
0.852TrpArg: 0.852 ± 0.103
1.132TrpSer: 1.132 ± 0.133
0.966TrpThr: 0.966 ± 0.104
0.852TrpVal: 0.852 ± 0.115
0.254TrpTrp: 0.254 ± 0.069
0.585TrpTyr: 0.585 ± 0.083
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.403TyrAla: 2.403 ± 0.188
0.801TyrCys: 0.801 ± 0.105
1.971TyrAsp: 1.971 ± 0.188
1.45TyrGlu: 1.45 ± 0.141
1.386TyrPhe: 1.386 ± 0.133
1.997TyrGly: 1.997 ± 0.16
0.916TyrHis: 0.916 ± 0.119
1.259TyrIle: 1.259 ± 0.135
1.831TyrLys: 1.831 ± 0.183
2.2TyrLeu: 2.2 ± 0.154
1.234TyrMet: 1.234 ± 0.152
1.551TyrAsn: 1.551 ± 0.16
1.475TyrPro: 1.475 ± 0.152
1.335TyrGln: 1.335 ± 0.131
1.742TyrArg: 1.742 ± 0.192
2.416TyrSer: 2.416 ± 0.174
2.594TyrThr: 2.594 ± 0.177
2.2TyrVal: 2.2 ± 0.173
0.471TyrTrp: 0.471 ± 0.082
1.221TyrTyr: 1.221 ± 0.132
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 150 proteins (78638 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski