Amino acid dipeptide frequency for Panine betaherpesvirus 2 (Chimpanzee cytomegalovirus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins at random with equal probability, each one would be present with a frequency of 1/400 = 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
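As a rough illustration of the calculation described above, the following Python sketch counts overlapping dipeptides and expresses them in per mille. It is not the code used by Proteome-pI: the function names, the toy sequences, the choice to count pairs only within a protein, and the normalisation by the number of counted pairs are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (Xaa/"X" is ignored here).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / len(AMINO_ACIDS) ** 2


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Pairs are counted within each protein only, never across protein
    boundaries (an assumption, not confirmed by the source).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            if pair[0] in AMINO_ACIDS and pair[1] in AMINO_ACIDS:
                counts[pair] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_per_mille):
    """Label a value 'over' (red) or 'under' (blue) relative to 2.5 per mille."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"
    return "expected"


toy = ["MALLLA", "MLLA"]  # hypothetical toy sequences, not real viral proteins
freqs = dipeptide_per_mille(toy)
```

With real data, `proteins` would be the list of protein sequences of the proteome, and `classify(freqs["AA"])` would tell whether AlaAla lies above or below the uniform expectation.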

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
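The unit conversion can be sketched in a few lines of Python. The helper names are hypothetical and chosen only for this example; the AlaAla value is taken from the table.

```python
# Uniform expectation from the text: 1/400 = 0.25 % = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


def fold_vs_uniform(value):
    """Ratio of an observed per mille value to the uniform expectation."""
    return value / EXPECTED_PER_MILLE


ala_ala = 10.645  # AlaAla value taken from the table
fraction = per_mille_to_fraction(ala_ala)   # about 0.010645
fold = fold_vs_uniform(ala_ala)             # about 4.26 times the uniform value
```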

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 10.645 ± 0.819
AlaCys: 1.825 ± 0.168
AlaAsp: 3.285 ± 0.244
AlaGlu: 3.893 ± 0.288
AlaPhe: 2.889 ± 0.232
AlaGly: 5.368 ± 0.423
AlaHis: 1.414 ± 0.177
AlaIle: 2.357 ± 0.201
AlaLys: 1.718 ± 0.14
AlaLeu: 8.197 ± 0.453
AlaMet: 1.399 ± 0.154
AlaAsn: 1.825 ± 0.159
AlaPro: 4.866 ± 0.405
AlaGln: 2.372 ± 0.228
AlaArg: 5.201 ± 0.297
AlaSer: 6.296 ± 0.379
AlaThr: 5.642 ± 0.431
AlaVal: 7.254 ± 0.366
AlaTrp: 1.186 ± 0.16
AlaTyr: 2.236 ± 0.192
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.551 ± 0.18
CysCys: 0.806 ± 0.133
CysAsp: 1.353 ± 0.134
CysGlu: 1.217 ± 0.154
CysPhe: 0.928 ± 0.148
CysGly: 1.399 ± 0.143
CysHis: 0.654 ± 0.092
CysIle: 0.882 ± 0.114
CysLys: 0.593 ± 0.103
CysLeu: 3.011 ± 0.222
CysMet: 0.593 ± 0.099
CysAsn: 0.76 ± 0.105
CysPro: 1.065 ± 0.122
CysGln: 1.171 ± 0.137
CysArg: 1.779 ± 0.191
CysSer: 1.506 ± 0.142
CysThr: 1.414 ± 0.169
CysVal: 2.083 ± 0.189
CysTrp: 0.243 ± 0.067
CysTyr: 0.943 ± 0.106
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.395 ± 0.246
AspCys: 0.836 ± 0.115
AspAsp: 4.775 ± 0.378
AspGlu: 4.517 ± 0.354
AspPhe: 1.658 ± 0.187
AspGly: 3.407 ± 0.232
AspHis: 1.399 ± 0.159
AspIle: 1.718 ± 0.161
AspLys: 1.141 ± 0.129
AspLeu: 5.429 ± 0.267
AspMet: 1.004 ± 0.119
AspAsn: 1.749 ± 0.161
AspPro: 2.768 ± 0.268
AspGln: 1.019 ± 0.139
AspArg: 3.209 ± 0.2
AspSer: 2.935 ± 0.236
AspThr: 2.433 ± 0.168
AspVal: 3.65 ± 0.242
AspTrp: 0.624 ± 0.099
AspTyr: 1.627 ± 0.142
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.197 ± 0.28
GluCys: 1.034 ± 0.132
GluAsp: 4.228 ± 0.324
GluGlu: 5.931 ± 0.51
GluPhe: 1.506 ± 0.17
GluGly: 2.92 ± 0.257
GluHis: 1.627 ± 0.18
GluIle: 2.053 ± 0.165
GluLys: 1.749 ± 0.17
GluLeu: 5.064 ± 0.316
GluMet: 0.943 ± 0.116
GluAsn: 2.053 ± 0.184
GluPro: 2.692 ± 0.235
GluGln: 1.795 ± 0.173
GluArg: 4.562 ± 0.311
GluSer: 3.27 ± 0.206
GluThr: 3.513 ± 0.315
GluVal: 3.315 ± 0.255
GluTrp: 0.654 ± 0.106
GluTyr: 1.673 ± 0.177
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.479 ± 0.225
PheCys: 1.186 ± 0.126
PheAsp: 1.718 ± 0.19
PheGlu: 1.46 ± 0.13
PhePhe: 1.886 ± 0.15
PheGly: 2.083 ± 0.138
PheHis: 1.08 ± 0.161
PheIle: 1.795 ± 0.215
PheLys: 1.171 ± 0.125
PheLeu: 3.954 ± 0.257
PheMet: 1.186 ± 0.139
PheAsn: 1.43 ± 0.142
PhePro: 1.947 ± 0.178
PheGln: 1.247 ± 0.148
PheArg: 2.524 ± 0.197
PheSer: 2.57 ± 0.224
PheThr: 2.327 ± 0.185
PheVal: 3.011 ± 0.228
PheTrp: 0.639 ± 0.099
PheTyr: 1.734 ± 0.178
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.855 ± 0.384
GlyCys: 1.293 ± 0.167
GlyAsp: 3.422 ± 0.258
GlyGlu: 3.817 ± 0.286
GlyPhe: 2.175 ± 0.183
GlyGly: 8.197 ± 0.922
GlyHis: 1.521 ± 0.16
GlyIle: 1.825 ± 0.162
GlyLys: 1.947 ± 0.162
GlyLeu: 6.326 ± 0.384
GlyMet: 1.08 ± 0.116
GlyAsn: 1.947 ± 0.177
GlyPro: 3.011 ± 0.24
GlyGln: 2.007 ± 0.145
GlyArg: 4.395 ± 0.365
GlySer: 4.669 ± 0.293
GlyThr: 4.106 ± 0.254
GlyVal: 4.137 ± 0.307
GlyTrp: 0.943 ± 0.119
GlyTyr: 1.764 ± 0.151
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.19 ± 0.193
HisCys: 0.563 ± 0.099
HisAsp: 1.597 ± 0.145
HisGlu: 1.566 ± 0.138
HisPhe: 0.989 ± 0.111
HisGly: 2.357 ± 0.201
HisHis: 1.764 ± 0.25
HisIle: 0.912 ± 0.133
HisLys: 0.776 ± 0.129
HisLeu: 2.92 ± 0.2
HisMet: 0.593 ± 0.089
HisAsn: 1.141 ± 0.114
HisPro: 1.916 ± 0.201
HisGln: 1.353 ± 0.161
HisArg: 2.677 ± 0.204
HisSer: 1.384 ± 0.197
HisThr: 1.855 ± 0.195
HisVal: 2.038 ± 0.196
HisTrp: 0.38 ± 0.083
HisTyr: 0.989 ± 0.124
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.007 ± 0.176
IleCys: 0.958 ± 0.123
IleAsp: 1.566 ± 0.163
IleGlu: 1.323 ± 0.122
IlePhe: 1.612 ± 0.156
IleGly: 1.566 ± 0.193
IleHis: 1.049 ± 0.119
IleIle: 1.475 ± 0.18
IleLys: 1.171 ± 0.104
IleLeu: 3.559 ± 0.321
IleMet: 0.867 ± 0.133
IleAsn: 1.141 ± 0.145
IlePro: 2.129 ± 0.162
IleGln: 1.43 ± 0.169
IleArg: 2.327 ± 0.227
IleSer: 2.737 ± 0.239
IleThr: 2.266 ± 0.219
IleVal: 2.874 ± 0.222
IleTrp: 0.593 ± 0.096
IleTyr: 1.703 ± 0.168
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.114 ± 0.192
LysCys: 0.624 ± 0.1
LysAsp: 1.247 ± 0.153
LysGlu: 1.551 ± 0.166
LysPhe: 0.776 ± 0.114
LysGly: 1.445 ± 0.144
LysHis: 1.049 ± 0.127
LysIle: 1.308 ± 0.135
LysLys: 2.129 ± 0.238
LysLeu: 2.555 ± 0.243
LysMet: 0.897 ± 0.113
LysAsn: 1.323 ± 0.125
LysPro: 1.627 ± 0.221
LysGln: 1.08 ± 0.129
LysArg: 2.54 ± 0.198
LysSer: 1.992 ± 0.194
LysThr: 2.19 ± 0.215
LysVal: 1.749 ± 0.165
LysTrp: 0.365 ± 0.068
LysTyr: 1.141 ± 0.119
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.33 ± 0.358
LeuCys: 3.635 ± 0.234
LeuAsp: 4.106 ± 0.318
LeuGlu: 4.775 ± 0.301
LeuPhe: 4.456 ± 0.296
LeuGly: 5.885 ± 0.32
LeuHis: 2.966 ± 0.193
LeuIle: 3.878 ± 0.277
LeuLys: 3.194 ± 0.233
LeuLeu: 11.968 ± 0.599
LeuMet: 2.433 ± 0.228
LeuAsn: 3.27 ± 0.236
LeuPro: 5.672 ± 0.296
LeuGln: 3.65 ± 0.272
LeuArg: 8.045 ± 0.4
LeuSer: 7.847 ± 0.394
LeuThr: 6.57 ± 0.307
LeuVal: 6.296 ± 0.313
LeuTrp: 1.49 ± 0.168
LeuTyr: 3.574 ± 0.249
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.642 ± 0.146
MetCys: 0.441 ± 0.082
MetAsp: 0.928 ± 0.114
MetGlu: 1.08 ± 0.138
MetPhe: 1.019 ± 0.124
MetGly: 1.369 ± 0.148
MetHis: 0.578 ± 0.097
MetIle: 0.973 ± 0.141
MetLys: 0.715 ± 0.109
MetLeu: 2.327 ± 0.208
MetMet: 0.639 ± 0.106
MetAsn: 0.76 ± 0.117
MetPro: 1.11 ± 0.126
MetGln: 0.624 ± 0.107
MetArg: 1.49 ± 0.151
MetSer: 2.053 ± 0.165
MetThr: 1.323 ± 0.135
MetVal: 1.262 ± 0.157
MetTrp: 0.517 ± 0.092
MetTyr: 0.624 ± 0.083
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.677 ± 0.2
AsnCys: 0.776 ± 0.12
AsnAsp: 1.399 ± 0.149
AsnGlu: 1.612 ± 0.158
AsnPhe: 1.414 ± 0.124
AsnGly: 2.129 ± 0.198
AsnHis: 1.201 ± 0.135
AsnIle: 1.338 ± 0.175
AsnLys: 1.125 ± 0.137
AsnLeu: 2.798 ± 0.215
AsnMet: 0.7 ± 0.123
AsnAsn: 1.521 ± 0.187
AsnPro: 1.658 ± 0.161
AsnGln: 0.882 ± 0.133
AsnArg: 1.901 ± 0.165
AsnSer: 2.312 ± 0.22
AsnThr: 2.266 ± 0.245
AsnVal: 2.783 ± 0.261
AsnTrp: 0.319 ± 0.064
AsnTyr: 1.065 ± 0.13
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.11 ± 0.425
ProCys: 1.156 ± 0.148
ProAsp: 2.859 ± 0.179
ProGlu: 3.178 ± 0.268
ProPhe: 1.658 ± 0.182
ProGly: 3.559 ± 0.236
ProHis: 1.81 ± 0.184
ProIle: 1.141 ± 0.138
ProLys: 1.901 ± 0.191
ProLeu: 5.642 ± 0.255
ProMet: 1.247 ± 0.147
ProAsn: 1.217 ± 0.117
ProPro: 8.227 ± 0.855
ProGln: 2.616 ± 0.25
ProArg: 4.714 ± 0.334
ProSer: 5.946 ± 0.452
ProThr: 3.528 ± 0.274
ProVal: 4.213 ± 0.263
ProTrp: 0.821 ± 0.117
ProTyr: 1.779 ± 0.155
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.992 ± 0.188
GlnCys: 0.7 ± 0.13
GlnAsp: 1.673 ± 0.188
GlnGlu: 2.007 ± 0.174
GlnPhe: 1.232 ± 0.132
GlnGly: 1.475 ± 0.13
GlnHis: 1.384 ± 0.184
GlnIle: 1.262 ± 0.114
GlnLys: 1.521 ± 0.149
GlnLeu: 3.787 ± 0.315
GlnMet: 1.049 ± 0.123
GlnAsn: 1.232 ± 0.167
GlnPro: 2.372 ± 0.222
GlnGln: 2.966 ± 0.38
GlnArg: 3.832 ± 0.272
GlnSer: 2.099 ± 0.182
GlnThr: 2.874 ± 0.205
GlnVal: 2.175 ± 0.207
GlnTrp: 0.289 ± 0.072
GlnTyr: 0.989 ± 0.153
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.821 ± 0.297
ArgCys: 1.749 ± 0.168
ArgAsp: 4.821 ± 0.292
ArgGlu: 3.802 ± 0.237
ArgPhe: 2.783 ± 0.205
ArgGly: 4.79 ± 0.338
ArgHis: 3.3 ± 0.234
ArgIle: 2.175 ± 0.167
ArgLys: 1.992 ± 0.165
ArgLeu: 8.151 ± 0.41
ArgMet: 1.095 ± 0.141
ArgAsn: 2.54 ± 0.225
ArgPro: 4.41 ± 0.316
ArgGln: 3.787 ± 0.288
ArgArg: 9.049 ± 0.554
ArgSer: 4.562 ± 0.294
ArgThr: 4.03 ± 0.297
ArgVal: 5.064 ± 0.29
ArgTrp: 1.156 ± 0.186
ArgTyr: 2.859 ± 0.217
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.144 ± 0.426
SerCys: 1.597 ± 0.15
SerAsp: 3.315 ± 0.201
SerGlu: 3.68 ± 0.226
SerPhe: 2.57 ± 0.198
SerGly: 5.794 ± 0.407
SerHis: 2.023 ± 0.211
SerIle: 2.114 ± 0.176
SerLys: 1.536 ± 0.171
SerLeu: 7.3 ± 0.381
SerMet: 1.582 ± 0.163
SerAsn: 2.251 ± 0.199
SerPro: 5.186 ± 0.358
SerGln: 2.783 ± 0.208
SerArg: 5.292 ± 0.39
SerSer: 11.178 ± 0.861
SerThr: 5.11 ± 0.431
SerVal: 5.414 ± 0.344
SerTrp: 0.867 ± 0.109
SerTyr: 2.251 ± 0.202
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.961 ± 0.364
ThrCys: 1.612 ± 0.147
ThrAsp: 2.494 ± 0.23
ThrGlu: 3.133 ± 0.22
ThrPhe: 2.418 ± 0.19
ThrGly: 3.772 ± 0.243
ThrHis: 1.749 ± 0.166
ThrIle: 2.083 ± 0.223
ThrLys: 1.795 ± 0.193
ThrLeu: 5.749 ± 0.319
ThrMet: 1.308 ± 0.148
ThrAsn: 1.688 ± 0.204
ThrPro: 4.517 ± 0.411
ThrGln: 2.144 ± 0.173
ThrArg: 4.152 ± 0.288
ThrSer: 5.733 ± 0.453
ThrThr: 7.102 ± 0.838
ThrVal: 5.992 ± 0.377
ThrTrp: 1.065 ± 0.12
ThrTyr: 1.81 ± 0.164
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.581 ± 0.328
ValCys: 1.916 ± 0.175
ValAsp: 2.92 ± 0.235
ValGlu: 3.741 ± 0.257
ValPhe: 3.65 ± 0.262
ValGly: 3.984 ± 0.317
ValHis: 1.871 ± 0.151
ValIle: 3.315 ± 0.233
ValLys: 2.129 ± 0.185
ValLeu: 7.102 ± 0.372
ValMet: 1.749 ± 0.18
ValAsn: 2.388 ± 0.186
ValPro: 4.684 ± 0.302
ValGln: 2.129 ± 0.172
ValArg: 5.125 ± 0.28
ValSer: 5.901 ± 0.343
ValThr: 5.186 ± 0.338
ValVal: 5.536 ± 0.314
ValTrp: 1.156 ± 0.136
ValTyr: 2.753 ± 0.182
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.958 ± 0.108
TrpCys: 0.456 ± 0.092
TrpAsp: 0.715 ± 0.115
TrpGlu: 0.73 ± 0.101
TrpPhe: 0.563 ± 0.112
TrpGly: 0.791 ± 0.118
TrpHis: 0.259 ± 0.059
TrpIle: 0.654 ± 0.098
TrpLys: 0.471 ± 0.101
TrpLeu: 1.642 ± 0.16
TrpMet: 0.395 ± 0.082
TrpAsn: 0.441 ± 0.079
TrpPro: 0.958 ± 0.131
TrpGln: 0.715 ± 0.096
TrpArg: 1.095 ± 0.12
TrpSer: 0.867 ± 0.119
TrpThr: 0.669 ± 0.094
TrpVal: 0.928 ± 0.102
TrpTrp: 0.502 ± 0.124
TrpTyr: 0.487 ± 0.078
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.22 ± 0.168
TyrCys: 0.791 ± 0.115
TyrAsp: 1.871 ± 0.181
TyrGlu: 1.764 ± 0.17
TyrPhe: 1.247 ± 0.144
TyrGly: 2.144 ± 0.191
TyrHis: 1.262 ± 0.144
TyrIle: 1.095 ± 0.148
TyrLys: 0.928 ± 0.145
TyrLeu: 3.559 ± 0.283
TyrMet: 0.7 ± 0.109
TyrAsn: 1.201 ± 0.179
TyrPro: 1.506 ± 0.156
TyrGln: 1.141 ± 0.128
TyrArg: 3.026 ± 0.186
TyrSer: 2.129 ± 0.204
TyrThr: 1.871 ± 0.21
TyrVal: 3.042 ± 0.201
TyrTrp: 0.502 ± 0.086
TyrTyr: 1.308 ± 0.134
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 169 proteins (65757 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
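A protein-level bootstrap of this kind can be sketched as follows in Python. This is only an illustration under stated assumptions, not the database's own procedure: whole proteins are resampled with replacement 100 times, the reported error is assumed to be the standard deviation of the resampled estimates, and the function names, seed and toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille of all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev).

    Treating the standard deviation as the reported error is an assumption.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.mean(estimates), statistics.stdev(estimates)


toy = ["MALLLA", "MLLA", "MAAAL", "MKLLV"]  # hypothetical toy sequences
mean, err = bootstrap_error(toy, "LL")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.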

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski