Amino acid dipeptide frequency for Cynomolgus cytomegalovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
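As a minimal sketch of how such a frequency can be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the choice to normalize by the total number of counted dipeptides are assumptions made for illustration; the exact procedure used for this table is not described here. The sketch uses one-letter codes, whereas the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of overlapping dipeptides.

    `proteins` is an iterable of one-letter amino acid strings.
    Pairs are counted within each protein only, never across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalize by the total number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the proteome.
toy = ["MALLS", "LLA"]
freqs = dipeptide_frequencies(toy)
```

In the toy input there are six dipeptides in total and "LL" occurs twice, so its frequency is one third of all pairs.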

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25%. As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
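The expectation can be worked out directly: 1/400 = 0.25%, which is 2.5‰ in the units of the table (about 2.27‰ if 441 pairs are considered). The sketch below compares two values from the table against that baseline. The helper names are hypothetical, and the simple greater-than/less-than rule is an assumption; the exact criterion behind the red and blue marking is not stated on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency of one dipeptide under a uniform model."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"
    if observed_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()       # 20 x 20 = 400 pairs
expected_with_xaa = uniform_expectation_per_mille(21)  # 21 x 21 = 441 pairs

# Two values copied from the table (per mille).
examples = {"LeuLeu": 10.576, "TrpTrp": 0.338}
labels = {name: classify(value, expected) for name, value in examples.items()}
```

Under this rule LeuLeu counts as overrepresented and TrpTrp as underrepresented.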

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions; for example, 6.799‰ corresponds to a fraction of 0.006799 (about 0.68%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.799 ± 0.572
AlaCys: 1.8 ± 0.184
AlaAsp: 2.845 ± 0.23
AlaGlu: 3.391 ± 0.24
AlaPhe: 2.668 ± 0.221
AlaGly: 2.861 ± 0.274
AlaHis: 1.672 ± 0.159
AlaIle: 3.52 ± 0.215
AlaLys: 2.315 ± 0.265
AlaLeu: 6.719 ± 0.336
AlaMet: 1.575 ± 0.158
AlaAsn: 2.62 ± 0.209
AlaPro: 3.616 ± 0.308
AlaGln: 2.459 ± 0.213
AlaArg: 3.279 ± 0.239
AlaSer: 5.481 ± 0.313
AlaThr: 4.597 ± 0.298
AlaVal: 4.838 ± 0.34
AlaTrp: 1.013 ± 0.114
AlaTyr: 1.736 ± 0.173
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.704 ± 0.154
CysCys: 0.948 ± 0.159
CysAsp: 1.382 ± 0.158
CysGlu: 1.173 ± 0.13
CysPhe: 1.141 ± 0.146
CysGly: 1.623 ± 0.184
CysHis: 0.739 ± 0.123
CysIle: 1.318 ± 0.142
CysLys: 0.82 ± 0.118
CysLeu: 2.909 ± 0.223
CysMet: 0.836 ± 0.113
CysAsn: 1.286 ± 0.14
CysPro: 1.366 ± 0.16
CysGln: 1.061 ± 0.144
CysArg: 1.559 ± 0.178
CysSer: 1.607 ± 0.179
CysThr: 1.784 ± 0.18
CysVal: 1.881 ± 0.18
CysTrp: 0.37 ± 0.075
CysTyr: 1.254 ± 0.162
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.797 ± 0.201
AspCys: 0.723 ± 0.117
AspAsp: 3.809 ± 0.368
AspGlu: 4.099 ± 0.272
AspPhe: 2.089 ± 0.193
AspGly: 2.041 ± 0.203
AspHis: 1.077 ± 0.146
AspIle: 2.829 ± 0.252
AspLys: 1.479 ± 0.192
AspLeu: 4.934 ± 0.32
AspMet: 1.414 ± 0.161
AspAsn: 1.752 ± 0.178
AspPro: 2.636 ± 0.224
AspGln: 1.302 ± 0.142
AspArg: 2.957 ± 0.216
AspSer: 3.52 ± 0.269
AspThr: 2.909 ± 0.219
AspVal: 2.99 ± 0.246
AspTrp: 0.659 ± 0.122
AspTyr: 1.881 ± 0.166
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.002 ± 0.294
GluCys: 1.35 ± 0.155
GluAsp: 3.263 ± 0.263
GluGlu: 4.227 ± 0.366
GluPhe: 1.495 ± 0.17
GluGly: 2.138 ± 0.216
GluHis: 1.864 ± 0.2
GluIle: 2.331 ± 0.182
GluLys: 2.25 ± 0.213
GluLeu: 4.95 ± 0.305
GluMet: 1.35 ± 0.137
GluAsn: 2.475 ± 0.197
GluPro: 2.459 ± 0.186
GluGln: 1.784 ± 0.202
GluArg: 3.375 ± 0.311
GluSer: 3.584 ± 0.255
GluThr: 3.986 ± 0.315
GluVal: 3.391 ± 0.27
GluTrp: 0.514 ± 0.087
GluTyr: 1.559 ± 0.15
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.089 ± 0.164
PheCys: 1.366 ± 0.179
PheAsp: 1.848 ± 0.169
PheGlu: 1.977 ± 0.168
PhePhe: 1.8 ± 0.198
PheGly: 2.17 ± 0.178
PheHis: 1.109 ± 0.152
PheIle: 2.459 ± 0.209
PheLys: 1.35 ± 0.136
PheLeu: 4.179 ± 0.236
PheMet: 1.318 ± 0.154
PheAsn: 1.832 ± 0.131
PhePro: 1.993 ± 0.178
PheGln: 1.463 ± 0.173
PheArg: 2.491 ± 0.202
PheSer: 2.877 ± 0.237
PheThr: 3.022 ± 0.226
PheVal: 3.215 ± 0.243
PheTrp: 0.804 ± 0.128
PheTyr: 1.848 ± 0.189
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.556 ± 0.207
GlyCys: 1.077 ± 0.147
GlyAsp: 2.315 ± 0.217
GlyGlu: 2.379 ± 0.22
GlyPhe: 1.929 ± 0.171
GlyGly: 2.99 ± 0.305
GlyHis: 1.045 ± 0.145
GlyIle: 2.54 ± 0.184
GlyLys: 1.672 ± 0.175
GlyLeu: 4.597 ± 0.265
GlyMet: 0.9 ± 0.143
GlyAsn: 2.057 ± 0.204
GlyPro: 2.025 ± 0.183
GlyGln: 1.881 ± 0.164
GlyArg: 2.893 ± 0.292
GlySer: 3.488 ± 0.233
GlyThr: 3.504 ± 0.29
GlyVal: 3.375 ± 0.284
GlyTrp: 0.675 ± 0.09
GlyTyr: 1.784 ± 0.149
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.298 ± 0.202
HisCys: 0.643 ± 0.103
HisAsp: 1.623 ± 0.147
HisGlu: 1.431 ± 0.152
HisPhe: 1.077 ± 0.121
HisGly: 1.607 ± 0.137
HisHis: 1.527 ± 0.245
HisIle: 1.334 ± 0.177
HisLys: 0.964 ± 0.126
HisLeu: 2.781 ± 0.291
HisMet: 0.788 ± 0.113
HisAsn: 1.463 ± 0.165
HisPro: 1.623 ± 0.176
HisGln: 1.189 ± 0.163
HisArg: 2.282 ± 0.173
HisSer: 2.009 ± 0.191
HisThr: 2.154 ± 0.206
HisVal: 2.298 ± 0.184
HisTrp: 0.321 ± 0.06
HisTyr: 0.836 ± 0.125
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.861 ± 0.262
IleCys: 1.72 ± 0.179
IleAsp: 2.122 ± 0.17
IleGlu: 1.913 ± 0.159
IlePhe: 2.459 ± 0.199
IleGly: 2.138 ± 0.232
IleHis: 1.334 ± 0.136
IleIle: 3.488 ± 0.265
IleLys: 2.154 ± 0.213
IleLeu: 4.983 ± 0.357
IleMet: 1.447 ± 0.148
IleAsn: 2.25 ± 0.23
IlePro: 3.182 ± 0.249
IleGln: 2.25 ± 0.205
IleArg: 2.523 ± 0.209
IleSer: 3.488 ± 0.236
IleThr: 3.665 ± 0.303
IleVal: 4.356 ± 0.282
IleTrp: 0.852 ± 0.11
IleTyr: 2.556 ± 0.18
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.427 ± 0.196
LysCys: 0.964 ± 0.109
LysAsp: 1.768 ± 0.174
LysGlu: 1.752 ± 0.22
LysPhe: 1.366 ± 0.159
LysGly: 1.607 ± 0.182
LysHis: 1.495 ± 0.16
LysIle: 2.057 ± 0.229
LysLys: 2.202 ± 0.212
LysLeu: 3.552 ± 0.259
LysMet: 1.109 ± 0.14
LysAsn: 2.202 ± 0.187
LysPro: 2.315 ± 0.279
LysGln: 1.656 ± 0.161
LysArg: 3.118 ± 0.233
LysSer: 2.556 ± 0.202
LysThr: 2.845 ± 0.242
LysVal: 1.945 ± 0.212
LysTrp: 0.37 ± 0.078
LysTyr: 1.431 ± 0.157
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.156 ± 0.293
LeuCys: 3.279 ± 0.224
LeuAsp: 3.938 ± 0.268
LeuGlu: 4.34 ± 0.302
LeuPhe: 4.902 ± 0.3
LeuGly: 4.661 ± 0.299
LeuHis: 3.134 ± 0.248
LeuIle: 5.272 ± 0.302
LeuLys: 4.131 ± 0.276
LeuLeu: 10.576 ± 0.487
LeuMet: 3.134 ± 0.246
LeuAsn: 3.825 ± 0.261
LeuPro: 5.24 ± 0.32
LeuGln: 3.681 ± 0.299
LeuArg: 6.686 ± 0.317
LeuSer: 7.538 ± 0.426
LeuThr: 7.056 ± 0.304
LeuVal: 6.06 ± 0.336
LeuTrp: 1.479 ± 0.186
LeuTyr: 3.713 ± 0.25
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.057 ± 0.223
MetCys: 0.675 ± 0.13
MetAsp: 1.109 ± 0.13
MetGlu: 1.463 ± 0.144
MetPhe: 1.238 ± 0.154
MetGly: 0.948 ± 0.125
MetHis: 0.723 ± 0.115
MetIle: 1.35 ± 0.132
MetLys: 0.948 ± 0.105
MetLeu: 2.781 ± 0.235
MetMet: 0.98 ± 0.138
MetAsn: 1.189 ± 0.143
MetPro: 1.077 ± 0.119
MetGln: 1.013 ± 0.129
MetArg: 1.543 ± 0.187
MetSer: 2.106 ± 0.174
MetThr: 1.543 ± 0.127
MetVal: 1.559 ± 0.172
MetTrp: 0.418 ± 0.082
MetTyr: 1.173 ± 0.148
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.877 ± 0.202
AsnCys: 0.9 ± 0.113
AsnAsp: 1.897 ± 0.157
AsnGlu: 2.186 ± 0.201
AsnPhe: 1.334 ± 0.184
AsnGly: 1.977 ± 0.161
AsnHis: 1.27 ± 0.135
AsnIle: 2.443 ± 0.227
AsnLys: 1.752 ± 0.175
AsnLeu: 4.05 ± 0.305
AsnMet: 1.045 ± 0.121
AsnAsn: 2.218 ± 0.205
AsnPro: 1.993 ± 0.182
AsnGln: 1.672 ± 0.158
AsnArg: 2.395 ± 0.209
AsnSer: 3.6 ± 0.289
AsnThr: 3.616 ± 0.327
AsnVal: 3.665 ± 0.273
AsnTrp: 0.418 ± 0.086
AsnTyr: 1.543 ± 0.159
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.115 ± 0.357
ProCys: 1.35 ± 0.143
ProAsp: 2.813 ± 0.255
ProGlu: 2.845 ± 0.202
ProPhe: 2.057 ± 0.171
ProGly: 2.315 ± 0.22
ProHis: 1.688 ± 0.185
ProIle: 2.411 ± 0.169
ProLys: 2.138 ± 0.224
ProLeu: 4.211 ± 0.261
ProMet: 1.173 ± 0.151
ProAsn: 1.688 ± 0.173
ProPro: 5.609 ± 0.502
ProGln: 2.122 ± 0.234
ProArg: 3.713 ± 0.36
ProSer: 4.854 ± 0.362
ProThr: 3.874 ± 0.288
ProVal: 3.954 ± 0.231
ProTrp: 0.627 ± 0.104
ProTyr: 1.704 ± 0.188
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.395 ± 0.233
GlnCys: 0.997 ± 0.12
GlnAsp: 1.623 ± 0.204
GlnGlu: 2.331 ± 0.217
GlnPhe: 1.366 ± 0.143
GlnGly: 1.398 ± 0.157
GlnHis: 1.302 ± 0.134
GlnIle: 1.897 ± 0.192
GlnLys: 2.122 ± 0.179
GlnLeu: 4.002 ± 0.284
GlnMet: 0.9 ± 0.118
GlnAsn: 1.929 ± 0.209
GlnPro: 2.057 ± 0.204
GlnGln: 2.588 ± 0.329
GlnArg: 2.813 ± 0.189
GlnSer: 2.347 ± 0.201
GlnThr: 2.765 ± 0.227
GlnVal: 2.331 ± 0.191
GlnTrp: 0.321 ± 0.07
GlnTyr: 1.447 ± 0.17
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.616 ± 0.273
ArgCys: 1.752 ± 0.159
ArgAsp: 3.134 ± 0.276
ArgGlu: 3.215 ± 0.243
ArgPhe: 2.363 ± 0.234
ArgGly: 3.038 ± 0.236
ArgHis: 2.7 ± 0.27
ArgIle: 2.797 ± 0.227
ArgLys: 2.556 ± 0.192
ArgLeu: 6.863 ± 0.32
ArgMet: 1.222 ± 0.138
ArgAsn: 2.877 ± 0.233
ArgPro: 3.343 ± 0.348
ArgGln: 3.199 ± 0.239
ArgArg: 5.674 ± 0.485
ArgSer: 4.227 ± 0.279
ArgThr: 3.568 ± 0.274
ArgVal: 3.793 ± 0.322
ArgTrp: 1.077 ± 0.15
ArgTyr: 2.781 ± 0.194
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.934 ± 0.332
SerCys: 1.736 ± 0.162
SerAsp: 4.002 ± 0.245
SerGlu: 4.275 ± 0.228
SerPhe: 3.134 ± 0.191
SerGly: 4.388 ± 0.285
SerHis: 2.154 ± 0.181
SerIle: 3.118 ± 0.24
SerLys: 2.652 ± 0.232
SerLeu: 7.41 ± 0.334
SerMet: 1.784 ± 0.18
SerAsn: 2.909 ± 0.268
SerPro: 4.709 ± 0.339
SerGln: 2.765 ± 0.194
SerArg: 4.918 ± 0.311
SerSer: 9.66 ± 0.694
SerThr: 6.301 ± 0.671
SerVal: 5.256 ± 0.367
SerTrp: 0.98 ± 0.134
SerTyr: 2.395 ± 0.198
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.806 ± 0.336
ThrCys: 2.138 ± 0.187
ThrAsp: 2.765 ± 0.216
ThrGlu: 3.52 ± 0.295
ThrPhe: 3.086 ± 0.208
ThrGly: 2.925 ± 0.201
ThrHis: 1.913 ± 0.171
ThrIle: 3.584 ± 0.298
ThrLys: 2.893 ± 0.23
ThrLeu: 6.59 ± 0.392
ThrMet: 1.623 ± 0.159
ThrAsn: 2.748 ± 0.297
ThrPro: 4.452 ± 0.339
ThrGln: 2.668 ± 0.216
ThrArg: 3.681 ± 0.246
ThrSer: 7.136 ± 0.672
ThrThr: 7.876 ± 0.927
ThrVal: 6.092 ± 0.369
ThrTrp: 0.997 ± 0.131
ThrTyr: 2.395 ± 0.162
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.581 ± 0.284
ValCys: 1.881 ± 0.186
ValAsp: 2.99 ± 0.237
ValGlu: 3.006 ± 0.208
ValPhe: 3.375 ± 0.265
ValGly: 2.588 ± 0.209
ValHis: 1.913 ± 0.19
ValIle: 4.034 ± 0.281
ValLys: 2.604 ± 0.252
ValLeu: 7.088 ± 0.438
ValMet: 1.929 ± 0.18
ValAsn: 3.263 ± 0.271
ValPro: 3.52 ± 0.21
ValGln: 2.347 ± 0.205
ValArg: 4.066 ± 0.251
ValSer: 5.915 ± 0.347
ValThr: 5.577 ± 0.315
ValVal: 4.934 ± 0.277
ValTrp: 1.045 ± 0.153
ValTyr: 2.957 ± 0.22
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.579 ± 0.109
TrpCys: 0.498 ± 0.097
TrpAsp: 0.643 ± 0.094
TrpGlu: 0.707 ± 0.109
TrpPhe: 0.627 ± 0.084
TrpGly: 0.45 ± 0.09
TrpHis: 0.466 ± 0.088
TrpIle: 0.884 ± 0.119
TrpLys: 0.53 ± 0.104
TrpLeu: 1.656 ± 0.171
TrpMet: 0.466 ± 0.098
TrpAsn: 0.595 ± 0.088
TrpPro: 0.788 ± 0.119
TrpGln: 0.563 ± 0.088
TrpArg: 0.964 ± 0.188
TrpSer: 1.029 ± 0.104
TrpThr: 0.739 ± 0.108
TrpVal: 0.611 ± 0.084
TrpTrp: 0.338 ± 0.086
TrpTyr: 0.723 ± 0.119
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.17 ± 0.173
TyrCys: 0.964 ± 0.122
TyrAsp: 2.009 ± 0.177
TyrGlu: 2.122 ± 0.18
TyrPhe: 1.704 ± 0.171
TyrGly: 1.816 ± 0.173
TyrHis: 1.125 ± 0.165
TyrIle: 2.073 ± 0.207
TyrLys: 1.205 ± 0.158
TyrLeu: 3.97 ± 0.223
TyrMet: 0.884 ± 0.106
TyrAsn: 1.672 ± 0.17
TyrPro: 1.254 ± 0.132
TyrGln: 1.27 ± 0.162
TyrArg: 2.861 ± 0.232
TyrSer: 2.459 ± 0.235
TyrThr: 2.507 ± 0.211
TyrVal: 3.07 ± 0.231
TyrTrp: 0.579 ± 0.093
TyrTyr: 1.495 ± 0.16
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 173 proteins (62217 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
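One way to read this note is sketched below: whole proteins are resampled with replacement 100 times, the frequency of a given dipeptide is recomputed for each resample, and the spread of those values is reported as the error. This is only an illustration under stated assumptions. The function names are hypothetical, one-letter sequences are used for brevity, and taking the sample standard deviation as the error is a guess, since the page does not say which statistic was used.

```python
import random
import statistics


def count_pair(seq, pair):
    """Count overlapping occurrences of a two-letter `pair` in `seq`."""
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == pair)


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of a per-mille dipeptide frequency.

    Resamples whole proteins (not residues) with replacement, so that
    the variation between proteins is what drives the estimate.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        sample = [rng.choice(proteins) for _ in proteins]
        total = sum(max(len(s) - 1, 0) for s in sample)
        hits = sum(count_pair(s, pair) for s in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the reported error is the sample standard deviation.
    return statistics.stdev(estimates)


# Hypothetical toy proteome, not taken from the database.
toy_proteins = ["MALLS", "LLA", "MKTAY", "LLLL"]
error_ll = bootstrap_error(toy_proteins, "LL")
```

Resampling at the protein level rather than the residue level keeps each sequence intact, so dipeptides that cluster in a few proteins yield a larger error.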

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski