Amino acid dipeptide frequency for Cydia pomonella granulosis virus (isolate Mexico/1963) (CpGV) (Cydia pomonella granulovirus)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
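As an illustration of the counting described above, here is a minimal Python sketch. It assumes one-letter sequences, counts overlapping pairs within each protein, and normalises by the total number of pairs. The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the exact normalisation used by the database is not stated on this page.

```python
from collections import Counter
from itertools import product

# The 20 standard residues (one-letter codes); anything else is treated as Xaa ('X').
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown letters to 'X' (Xaa).
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two different proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    alphabet = STANDARD + "X"
    # Assumption: normalise by the total number of observed pairs.
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


# Hypothetical toy sequences, not taken from the CpGV proteome.
toy = ["MLLKV", "LLAX"]
freqs = dipeptide_frequencies(toy)

# Uniform expectation over the 400 standard dipeptides, in per mille.
uniform = 1000.0 / 400
print(f"LL: {freqs['LL']:.1f} per mille, uniform expectation: {uniform} per mille")
```

With real data, `sequences` would be the protein sequences of the proteome, for example read from a FASTA file.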

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
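The unit conversion and the comparison with the uniform expectation can be sketched in a few lines of Python. The three values are copied from the table. The helper names are hypothetical, and treating 2.5‰ as the exact threshold for over- and underrepresentation is an assumption based on the description above, not a documented rule of the database.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille (0.25%).
EXPECTED_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def representation(value, expected=EXPECTED_PER_MILLE):
    """Classify a per mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values copied from the table (per mille).
examples = {"LeuLeu": 8.88, "ValVal": 7.662, "TrpTrp": 0.244}

for name, value in examples.items():
    print(name, per_mille_to_fraction(value), representation(value))
```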

For more information, see the sequence space article on Wikipedia.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Ala | 3.005 ± 0.356 | 1.029 ± 0.152 | 3.168 ± 0.28 | 2.789 ± 0.333 | 2.139 ± 0.22 | 2.031 ± 0.211 | 1.435 ± 0.167 | 2.491 ± 0.277 | 2.22 ± 0.23 | 4.63 ± 0.386 | 1.056 ± 0.165 | 3.222 ± 0.28 | 1.841 ± 0.226 | 1.976 ± 0.202 | 2.464 ± 0.212 | 2.41 ± 0.228 | 3.168 ± 0.36 | 3.466 ± 0.291 | 0.379 ± 0.116 | 1.841 ± 0.243 | 0.0 ± 0.0 |
| Cys | 1.029 ± 0.183 | 0.569 ± 0.189 | 1.597 ± 0.238 | 1.516 ± 0.221 | 1.191 ± 0.194 | 1.76 ± 0.252 | 0.785 ± 0.159 | 1.191 ± 0.173 | 1.489 ± 0.235 | 2.328 ± 0.303 | 0.569 ± 0.1 | 1.516 ± 0.225 | 0.893 ± 0.137 | 0.758 ± 0.195 | 1.462 ± 0.238 | 1.597 ± 0.193 | 1.408 ± 0.176 | 2.545 ± 0.296 | 0.108 ± 0.051 | 1.083 ± 0.159 | 0.0 ± 0.0 |
| Asp | 3.168 ± 0.336 | 1.462 ± 0.213 | 5.794 ± 0.589 | 4.765 ± 0.456 | 2.085 ± 0.212 | 2.951 ± 0.293 | 1.273 ± 0.197 | 3.087 ± 0.244 | 4.034 ± 0.362 | 4.44 ± 0.347 | 1.435 ± 0.199 | 4.494 ± 0.395 | 1.949 ± 0.263 | 1.245 ± 0.168 | 3.005 ± 0.271 | 3.33 ± 0.283 | 4.521 ± 0.33 | 4.467 ± 0.327 | 0.623 ± 0.121 | 3.222 ± 0.294 | 0.0 ± 0.0 |
| Glu | 2.762 ± 0.332 | 1.679 ± 0.269 | 3.79 ± 0.305 | 5.036 ± 0.456 | 2.518 ± 0.245 | 3.141 ± 0.35 | 1.624 ± 0.225 | 2.789 ± 0.258 | 3.818 ± 0.412 | 5.442 ± 0.415 | 1.868 ± 0.227 | 3.032 ± 0.302 | 2.328 ± 0.379 | 2.545 ± 0.329 | 4.007 ± 0.368 | 3.547 ± 0.351 | 2.924 ± 0.223 | 3.818 ± 0.362 | 0.866 ± 0.167 | 2.951 ± 0.332 | 0.0 ± 0.0 |
| Phe | 1.895 ± 0.233 | 1.164 ± 0.175 | 3.574 ± 0.356 | 2.68 ± 0.268 | 1.841 ± 0.268 | 1.597 ± 0.184 | 0.758 ± 0.146 | 3.005 ± 0.309 | 2.978 ± 0.357 | 3.953 ± 0.407 | 1.218 ± 0.181 | 3.303 ± 0.319 | 1.083 ± 0.189 | 1.245 ± 0.161 | 1.922 ± 0.217 | 2.789 ± 0.281 | 2.139 ± 0.273 | 5.063 ± 0.397 | 0.298 ± 0.08 | 1.895 ± 0.231 | 0.0 ± 0.0 |
| Gly | 3.059 ± 0.281 | 0.839 ± 0.167 | 3.736 ± 0.381 | 3.249 ± 0.319 | 1.679 ± 0.246 | 3.926 ± 0.406 | 0.731 ± 0.171 | 1.679 ± 0.201 | 2.031 ± 0.253 | 3.438 ± 0.289 | 1.056 ± 0.178 | 1.949 ± 0.25 | 0.812 ± 0.158 | 1.327 ± 0.212 | 2.41 ± 0.297 | 2.789 ± 0.265 | 2.247 ± 0.267 | 5.171 ± 0.466 | 0.46 ± 0.11 | 1.652 ± 0.176 | 0.0 ± 0.0 |
| His | 1.381 ± 0.217 | 0.704 ± 0.166 | 1.489 ± 0.225 | 1.245 ± 0.174 | 1.245 ± 0.195 | 0.785 ± 0.14 | 1.489 ± 0.31 | 1.76 ± 0.188 | 1.841 ± 0.238 | 2.41 ± 0.26 | 0.487 ± 0.125 | 2.031 ± 0.203 | 1.327 ± 0.226 | 1.002 ± 0.14 | 1.083 ± 0.202 | 1.516 ± 0.194 | 1.868 ± 0.294 | 1.489 ± 0.221 | 0.298 ± 0.082 | 1.191 ± 0.225 | 0.0 ± 0.0 |
| Ile | 1.895 ± 0.199 | 1.11 ± 0.142 | 3.574 ± 0.253 | 3.276 ± 0.319 | 2.085 ± 0.271 | 1.976 ± 0.191 | 1.3 ± 0.179 | 3.384 ± 0.499 | 4.711 ± 0.357 | 4.44 ± 0.398 | 1.787 ± 0.258 | 5.171 ± 0.408 | 1.76 ± 0.247 | 1.652 ± 0.234 | 2.166 ± 0.295 | 2.978 ± 0.248 | 3.384 ± 0.266 | 5.307 ± 0.368 | 0.433 ± 0.108 | 2.031 ± 0.226 | 0.0 ± 0.0 |
| Lys | 1.679 ± 0.216 | 1.516 ± 0.228 | 2.139 ± 0.24 | 3.628 ± 0.356 | 3.032 ± 0.286 | 2.139 ± 0.308 | 1.814 ± 0.219 | 3.33 ± 0.326 | 5.009 ± 0.49 | 5.902 ± 0.406 | 1.814 ± 0.213 | 3.926 ± 0.41 | 1.597 ± 0.198 | 2.978 ± 0.334 | 4.684 ± 0.346 | 4.224 ± 0.299 | 3.736 ± 0.381 | 4.711 ± 0.342 | 0.785 ± 0.145 | 3.493 ± 0.312 | 0.0 ± 0.0 |
| Leu | 4.061 ± 0.304 | 2.437 ± 0.217 | 4.792 ± 0.335 | 4.873 ± 0.402 | 4.792 ± 0.33 | 3.059 ± 0.282 | 2.545 ± 0.337 | 5.604 ± 0.381 | 6.308 ± 0.393 | 8.88 ± 0.578 | 2.816 ± 0.251 | 6.606 ± 0.395 | 3.168 ± 0.26 | 3.52 ± 0.364 | 5.225 ± 0.388 | 5.415 ± 0.358 | 5.09 ± 0.327 | 6.579 ± 0.383 | 0.975 ± 0.173 | 4.521 ± 0.365 | 0.0 ± 0.0 |
| Met | 1.137 ± 0.159 | 0.758 ± 0.15 | 1.76 ± 0.221 | 1.787 ± 0.217 | 1.327 ± 0.182 | 1.191 ± 0.167 | 0.596 ± 0.135 | 1.489 ± 0.197 | 1.435 ± 0.191 | 2.843 ± 0.27 | 0.866 ± 0.159 | 1.489 ± 0.193 | 0.433 ± 0.094 | 0.65 ± 0.124 | 1.218 ± 0.178 | 2.193 ± 0.247 | 1.489 ± 0.165 | 2.328 ± 0.206 | 0.271 ± 0.094 | 1.57 ± 0.194 | 0.0 ± 0.0 |
| Asn | 2.951 ± 0.252 | 1.462 ± 0.18 | 4.251 ± 0.257 | 4.115 ± 0.421 | 2.951 ± 0.275 | 3.682 ± 0.317 | 1.679 ± 0.211 | 3.899 ± 0.318 | 4.603 ± 0.379 | 5.09 ± 0.366 | 1.516 ± 0.177 | 6.606 ± 0.532 | 1.976 ± 0.219 | 2.031 ± 0.24 | 3.438 ± 0.302 | 4.684 ± 0.366 | 5.009 ± 0.364 | 4.955 ± 0.428 | 0.487 ± 0.106 | 3.222 ± 0.344 | 0.0 ± 0.0 |
| Pro | 1.706 ± 0.269 | 0.677 ± 0.154 | 1.733 ± 0.202 | 1.516 ± 0.219 | 1.516 ± 0.187 | 1.002 ± 0.154 | 1.191 ± 0.209 | 1.949 ± 0.186 | 1.516 ± 0.206 | 3.411 ± 0.337 | 0.731 ± 0.129 | 2.491 ± 0.218 | 3.114 ± 0.639 | 1.408 ± 0.217 | 1.57 ± 0.294 | 2.464 ± 0.275 | 2.87 ± 0.358 | 2.707 ± 0.312 | 0.271 ± 0.099 | 1.543 ± 0.202 | 0.0 ± 0.0 |
| Gln | 1.218 ± 0.187 | 1.273 ± 0.205 | 1.273 ± 0.213 | 2.193 ± 0.267 | 2.004 ± 0.236 | 0.623 ± 0.126 | 1.137 ± 0.175 | 2.274 ± 0.219 | 1.706 ± 0.204 | 4.657 ± 0.428 | 0.975 ± 0.166 | 1.976 ± 0.217 | 1.841 ± 0.321 | 2.301 ± 0.398 | 1.895 ± 0.237 | 2.355 ± 0.247 | 1.76 ± 0.215 | 1.949 ± 0.227 | 0.487 ± 0.103 | 1.787 ± 0.167 | 0.0 ± 0.0 |
| Arg | 2.383 ± 0.243 | 1.679 ± 0.221 | 3.032 ± 0.33 | 3.493 ± 0.398 | 2.193 ± 0.247 | 2.383 ± 0.293 | 1.381 ± 0.171 | 3.087 ± 0.272 | 2.437 ± 0.275 | 5.821 ± 0.466 | 1.327 ± 0.173 | 3.168 ± 0.293 | 1.381 ± 0.2 | 2.22 ± 0.295 | 4.305 ± 0.43 | 3.52 ± 0.609 | 2.22 ± 0.276 | 5.117 ± 0.511 | 0.569 ± 0.114 | 2.004 ± 0.217 | 0.0 ± 0.0 |
| Ser | 3.438 ± 0.353 | 1.706 ± 0.237 | 3.709 ± 0.341 | 2.707 ± 0.28 | 2.951 ± 0.279 | 3.845 ± 0.344 | 1.408 ± 0.178 | 3.222 ± 0.295 | 3.709 ± 0.361 | 6.119 ± 0.475 | 1.733 ± 0.205 | 3.899 ± 0.36 | 2.112 ± 0.258 | 1.841 ± 0.254 | 3.276 ± 0.407 | 5.009 ± 0.604 | 4.549 ± 0.371 | 5.659 ± 0.42 | 0.487 ± 0.097 | 1.841 ± 0.207 | 0.0 ± 0.0 |
| Thr | 3.141 ± 0.259 | 1.245 ± 0.209 | 2.87 ± 0.277 | 2.707 ± 0.26 | 2.355 ± 0.265 | 2.247 ± 0.272 | 2.031 ± 0.257 | 3.899 ± 0.337 | 3.357 ± 0.317 | 5.604 ± 0.402 | 1.652 ± 0.235 | 4.494 ± 0.402 | 3.574 ± 0.384 | 2.762 ± 0.331 | 2.735 ± 0.25 | 3.818 ± 0.32 | 5.442 ± 0.536 | 4.928 ± 0.341 | 0.514 ± 0.112 | 2.166 ± 0.211 | 0.0 ± 0.0 |
| Val | 4.684 ± 0.271 | 2.518 ± 0.255 | 6.038 ± 0.475 | 5.415 ± 0.446 | 3.953 ± 0.334 | 3.601 ± 0.397 | 2.437 ± 0.3 | 3.709 ± 0.364 | 4.494 ± 0.321 | 7.229 ± 0.532 | 2.328 ± 0.255 | 4.738 ± 0.401 | 2.355 ± 0.278 | 2.491 ± 0.227 | 4.115 ± 0.321 | 5.117 ± 0.344 | 4.034 ± 0.391 | 7.662 ± 0.562 | 0.839 ± 0.185 | 4.359 ± 0.389 | 0.0 ± 0.0 |
| Trp | 0.569 ± 0.114 | 0.244 ± 0.08 | 0.704 ± 0.136 | 0.541 ± 0.137 | 0.487 ± 0.095 | 0.514 ± 0.12 | 0.19 ± 0.078 | 0.271 ± 0.085 | 0.623 ± 0.141 | 0.866 ± 0.155 | 0.244 ± 0.081 | 0.623 ± 0.11 | 0.487 ± 0.1 | 0.298 ± 0.086 | 0.704 ± 0.147 | 0.569 ± 0.131 | 0.541 ± 0.099 | 0.65 ± 0.147 | 0.244 ± 0.091 | 0.569 ± 0.128 | 0.0 ± 0.0 |
| Tyr | 1.597 ± 0.209 | 1.327 ± 0.196 | 2.328 ± 0.292 | 2.951 ± 0.25 | 2.058 ± 0.251 | 1.787 ± 0.24 | 0.866 ± 0.161 | 2.301 ± 0.271 | 3.818 ± 0.313 | 3.763 ± 0.307 | 1.327 ± 0.193 | 3.872 ± 0.313 | 1.354 ± 0.195 | 1.435 ± 0.175 | 2.031 ± 0.267 | 2.897 ± 0.257 | 3.141 ± 0.33 | 3.547 ± 0.278 | 0.541 ± 0.113 | 2.924 ± 0.301 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |
Statistics based on 143 proteins (36936 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
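The note above does not spell out the procedure. One common reading of a protein-level bootstrap is to resample whole proteins with replacement, recompute the frequency for each replicate, and report the standard deviation across replicates. The Python sketch below follows that reading. It is an assumption rather than the database's documented method, and the function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(sequences) whole proteins with replacement.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences, not taken from the CpGV proteome.
toy = ["MLLKV", "LLAX", "AAAA"]
mean_ll, err_ll = bootstrap_error(toy, "LL")
print(f"LL: {mean_ll:.1f} ± {err_ll:.1f} per mille")
```

Resampling whole proteins rather than individual residue pairs keeps the pairs from one protein together, so the spread reflects protein-to-protein variation.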

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski