Amino acid dipepetide frequency for Diatraea saccharalis granulovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.737AlaAla: 1.737 ± 0.245
0.804AlaCys: 0.804 ± 0.167
2.606AlaAsp: 2.606 ± 0.385
1.673AlaGlu: 1.673 ± 0.238
1.416AlaPhe: 1.416 ± 0.195
1.319AlaGly: 1.319 ± 0.207
0.772AlaHis: 0.772 ± 0.16
2.252AlaIle: 2.252 ± 0.293
1.512AlaLys: 1.512 ± 0.24
3.764AlaLeu: 3.764 ± 0.412
0.611AlaMet: 0.611 ± 0.144
1.866AlaAsn: 1.866 ± 0.284
0.965AlaPro: 0.965 ± 0.159
0.869AlaGln: 0.869 ± 0.151
1.834AlaArg: 1.834 ± 0.243
1.866AlaSer: 1.866 ± 0.223
2.091AlaThr: 2.091 ± 0.316
2.188AlaVal: 2.188 ± 0.299
0.193AlaTrp: 0.193 ± 0.079
1.673AlaTyr: 1.673 ± 0.217
0.0AlaXaa: 0.0 ± 0.0
Cys
0.869CysAla: 0.869 ± 0.142
0.643CysCys: 0.643 ± 0.137
1.577CysAsp: 1.577 ± 0.208
1.577CysGlu: 1.577 ± 0.233
1.383CysPhe: 1.383 ± 0.205
1.094CysGly: 1.094 ± 0.168
0.611CysHis: 0.611 ± 0.169
1.577CysIle: 1.577 ± 0.234
1.705CysLys: 1.705 ± 0.253
2.413CysLeu: 2.413 ± 0.329
0.547CysMet: 0.547 ± 0.127
1.963CysAsn: 1.963 ± 0.246
0.772CysPro: 0.772 ± 0.172
0.804CysGln: 0.804 ± 0.173
1.158CysArg: 1.158 ± 0.268
1.319CysSer: 1.319 ± 0.221
1.416CysThr: 1.416 ± 0.224
2.349CysVal: 2.349 ± 0.276
0.193CysTrp: 0.193 ± 0.075
1.158CysTyr: 1.158 ± 0.187
0.0CysXaa: 0.0 ± 0.0
Asp
1.705AspAla: 1.705 ± 0.223
1.287AspCys: 1.287 ± 0.209
5.277AspAsp: 5.277 ± 0.701
5.244AspGlu: 5.244 ± 0.483
2.735AspPhe: 2.735 ± 0.327
2.735AspGly: 2.735 ± 0.313
0.965AspHis: 0.965 ± 0.16
5.083AspIle: 5.083 ± 0.386
4.215AspLys: 4.215 ± 0.422
4.858AspLeu: 4.858 ± 0.393
1.705AspMet: 1.705 ± 0.239
4.762AspAsn: 4.762 ± 0.417
2.059AspPro: 2.059 ± 0.267
1.319AspGln: 1.319 ± 0.253
2.252AspArg: 2.252 ± 0.263
3.378AspSer: 3.378 ± 0.321
3.829AspThr: 3.829 ± 0.315
4.215AspVal: 4.215 ± 0.393
0.708AspTrp: 0.708 ± 0.15
3.217AspTyr: 3.217 ± 0.369
0.0AspXaa: 0.0 ± 0.0
Glu
1.834GluAla: 1.834 ± 0.227
1.609GluCys: 1.609 ± 0.263
4.183GluAsp: 4.183 ± 0.447
7.271GluGlu: 7.271 ± 0.9
2.542GluPhe: 2.542 ± 0.316
2.735GluGly: 2.735 ± 0.356
1.609GluHis: 1.609 ± 0.249
4.376GluIle: 4.376 ± 0.411
6.306GluLys: 6.306 ± 0.583
5.244GluLeu: 5.244 ± 0.475
2.381GluMet: 2.381 ± 0.258
5.598GluAsn: 5.598 ± 0.472
1.448GluPro: 1.448 ± 0.305
1.898GluGln: 1.898 ± 0.223
3.346GluArg: 3.346 ± 0.308
3.089GluSer: 3.089 ± 0.353
3.539GluThr: 3.539 ± 0.326
3.507GluVal: 3.507 ± 0.264
0.933GluTrp: 0.933 ± 0.172
3.217GluTyr: 3.217 ± 0.336
0.0GluXaa: 0.0 ± 0.0
Phe
1.48PheAla: 1.48 ± 0.21
1.48PheCys: 1.48 ± 0.209
3.99PheAsp: 3.99 ± 0.369
3.571PheGlu: 3.571 ± 0.338
3.507PhePhe: 3.507 ± 0.481
1.737PheGly: 1.737 ± 0.251
0.933PheHis: 0.933 ± 0.188
5.18PheIle: 5.18 ± 0.35
4.311PheLys: 4.311 ± 0.407
5.502PheLeu: 5.502 ± 0.514
1.062PheMet: 1.062 ± 0.204
3.99PheAsn: 3.99 ± 0.315
1.577PhePro: 1.577 ± 0.214
1.062PheGln: 1.062 ± 0.201
1.705PheArg: 1.705 ± 0.258
3.089PheSer: 3.089 ± 0.325
2.574PheThr: 2.574 ± 0.332
4.537PheVal: 4.537 ± 0.433
0.45PheTrp: 0.45 ± 0.12
2.831PheTyr: 2.831 ± 0.322
0.032PheXaa: 0.032 ± 0.035
Gly
1.995GlyAla: 1.995 ± 0.229
0.676GlyCys: 0.676 ± 0.13
2.51GlyAsp: 2.51 ± 0.276
2.477GlyGlu: 2.477 ± 0.275
2.156GlyPhe: 2.156 ± 0.289
2.413GlyGly: 2.413 ± 0.315
0.611GlyHis: 0.611 ± 0.153
2.477GlyIle: 2.477 ± 0.259
2.51GlyLys: 2.51 ± 0.252
3.378GlyLeu: 3.378 ± 0.336
0.869GlyMet: 0.869 ± 0.172
2.059GlyAsn: 2.059 ± 0.282
0.772GlyPro: 0.772 ± 0.163
1.158GlyGln: 1.158 ± 0.179
1.544GlyArg: 1.544 ± 0.235
1.834GlySer: 1.834 ± 0.288
2.156GlyThr: 2.156 ± 0.306
3.539GlyVal: 3.539 ± 0.436
0.676GlyTrp: 0.676 ± 0.111
1.673GlyTyr: 1.673 ± 0.252
0.0GlyXaa: 0.0 ± 0.0
His
0.74HisAla: 0.74 ± 0.149
0.547HisCys: 0.547 ± 0.129
1.577HisAsp: 1.577 ± 0.213
0.965HisGlu: 0.965 ± 0.162
1.03HisPhe: 1.03 ± 0.148
0.579HisGly: 0.579 ± 0.159
0.418HisHis: 0.418 ± 0.112
1.609HisIle: 1.609 ± 0.212
1.673HisLys: 1.673 ± 0.276
1.995HisLeu: 1.995 ± 0.262
0.386HisMet: 0.386 ± 0.1
2.027HisAsn: 2.027 ± 0.247
0.772HisPro: 0.772 ± 0.156
0.547HisGln: 0.547 ± 0.145
0.643HisArg: 0.643 ± 0.112
0.997HisSer: 0.997 ± 0.177
1.93HisThr: 1.93 ± 0.29
1.512HisVal: 1.512 ± 0.207
0.129HisTrp: 0.129 ± 0.067
1.255HisTyr: 1.255 ± 0.203
0.0HisXaa: 0.0 ± 0.0
Ile
2.22IleAla: 2.22 ± 0.271
1.609IleCys: 1.609 ± 0.193
4.826IleAsp: 4.826 ± 0.411
5.18IleGlu: 5.18 ± 0.451
3.764IlePhe: 3.764 ± 0.377
2.027IleGly: 2.027 ± 0.262
1.48IleHis: 1.48 ± 0.199
5.695IleIle: 5.695 ± 0.537
6.242IleLys: 6.242 ± 0.483
6.531IleLeu: 6.531 ± 0.472
2.22IleMet: 2.22 ± 0.333
7.883IleAsn: 7.883 ± 0.514
2.284IlePro: 2.284 ± 0.239
1.995IleGln: 1.995 ± 0.244
2.606IleArg: 2.606 ± 0.301
3.7IleSer: 3.7 ± 0.377
4.923IleThr: 4.923 ± 0.441
4.73IleVal: 4.73 ± 0.453
0.483IleTrp: 0.483 ± 0.125
2.992IleTyr: 2.992 ± 0.301
0.0IleXaa: 0.0 ± 0.0
Lys
1.705LysAla: 1.705 ± 0.266
2.059LysCys: 2.059 ± 0.285
3.121LysAsp: 3.121 ± 0.318
5.952LysGlu: 5.952 ± 0.516
3.668LysPhe: 3.668 ± 0.305
2.445LysGly: 2.445 ± 0.267
1.93LysHis: 1.93 ± 0.281
5.63LysIle: 5.63 ± 0.475
8.687LysLys: 8.687 ± 0.839
7.046LysLeu: 7.046 ± 0.54
2.542LysMet: 2.542 ± 0.323
6.982LysAsn: 6.982 ± 0.565
1.512LysPro: 1.512 ± 0.23
2.252LysGln: 2.252 ± 0.303
5.083LysArg: 5.083 ± 0.45
4.15LysSer: 4.15 ± 0.511
4.408LysThr: 4.408 ± 0.451
4.086LysVal: 4.086 ± 0.35
0.869LysTrp: 0.869 ± 0.156
3.603LysTyr: 3.603 ± 0.357
0.064LysXaa: 0.064 ± 0.042
Leu
2.896LeuAla: 2.896 ± 0.297
2.317LeuCys: 2.317 ± 0.245
4.794LeuAsp: 4.794 ± 0.399
5.116LeuGlu: 5.116 ± 0.401
5.759LeuPhe: 5.759 ± 0.489
2.767LeuGly: 2.767 ± 0.283
2.188LeuHis: 2.188 ± 0.287
7.529LeuIle: 7.529 ± 0.48
8.172LeuLys: 8.172 ± 0.629
9.588LeuLeu: 9.588 ± 0.642
2.831LeuMet: 2.831 ± 0.346
7.85LeuAsn: 7.85 ± 0.529
2.606LeuPro: 2.606 ± 0.28
2.638LeuGln: 2.638 ± 0.308
4.311LeuArg: 4.311 ± 0.422
5.695LeuSer: 5.695 ± 0.425
4.955LeuThr: 4.955 ± 0.418
6.113LeuVal: 6.113 ± 0.479
1.448LeuTrp: 1.448 ± 0.225
5.823LeuTyr: 5.823 ± 0.479
0.0LeuXaa: 0.0 ± 0.0
Met
0.901MetAla: 0.901 ± 0.168
0.579MetCys: 0.579 ± 0.151
1.158MetAsp: 1.158 ± 0.195
1.77MetGlu: 1.77 ± 0.228
1.577MetPhe: 1.577 ± 0.273
1.126MetGly: 1.126 ± 0.195
0.354MetHis: 0.354 ± 0.104
1.866MetIle: 1.866 ± 0.275
2.574MetLys: 2.574 ± 0.318
3.185MetLeu: 3.185 ± 0.34
0.869MetMet: 0.869 ± 0.207
1.577MetAsn: 1.577 ± 0.235
0.579MetPro: 0.579 ± 0.15
0.515MetGln: 0.515 ± 0.147
1.351MetArg: 1.351 ± 0.2
2.381MetSer: 2.381 ± 0.313
1.03MetThr: 1.03 ± 0.169
1.802MetVal: 1.802 ± 0.238
0.418MetTrp: 0.418 ± 0.1
1.641MetTyr: 1.641 ± 0.227
0.0MetXaa: 0.0 ± 0.0
Asn
1.963AsnAla: 1.963 ± 0.259
1.448AsnCys: 1.448 ± 0.203
5.566AsnAsp: 5.566 ± 0.334
5.823AsnGlu: 5.823 ± 0.441
3.764AsnPhe: 3.764 ± 0.418
3.475AsnGly: 3.475 ± 0.308
1.544AsnHis: 1.544 ± 0.187
6.306AsnIle: 6.306 ± 0.435
5.952AsnLys: 5.952 ± 0.436
6.917AsnLeu: 6.917 ± 0.424
1.93AsnMet: 1.93 ± 0.246
8.301AsnAsn: 8.301 ± 0.854
2.863AsnPro: 2.863 ± 0.794
1.705AsnGln: 1.705 ± 0.246
3.571AsnArg: 3.571 ± 0.294
3.732AsnSer: 3.732 ± 0.305
4.955AsnThr: 4.955 ± 0.324
5.437AsnVal: 5.437 ± 0.425
0.483AsnTrp: 0.483 ± 0.103
3.41AsnTyr: 3.41 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
1.351ProAla: 1.351 ± 0.168
0.772ProCys: 0.772 ± 0.205
1.802ProAsp: 1.802 ± 0.24
1.609ProGlu: 1.609 ± 0.314
2.027ProPhe: 2.027 ± 0.236
1.126ProGly: 1.126 ± 0.223
0.933ProHis: 0.933 ± 0.14
2.284ProIle: 2.284 ± 0.263
1.416ProLys: 1.416 ± 0.229
3.217ProLeu: 3.217 ± 0.342
0.45ProMet: 0.45 ± 0.122
2.703ProAsn: 2.703 ± 0.628
2.96ProPro: 2.96 ± 0.914
1.094ProGln: 1.094 ± 0.194
1.416ProArg: 1.416 ± 0.17
1.866ProSer: 1.866 ± 0.262
2.123ProThr: 2.123 ± 0.318
1.737ProVal: 1.737 ± 0.255
0.193ProTrp: 0.193 ± 0.079
1.641ProTyr: 1.641 ± 0.24
0.0ProXaa: 0.0 ± 0.0
Gln
0.772GlnAla: 0.772 ± 0.163
0.74GlnCys: 0.74 ± 0.141
1.351GlnAsp: 1.351 ± 0.212
1.512GlnGlu: 1.512 ± 0.215
1.802GlnPhe: 1.802 ± 0.269
0.611GlnGly: 0.611 ± 0.125
0.901GlnHis: 0.901 ± 0.182
2.188GlnIle: 2.188 ± 0.318
1.963GlnLys: 1.963 ± 0.27
3.217GlnLeu: 3.217 ± 0.349
0.997GlnMet: 0.997 ± 0.17
1.898GlnAsn: 1.898 ± 0.254
0.997GlnPro: 0.997 ± 0.204
1.448GlnGln: 1.448 ± 0.229
1.383GlnArg: 1.383 ± 0.241
1.544GlnSer: 1.544 ± 0.254
1.512GlnThr: 1.512 ± 0.205
1.19GlnVal: 1.19 ± 0.194
0.225GlnTrp: 0.225 ± 0.081
1.319GlnTyr: 1.319 ± 0.203
0.0GlnXaa: 0.0 ± 0.0
Arg
1.673ArgAla: 1.673 ± 0.202
1.223ArgCys: 1.223 ± 0.169
2.928ArgAsp: 2.928 ± 0.301
3.346ArgGlu: 3.346 ± 0.371
2.22ArgPhe: 2.22 ± 0.325
1.963ArgGly: 1.963 ± 0.268
1.094ArgHis: 1.094 ± 0.173
3.089ArgIle: 3.089 ± 0.335
3.185ArgLys: 3.185 ± 0.402
5.277ArgLeu: 5.277 ± 0.407
1.351ArgMet: 1.351 ± 0.213
2.445ArgAsn: 2.445 ± 0.277
1.673ArgPro: 1.673 ± 0.244
1.673ArgGln: 1.673 ± 0.194
2.863ArgArg: 2.863 ± 0.559
2.445ArgSer: 2.445 ± 0.447
1.995ArgThr: 1.995 ± 0.296
3.314ArgVal: 3.314 ± 0.289
0.74ArgTrp: 0.74 ± 0.171
2.606ArgTyr: 2.606 ± 0.337
0.064ArgXaa: 0.064 ± 0.045
Ser
1.609SerAla: 1.609 ± 0.269
1.448SerCys: 1.448 ± 0.236
3.089SerAsp: 3.089 ± 0.274
2.799SerGlu: 2.799 ± 0.384
3.925SerPhe: 3.925 ± 0.336
2.445SerGly: 2.445 ± 0.298
0.997SerHis: 0.997 ± 0.188
3.99SerIle: 3.99 ± 0.402
3.732SerLys: 3.732 ± 0.416
5.47SerLeu: 5.47 ± 0.457
1.641SerMet: 1.641 ± 0.235
3.571SerAsn: 3.571 ± 0.382
1.898SerPro: 1.898 ± 0.242
1.834SerGln: 1.834 ± 0.264
2.381SerArg: 2.381 ± 0.362
4.311SerSer: 4.311 ± 0.528
3.668SerThr: 3.668 ± 0.303
3.957SerVal: 3.957 ± 0.384
0.804SerTrp: 0.804 ± 0.168
2.091SerTyr: 2.091 ± 0.264
0.032SerXaa: 0.032 ± 0.038
Thr
2.188ThrAla: 2.188 ± 0.301
1.255ThrCys: 1.255 ± 0.226
3.057ThrAsp: 3.057 ± 0.268
2.413ThrGlu: 2.413 ± 0.315
3.153ThrPhe: 3.153 ± 0.365
1.963ThrGly: 1.963 ± 0.291
1.48ThrHis: 1.48 ± 0.268
4.569ThrIle: 4.569 ± 0.414
4.086ThrLys: 4.086 ± 0.398
6.306ThrLeu: 6.306 ± 0.481
1.255ThrMet: 1.255 ± 0.148
4.183ThrAsn: 4.183 ± 0.422
3.024ThrPro: 3.024 ± 0.279
2.059ThrGln: 2.059 ± 0.253
3.41ThrArg: 3.41 ± 0.323
3.057ThrSer: 3.057 ± 0.27
4.343ThrThr: 4.343 ± 0.613
3.41ThrVal: 3.41 ± 0.295
0.547ThrTrp: 0.547 ± 0.11
2.027ThrTyr: 2.027 ± 0.297
0.0ThrXaa: 0.0 ± 0.0
Val
2.381ValAla: 2.381 ± 0.256
2.51ValCys: 2.51 ± 0.338
4.311ValAsp: 4.311 ± 0.381
4.086ValGlu: 4.086 ± 0.359
4.826ValPhe: 4.826 ± 0.44
2.767ValGly: 2.767 ± 0.285
1.094ValHis: 1.094 ± 0.153
3.861ValIle: 3.861 ± 0.342
4.923ValLys: 4.923 ± 0.439
6.692ValLeu: 6.692 ± 0.451
2.027ValMet: 2.027 ± 0.277
4.376ValAsn: 4.376 ± 0.4
2.123ValPro: 2.123 ± 0.269
1.19ValGln: 1.19 ± 0.175
3.41ValArg: 3.41 ± 0.279
3.475ValSer: 3.475 ± 0.418
3.121ValThr: 3.121 ± 0.311
5.18ValVal: 5.18 ± 0.471
0.643ValTrp: 0.643 ± 0.132
3.507ValTyr: 3.507 ± 0.323
0.0ValXaa: 0.0 ± 0.0
Trp
0.483TrpAla: 0.483 ± 0.112
0.386TrpCys: 0.386 ± 0.117
0.515TrpAsp: 0.515 ± 0.141
0.579TrpGlu: 0.579 ± 0.146
0.708TrpPhe: 0.708 ± 0.129
0.579TrpGly: 0.579 ± 0.13
0.129TrpHis: 0.129 ± 0.061
0.611TrpIle: 0.611 ± 0.161
0.676TrpLys: 0.676 ± 0.155
1.126TrpLeu: 1.126 ± 0.213
0.29TrpMet: 0.29 ± 0.093
0.837TrpAsn: 0.837 ± 0.15
0.418TrpPro: 0.418 ± 0.131
0.29TrpGln: 0.29 ± 0.089
0.611TrpArg: 0.611 ± 0.117
0.643TrpSer: 0.643 ± 0.133
0.45TrpThr: 0.45 ± 0.106
0.547TrpVal: 0.547 ± 0.135
0.354TrpTrp: 0.354 ± 0.137
0.74TrpTyr: 0.74 ± 0.131
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.512TyrAla: 1.512 ± 0.197
1.737TyrCys: 1.737 ± 0.245
3.314TyrAsp: 3.314 ± 0.315
3.7TyrGlu: 3.7 ± 0.378
2.542TyrPhe: 2.542 ± 0.255
1.577TyrGly: 1.577 ± 0.223
1.287TyrHis: 1.287 ± 0.181
3.217TyrIle: 3.217 ± 0.373
3.925TyrLys: 3.925 ± 0.358
3.636TyrLeu: 3.636 ± 0.292
1.255TyrMet: 1.255 ± 0.209
4.279TyrAsn: 4.279 ± 0.347
1.416TyrPro: 1.416 ± 0.249
1.223TyrGln: 1.223 ± 0.206
2.284TyrArg: 2.284 ± 0.309
3.121TyrSer: 3.121 ± 0.264
2.767TyrThr: 2.767 ± 0.318
3.153TyrVal: 3.153 ± 0.288
0.483TyrTrp: 0.483 ± 0.124
3.217TyrTyr: 3.217 ± 0.368
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.097XaaPhe: 0.097 ± 0.07
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.032XaaLys: 0.032 ± 0.031
0.032XaaLeu: 0.032 ± 0.032
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.032XaaGln: 0.032 ± 0.029
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 125 proteins (31082 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski