Amino acid dipepetide frequency for Trichoplusia ni granulovirus LBIV-12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.599AlaAla: 2.599 ± 0.282
0.826AlaCys: 0.826 ± 0.129
2.639AlaAsp: 2.639 ± 0.249
2.659AlaGlu: 2.659 ± 0.225
2.135AlaPhe: 2.135 ± 0.187
1.994AlaGly: 1.994 ± 0.208
0.987AlaHis: 0.987 ± 0.141
2.86AlaIle: 2.86 ± 0.25
2.82AlaLys: 2.82 ± 0.251
4.855AlaLeu: 4.855 ± 0.306
1.249AlaMet: 1.249 ± 0.14
3.304AlaAsn: 3.304 ± 0.297
1.833AlaPro: 1.833 ± 0.2
1.873AlaGln: 1.873 ± 0.231
1.974AlaArg: 1.974 ± 0.242
3.022AlaSer: 3.022 ± 0.217
3.062AlaThr: 3.062 ± 0.284
3.364AlaVal: 3.364 ± 0.233
0.584AlaTrp: 0.584 ± 0.107
2.276AlaTyr: 2.276 ± 0.245
0.0AlaXaa: 0.0 ± 0.0
Cys
1.108CysAla: 1.108 ± 0.146
0.483CysCys: 0.483 ± 0.089
1.773CysAsp: 1.773 ± 0.195
1.148CysGlu: 1.148 ± 0.153
0.927CysPhe: 0.927 ± 0.168
1.39CysGly: 1.39 ± 0.168
0.786CysHis: 0.786 ± 0.124
1.35CysIle: 1.35 ± 0.177
1.672CysLys: 1.672 ± 0.229
1.954CysLeu: 1.954 ± 0.211
0.624CysMet: 0.624 ± 0.103
1.551CysAsn: 1.551 ± 0.169
0.927CysPro: 0.927 ± 0.147
0.665CysGln: 0.665 ± 0.126
1.249CysArg: 1.249 ± 0.15
1.712CysSer: 1.712 ± 0.222
1.209CysThr: 1.209 ± 0.176
2.357CysVal: 2.357 ± 0.242
0.141CysTrp: 0.141 ± 0.051
1.43CysTyr: 1.43 ± 0.173
0.0CysXaa: 0.0 ± 0.0
Asp
3.183AspAla: 3.183 ± 0.252
1.39AspCys: 1.39 ± 0.186
4.774AspAsp: 4.774 ± 0.38
4.23AspGlu: 4.23 ± 0.234
2.437AspPhe: 2.437 ± 0.233
3.163AspGly: 3.163 ± 0.271
1.531AspHis: 1.531 ± 0.169
3.344AspIle: 3.344 ± 0.303
4.532AspLys: 4.532 ± 0.339
5.52AspLeu: 5.52 ± 0.35
1.612AspMet: 1.612 ± 0.201
4.593AspAsn: 4.593 ± 0.325
2.095AspPro: 2.095 ± 0.196
2.236AspGln: 2.236 ± 0.193
2.679AspArg: 2.679 ± 0.258
3.868AspSer: 3.868 ± 0.308
3.868AspThr: 3.868 ± 0.265
5.278AspVal: 5.278 ± 0.31
0.786AspTrp: 0.786 ± 0.113
3.384AspTyr: 3.384 ± 0.29
0.0AspXaa: 0.0 ± 0.0
Glu
2.578GluAla: 2.578 ± 0.211
1.571GluCys: 1.571 ± 0.188
3.404GluAsp: 3.404 ± 0.218
3.989GluGlu: 3.989 ± 0.371
2.699GluPhe: 2.699 ± 0.246
1.531GluGly: 1.531 ± 0.177
1.531GluHis: 1.531 ± 0.178
3.888GluIle: 3.888 ± 0.282
3.404GluLys: 3.404 ± 0.292
5.479GluLeu: 5.479 ± 0.381
1.551GluMet: 1.551 ± 0.151
4.19GluAsn: 4.19 ± 0.331
1.652GluPro: 1.652 ± 0.202
2.538GluGln: 2.538 ± 0.265
2.76GluArg: 2.76 ± 0.212
3.505GluSer: 3.505 ± 0.296
2.76GluThr: 2.76 ± 0.255
2.84GluVal: 2.84 ± 0.237
0.504GluTrp: 0.504 ± 0.095
2.599GluTyr: 2.599 ± 0.21
0.0GluXaa: 0.0 ± 0.0
Phe
2.014PheAla: 2.014 ± 0.202
1.35PheCys: 1.35 ± 0.188
3.606PheAsp: 3.606 ± 0.272
3.163PheGlu: 3.163 ± 0.267
1.672PhePhe: 1.672 ± 0.179
1.712PheGly: 1.712 ± 0.172
0.846PheHis: 0.846 ± 0.151
3.082PheIle: 3.082 ± 0.261
2.901PheLys: 2.901 ± 0.249
3.525PheLeu: 3.525 ± 0.297
0.947PheMet: 0.947 ± 0.133
3.324PheAsn: 3.324 ± 0.25
1.249PhePro: 1.249 ± 0.176
1.39PheGln: 1.39 ± 0.187
1.793PheArg: 1.793 ± 0.194
2.377PheSer: 2.377 ± 0.28
2.74PheThr: 2.74 ± 0.234
4.371PheVal: 4.371 ± 0.375
0.322PheTrp: 0.322 ± 0.076
2.558PheTyr: 2.558 ± 0.226
0.0PheXaa: 0.0 ± 0.0
Gly
1.894GlyAla: 1.894 ± 0.259
0.826GlyCys: 0.826 ± 0.124
2.8GlyAsp: 2.8 ± 0.216
1.632GlyGlu: 1.632 ± 0.145
1.612GlyPhe: 1.612 ± 0.207
2.599GlyGly: 2.599 ± 0.261
0.846GlyHis: 0.846 ± 0.15
1.692GlyIle: 1.692 ± 0.196
2.458GlyLys: 2.458 ± 0.225
3.042GlyLeu: 3.042 ± 0.222
0.987GlyMet: 0.987 ± 0.132
2.337GlyAsn: 2.337 ± 0.247
1.269GlyPro: 1.269 ± 0.167
1.209GlyGln: 1.209 ± 0.195
1.732GlyArg: 1.732 ± 0.207
2.176GlySer: 2.176 ± 0.207
2.095GlyThr: 2.095 ± 0.228
3.586GlyVal: 3.586 ± 0.371
0.363GlyTrp: 0.363 ± 0.094
2.196GlyTyr: 2.196 ± 0.241
0.0GlyXaa: 0.0 ± 0.0
His
1.128HisAla: 1.128 ± 0.133
0.645HisCys: 0.645 ± 0.106
1.471HisAsp: 1.471 ± 0.172
1.128HisGlu: 1.128 ± 0.113
1.048HisPhe: 1.048 ± 0.147
0.967HisGly: 0.967 ± 0.137
0.947HisHis: 0.947 ± 0.153
1.269HisIle: 1.269 ± 0.188
1.712HisLys: 1.712 ± 0.19
2.135HisLeu: 2.135 ± 0.229
0.463HisMet: 0.463 ± 0.09
2.075HisAsn: 2.075 ± 0.213
0.927HisPro: 0.927 ± 0.142
0.846HisGln: 0.846 ± 0.134
0.967HisArg: 0.967 ± 0.132
1.531HisSer: 1.531 ± 0.174
1.712HisThr: 1.712 ± 0.18
1.934HisVal: 1.934 ± 0.204
0.282HisTrp: 0.282 ± 0.065
1.269HisTyr: 1.269 ± 0.154
0.0HisXaa: 0.0 ± 0.0
Ile
2.8IleAla: 2.8 ± 0.294
1.068IleCys: 1.068 ± 0.141
4.371IleAsp: 4.371 ± 0.319
3.586IleGlu: 3.586 ± 0.244
2.458IlePhe: 2.458 ± 0.247
2.075IleGly: 2.075 ± 0.185
1.33IleHis: 1.33 ± 0.163
3.666IleIle: 3.666 ± 0.267
5.399IleLys: 5.399 ± 0.364
4.532IleLeu: 4.532 ± 0.312
1.672IleMet: 1.672 ± 0.178
5.479IleAsn: 5.479 ± 0.33
2.095IlePro: 2.095 ± 0.194
2.176IleGln: 2.176 ± 0.221
2.961IleArg: 2.961 ± 0.258
3.908IleSer: 3.908 ± 0.24
3.908IleThr: 3.908 ± 0.282
4.21IleVal: 4.21 ± 0.262
0.564IleTrp: 0.564 ± 0.104
2.377IleTyr: 2.377 ± 0.22
0.0IleXaa: 0.0 ± 0.0
Lys
2.256LysAla: 2.256 ± 0.239
1.954LysCys: 1.954 ± 0.211
3.183LysAsp: 3.183 ± 0.307
3.384LysGlu: 3.384 ± 0.242
3.284LysPhe: 3.284 ± 0.29
1.652LysGly: 1.652 ± 0.189
2.256LysHis: 2.256 ± 0.241
5.096LysIle: 5.096 ± 0.357
5.177LysLys: 5.177 ± 0.549
7.091LysLeu: 7.091 ± 0.521
1.994LysMet: 1.994 ± 0.247
4.875LysAsn: 4.875 ± 0.328
2.578LysPro: 2.578 ± 0.267
3.062LysGln: 3.062 ± 0.286
4.412LysArg: 4.412 ± 0.332
4.21LysSer: 4.21 ± 0.293
3.485LysThr: 3.485 ± 0.324
3.586LysVal: 3.586 ± 0.302
0.624LysTrp: 0.624 ± 0.099
3.586LysTyr: 3.586 ± 0.32
0.0LysXaa: 0.0 ± 0.0
Leu
4.714LeuAla: 4.714 ± 0.339
2.417LeuCys: 2.417 ± 0.227
5.62LeuAsp: 5.62 ± 0.312
4.774LeuGlu: 4.774 ± 0.367
4.17LeuPhe: 4.17 ± 0.281
2.82LeuGly: 2.82 ± 0.27
2.639LeuHis: 2.639 ± 0.214
5.681LeuIle: 5.681 ± 0.397
6.869LeuLys: 6.869 ± 0.499
9.508LeuLeu: 9.508 ± 0.468
2.317LeuMet: 2.317 ± 0.233
5.983LeuAsn: 5.983 ± 0.365
4.15LeuPro: 4.15 ± 0.214
4.553LeuGln: 4.553 ± 0.331
4.976LeuArg: 4.976 ± 0.307
5.419LeuSer: 5.419 ± 0.309
5.036LeuThr: 5.036 ± 0.368
5.902LeuVal: 5.902 ± 0.387
0.826LeuTrp: 0.826 ± 0.127
4.976LeuTyr: 4.976 ± 0.287
0.0LeuXaa: 0.0 ± 0.0
Met
1.007MetAla: 1.007 ± 0.164
0.504MetCys: 0.504 ± 0.098
1.914MetAsp: 1.914 ± 0.21
1.571MetGlu: 1.571 ± 0.186
1.289MetPhe: 1.289 ± 0.185
0.765MetGly: 0.765 ± 0.118
0.423MetHis: 0.423 ± 0.096
1.309MetIle: 1.309 ± 0.155
1.652MetLys: 1.652 ± 0.213
2.78MetLeu: 2.78 ± 0.203
0.665MetMet: 0.665 ± 0.102
1.511MetAsn: 1.511 ± 0.146
0.604MetPro: 0.604 ± 0.116
1.088MetGln: 1.088 ± 0.169
1.168MetArg: 1.168 ± 0.173
2.014MetSer: 2.014 ± 0.224
1.632MetThr: 1.632 ± 0.195
2.377MetVal: 2.377 ± 0.238
0.262MetTrp: 0.262 ± 0.075
1.511MetTyr: 1.511 ± 0.174
0.0MetXaa: 0.0 ± 0.0
Asn
3.243AsnAla: 3.243 ± 0.28
1.571AsnCys: 1.571 ± 0.204
4.19AsnAsp: 4.19 ± 0.312
3.707AsnGlu: 3.707 ± 0.289
3.183AsnPhe: 3.183 ± 0.232
2.82AsnGly: 2.82 ± 0.234
1.249AsnHis: 1.249 ± 0.143
4.532AsnIle: 4.532 ± 0.307
4.955AsnLys: 4.955 ± 0.357
6.003AsnLeu: 6.003 ± 0.334
2.014AsnMet: 2.014 ± 0.233
5.661AsnAsn: 5.661 ± 0.433
2.478AsnPro: 2.478 ± 0.247
2.196AsnGln: 2.196 ± 0.226
2.941AsnArg: 2.941 ± 0.25
5.117AsnSer: 5.117 ± 0.288
5.056AsnThr: 5.056 ± 0.341
5.499AsnVal: 5.499 ± 0.361
0.846AsnTrp: 0.846 ± 0.129
3.868AsnTyr: 3.868 ± 0.307
0.0AsnXaa: 0.0 ± 0.0
Pro
1.994ProAla: 1.994 ± 0.224
0.624ProCys: 0.624 ± 0.127
2.498ProAsp: 2.498 ± 0.216
1.934ProGlu: 1.934 ± 0.183
1.591ProPhe: 1.591 ± 0.194
1.41ProGly: 1.41 ± 0.182
1.007ProHis: 1.007 ± 0.134
2.377ProIle: 2.377 ± 0.245
1.652ProLys: 1.652 ± 0.205
3.042ProLeu: 3.042 ± 0.278
0.826ProMet: 0.826 ± 0.118
2.458ProAsn: 2.458 ± 0.201
2.276ProPro: 2.276 ± 0.564
1.753ProGln: 1.753 ± 0.17
1.491ProArg: 1.491 ± 0.177
2.417ProSer: 2.417 ± 0.211
2.679ProThr: 2.679 ± 0.266
2.78ProVal: 2.78 ± 0.229
0.363ProTrp: 0.363 ± 0.091
1.894ProTyr: 1.894 ± 0.195
0.0ProXaa: 0.0 ± 0.0
Gln
1.732GlnAla: 1.732 ± 0.176
1.269GlnCys: 1.269 ± 0.162
1.914GlnAsp: 1.914 ± 0.191
1.934GlnGlu: 1.934 ± 0.205
1.813GlnPhe: 1.813 ± 0.213
1.189GlnGly: 1.189 ± 0.168
1.269GlnHis: 1.269 ± 0.162
2.397GlnIle: 2.397 ± 0.203
2.599GlnLys: 2.599 ± 0.231
4.694GlnLeu: 4.694 ± 0.318
1.088GlnMet: 1.088 ± 0.165
2.357GlnAsn: 2.357 ± 0.198
1.511GlnPro: 1.511 ± 0.178
2.578GlnGln: 2.578 ± 0.291
2.256GlnArg: 2.256 ± 0.214
2.256GlnSer: 2.256 ± 0.202
2.256GlnThr: 2.256 ± 0.214
1.853GlnVal: 1.853 ± 0.207
0.262GlnTrp: 0.262 ± 0.071
2.216GlnTyr: 2.216 ± 0.219
0.0GlnXaa: 0.0 ± 0.0
Arg
2.135ArgAla: 2.135 ± 0.227
1.088ArgCys: 1.088 ± 0.156
3.505ArgAsp: 3.505 ± 0.258
2.518ArgGlu: 2.518 ± 0.216
2.337ArgPhe: 2.337 ± 0.245
1.612ArgGly: 1.612 ± 0.187
1.309ArgHis: 1.309 ± 0.177
2.961ArgIle: 2.961 ± 0.213
2.699ArgLys: 2.699 ± 0.254
5.681ArgLeu: 5.681 ± 0.312
1.269ArgMet: 1.269 ± 0.156
3.022ArgAsn: 3.022 ± 0.286
1.813ArgPro: 1.813 ± 0.212
2.155ArgGln: 2.155 ± 0.213
2.498ArgArg: 2.498 ± 0.27
2.82ArgSer: 2.82 ± 0.279
2.417ArgThr: 2.417 ± 0.216
3.525ArgVal: 3.525 ± 0.285
0.504ArgTrp: 0.504 ± 0.115
2.518ArgTyr: 2.518 ± 0.244
0.0ArgXaa: 0.0 ± 0.0
Ser
2.881SerAla: 2.881 ± 0.262
1.672SerCys: 1.672 ± 0.163
3.888SerAsp: 3.888 ± 0.272
3.143SerGlu: 3.143 ± 0.202
3.263SerPhe: 3.263 ± 0.246
2.78SerGly: 2.78 ± 0.264
1.37SerHis: 1.37 ± 0.166
3.827SerIle: 3.827 ± 0.294
4.049SerLys: 4.049 ± 0.289
5.62SerLeu: 5.62 ± 0.334
1.591SerMet: 1.591 ± 0.176
4.553SerAsn: 4.553 ± 0.354
2.256SerPro: 2.256 ± 0.242
2.317SerGln: 2.317 ± 0.234
3.102SerArg: 3.102 ± 0.285
4.996SerSer: 4.996 ± 0.509
4.271SerThr: 4.271 ± 0.31
4.553SerVal: 4.553 ± 0.337
0.504SerTrp: 0.504 ± 0.109
2.76SerTyr: 2.76 ± 0.275
0.0SerXaa: 0.0 ± 0.0
Thr
2.578ThrAla: 2.578 ± 0.195
1.229ThrCys: 1.229 ± 0.128
3.707ThrAsp: 3.707 ± 0.257
2.921ThrGlu: 2.921 ± 0.227
2.599ThrPhe: 2.599 ± 0.244
2.075ThrGly: 2.075 ± 0.209
1.45ThrHis: 1.45 ± 0.203
3.908ThrIle: 3.908 ± 0.342
3.948ThrLys: 3.948 ± 0.327
5.963ThrLeu: 5.963 ± 0.401
1.43ThrMet: 1.43 ± 0.164
4.814ThrAsn: 4.814 ± 0.345
2.679ThrPro: 2.679 ± 0.23
1.954ThrGln: 1.954 ± 0.19
3.223ThrArg: 3.223 ± 0.247
4.15ThrSer: 4.15 ± 0.306
4.996ThrThr: 4.996 ± 0.546
4.633ThrVal: 4.633 ± 0.243
0.665ThrTrp: 0.665 ± 0.101
2.478ThrTyr: 2.478 ± 0.229
0.02ThrXaa: 0.02 ± 0.022
Val
3.968ValAla: 3.968 ± 0.275
1.974ValCys: 1.974 ± 0.193
4.955ValAsp: 4.955 ± 0.336
4.089ValGlu: 4.089 ± 0.303
3.425ValPhe: 3.425 ± 0.283
2.357ValGly: 2.357 ± 0.268
1.511ValHis: 1.511 ± 0.167
4.13ValIle: 4.13 ± 0.262
5.056ValLys: 5.056 ± 0.398
6.406ValLeu: 6.406 ± 0.317
1.833ValMet: 1.833 ± 0.203
4.432ValAsn: 4.432 ± 0.304
3.122ValPro: 3.122 ± 0.231
2.578ValGln: 2.578 ± 0.242
3.163ValArg: 3.163 ± 0.298
4.21ValSer: 4.21 ± 0.287
4.734ValThr: 4.734 ± 0.281
5.499ValVal: 5.499 ± 0.39
0.665ValTrp: 0.665 ± 0.12
4.452ValTyr: 4.452 ± 0.275
0.0ValXaa: 0.0 ± 0.0
Trp
0.322TrpAla: 0.322 ± 0.098
0.443TrpCys: 0.443 ± 0.106
0.584TrpAsp: 0.584 ± 0.132
0.483TrpGlu: 0.483 ± 0.108
0.322TrpPhe: 0.322 ± 0.078
0.423TrpGly: 0.423 ± 0.097
0.181TrpHis: 0.181 ± 0.086
0.463TrpIle: 0.463 ± 0.087
0.584TrpLys: 0.584 ± 0.106
1.108TrpLeu: 1.108 ± 0.161
0.302TrpMet: 0.302 ± 0.086
0.504TrpAsn: 0.504 ± 0.089
0.363TrpPro: 0.363 ± 0.1
0.463TrpGln: 0.463 ± 0.091
0.806TrpArg: 0.806 ± 0.128
0.745TrpSer: 0.745 ± 0.128
0.504TrpThr: 0.504 ± 0.097
0.383TrpVal: 0.383 ± 0.084
0.181TrpTrp: 0.181 ± 0.067
0.544TrpTyr: 0.544 ± 0.107
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.78TyrAla: 2.78 ± 0.214
1.551TyrCys: 1.551 ± 0.184
3.707TyrAsp: 3.707 ± 0.272
3.001TyrGlu: 3.001 ± 0.212
2.679TyrPhe: 2.679 ± 0.237
2.014TyrGly: 2.014 ± 0.215
0.987TyrHis: 0.987 ± 0.141
2.8TyrIle: 2.8 ± 0.253
3.606TyrLys: 3.606 ± 0.255
4.412TyrLeu: 4.412 ± 0.309
1.591TyrMet: 1.591 ± 0.186
4.109TyrAsn: 4.109 ± 0.294
1.189TyrPro: 1.189 ± 0.189
1.813TyrGln: 1.813 ± 0.193
2.236TyrArg: 2.236 ± 0.213
2.941TyrSer: 2.941 ± 0.225
2.981TyrThr: 2.981 ± 0.263
4.029TyrVal: 4.029 ± 0.232
0.463TyrTrp: 0.463 ± 0.099
2.619TyrTyr: 2.619 ± 0.246
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.02XaaThr: 0.02 ± 0.022
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 172 proteins (49643 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski