Amino acid dipepetide frequency for Drosophila innubila nudivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.822AlaAla: 2.822 ± 0.412
0.95AlaCys: 0.95 ± 0.146
1.794AlaAsp: 1.794 ± 0.19
1.767AlaGlu: 1.767 ± 0.185
1.609AlaPhe: 1.609 ± 0.183
0.897AlaGly: 0.897 ± 0.17
0.659AlaHis: 0.659 ± 0.14
4.247AlaIle: 4.247 ± 0.367
2.796AlaLys: 2.796 ± 0.25
3.904AlaLeu: 3.904 ± 0.34
1.266AlaMet: 1.266 ± 0.167
3.587AlaAsn: 3.587 ± 0.294
1.477AlaPro: 1.477 ± 0.199
1.266AlaGln: 1.266 ± 0.174
1.398AlaArg: 1.398 ± 0.269
3.482AlaSer: 3.482 ± 0.326
3.93AlaThr: 3.93 ± 0.442
1.688AlaVal: 1.688 ± 0.202
0.237AlaTrp: 0.237 ± 0.076
1.846AlaTyr: 1.846 ± 0.201
0.0AlaXaa: 0.0 ± 0.0
Cys
1.055CysAla: 1.055 ± 0.174
0.264CysCys: 0.264 ± 0.083
1.556CysAsp: 1.556 ± 0.225
0.923CysGlu: 0.923 ± 0.187
0.791CysPhe: 0.791 ± 0.147
0.818CysGly: 0.818 ± 0.164
0.317CysHis: 0.317 ± 0.082
1.899CysIle: 1.899 ± 0.227
1.345CysLys: 1.345 ± 0.178
1.556CysLeu: 1.556 ± 0.2
0.554CysMet: 0.554 ± 0.142
1.715CysAsn: 1.715 ± 0.235
0.422CysPro: 0.422 ± 0.107
0.554CysGln: 0.554 ± 0.117
1.108CysArg: 1.108 ± 0.167
1.609CysSer: 1.609 ± 0.228
1.108CysThr: 1.108 ± 0.152
1.24CysVal: 1.24 ± 0.223
0.079CysTrp: 0.079 ± 0.047
0.791CysTyr: 0.791 ± 0.156
0.0CysXaa: 0.0 ± 0.0
Asp
2.4AspAla: 2.4 ± 0.247
1.266AspCys: 1.266 ± 0.171
9.602AspAsp: 9.602 ± 1.074
4.221AspGlu: 4.221 ± 0.38
2.664AspPhe: 2.664 ± 0.259
3.535AspGly: 3.535 ± 0.345
1.134AspHis: 1.134 ± 0.155
5.856AspIle: 5.856 ± 0.479
2.981AspLys: 2.981 ± 0.294
4.511AspLeu: 4.511 ± 0.404
1.108AspMet: 1.108 ± 0.15
5.249AspAsn: 5.249 ± 0.453
1.82AspPro: 1.82 ± 0.224
1.266AspGln: 1.266 ± 0.178
2.058AspArg: 2.058 ± 0.246
4.748AspSer: 4.748 ± 0.465
4.036AspThr: 4.036 ± 0.39
3.218AspVal: 3.218 ± 0.287
0.369AspTrp: 0.369 ± 0.098
2.506AspTyr: 2.506 ± 0.268
0.0AspXaa: 0.0 ± 0.0
Glu
1.451GluAla: 1.451 ± 0.191
0.923GluCys: 0.923 ± 0.161
2.189GluAsp: 2.189 ± 0.307
2.005GluGlu: 2.005 ± 0.237
2.453GluPhe: 2.453 ± 0.289
1.293GluGly: 1.293 ± 0.329
1.055GluHis: 1.055 ± 0.135
4.458GluIle: 4.458 ± 0.357
3.165GluLys: 3.165 ± 0.288
4.247GluLeu: 4.247 ± 0.314
1.688GluMet: 1.688 ± 0.223
5.144GluAsn: 5.144 ± 0.293
1.609GluPro: 1.609 ± 0.166
1.952GluGln: 1.952 ± 0.241
1.82GluArg: 1.82 ± 0.26
4.009GluSer: 4.009 ± 0.361
3.297GluThr: 3.297 ± 0.285
1.556GluVal: 1.556 ± 0.202
0.369GluTrp: 0.369 ± 0.091
3.271GluTyr: 3.271 ± 0.298
0.0GluXaa: 0.0 ± 0.0
Phe
2.11PheAla: 2.11 ± 0.206
0.818PheCys: 0.818 ± 0.159
3.139PheAsp: 3.139 ± 0.322
2.453PheGlu: 2.453 ± 0.246
1.266PhePhe: 1.266 ± 0.189
1.952PheGly: 1.952 ± 0.229
0.897PheHis: 0.897 ± 0.174
3.746PheIle: 3.746 ± 0.33
3.165PheLys: 3.165 ± 0.286
2.928PheLeu: 2.928 ± 0.343
1.398PheMet: 1.398 ± 0.193
4.194PheAsn: 4.194 ± 0.332
1.24PhePro: 1.24 ± 0.165
1.609PheGln: 1.609 ± 0.205
1.266PheArg: 1.266 ± 0.213
2.664PheSer: 2.664 ± 0.296
2.717PheThr: 2.717 ± 0.24
2.691PheVal: 2.691 ± 0.269
0.29PheTrp: 0.29 ± 0.085
1.504PheTyr: 1.504 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
1.161GlyAla: 1.161 ± 0.145
0.686GlyCys: 0.686 ± 0.146
2.11GlyAsp: 2.11 ± 0.225
1.662GlyGlu: 1.662 ± 0.306
1.846GlyPhe: 1.846 ± 0.209
2.321GlyGly: 2.321 ± 0.89
0.58GlyHis: 0.58 ± 0.115
3.192GlyIle: 3.192 ± 0.278
1.952GlyLys: 1.952 ± 0.209
2.532GlyLeu: 2.532 ± 0.287
0.528GlyMet: 0.528 ± 0.143
2.664GlyAsn: 2.664 ± 0.234
1.055GlyPro: 1.055 ± 0.182
0.897GlyGln: 0.897 ± 0.148
1.319GlyArg: 1.319 ± 0.19
2.611GlySer: 2.611 ± 0.268
2.137GlyThr: 2.137 ± 0.236
2.058GlyVal: 2.058 ± 0.304
0.317GlyTrp: 0.317 ± 0.092
2.031GlyTyr: 2.031 ± 0.278
0.0GlyXaa: 0.0 ± 0.0
His
0.554HisAla: 0.554 ± 0.123
0.29HisCys: 0.29 ± 0.081
1.108HisAsp: 1.108 ± 0.183
1.002HisGlu: 1.002 ± 0.146
1.213HisPhe: 1.213 ± 0.187
1.002HisGly: 1.002 ± 0.148
0.844HisHis: 0.844 ± 0.205
1.767HisIle: 1.767 ± 0.237
1.345HisLys: 1.345 ± 0.186
1.715HisLeu: 1.715 ± 0.169
0.29HisMet: 0.29 ± 0.089
1.583HisAsn: 1.583 ± 0.189
0.818HisPro: 0.818 ± 0.19
1.134HisGln: 1.134 ± 0.203
0.923HisArg: 0.923 ± 0.177
1.293HisSer: 1.293 ± 0.184
1.187HisThr: 1.187 ± 0.204
1.134HisVal: 1.134 ± 0.194
0.132HisTrp: 0.132 ± 0.057
1.108HisTyr: 1.108 ± 0.194
0.0HisXaa: 0.0 ± 0.0
Ile
4.379IleAla: 4.379 ± 0.31
2.031IleCys: 2.031 ± 0.283
5.671IleAsp: 5.671 ± 0.445
5.091IleGlu: 5.091 ± 0.32
3.772IlePhe: 3.772 ± 0.35
2.822IleGly: 2.822 ± 0.285
2.084IleHis: 2.084 ± 0.215
8.837IleIle: 8.837 ± 0.756
6.041IleLys: 6.041 ± 0.393
8.256IleLeu: 8.256 ± 0.462
1.899IleMet: 1.899 ± 0.228
8.098IleAsn: 8.098 ± 0.538
3.93IlePro: 3.93 ± 0.307
3.324IleGln: 3.324 ± 0.246
2.902IleArg: 2.902 ± 0.238
6.542IleSer: 6.542 ± 0.422
5.645IleThr: 5.645 ± 0.369
5.671IleVal: 5.671 ± 0.405
0.686IleTrp: 0.686 ± 0.138
4.062IleTyr: 4.062 ± 0.388
0.0IleXaa: 0.0 ± 0.0
Lys
1.688LysAla: 1.688 ± 0.214
1.846LysCys: 1.846 ± 0.247
2.4LysAsp: 2.4 ± 0.248
2.374LysGlu: 2.374 ± 0.226
3.878LysPhe: 3.878 ± 0.313
0.95LysGly: 0.95 ± 0.192
1.477LysHis: 1.477 ± 0.186
6.99LysIle: 6.99 ± 0.483
4.115LysLys: 4.115 ± 0.358
5.882LysLeu: 5.882 ± 0.462
2.189LysMet: 2.189 ± 0.253
6.621LysAsn: 6.621 ± 0.473
2.981LysPro: 2.981 ± 0.341
2.506LysGln: 2.506 ± 0.259
2.875LysArg: 2.875 ± 0.253
5.434LysSer: 5.434 ± 0.439
5.012LysThr: 5.012 ± 0.34
2.11LysVal: 2.11 ± 0.264
0.369LysTrp: 0.369 ± 0.11
4.405LysTyr: 4.405 ± 0.385
0.0LysXaa: 0.0 ± 0.0
Leu
3.376LeuAla: 3.376 ± 0.407
1.583LeuCys: 1.583 ± 0.201
5.038LeuAsp: 5.038 ± 0.387
3.245LeuGlu: 3.245 ± 0.335
3.06LeuPhe: 3.06 ± 0.318
2.321LeuGly: 2.321 ± 0.275
1.767LeuHis: 1.767 ± 0.224
6.304LeuIle: 6.304 ± 0.398
6.515LeuLys: 6.515 ± 0.529
7.096LeuLeu: 7.096 ± 0.428
2.77LeuMet: 2.77 ± 0.287
8.019LeuAsn: 8.019 ± 0.54
3.719LeuPro: 3.719 ± 0.313
3.64LeuGln: 3.64 ± 0.459
3.192LeuArg: 3.192 ± 0.341
6.199LeuSer: 6.199 ± 0.37
4.643LeuThr: 4.643 ± 0.343
3.772LeuVal: 3.772 ± 0.402
0.501LeuTrp: 0.501 ± 0.121
3.746LeuTyr: 3.746 ± 0.399
0.0LeuXaa: 0.0 ± 0.0
Met
1.741MetAla: 1.741 ± 0.184
0.58MetCys: 0.58 ± 0.137
1.741MetAsp: 1.741 ± 0.197
1.319MetGlu: 1.319 ± 0.176
1.266MetPhe: 1.266 ± 0.152
1.029MetGly: 1.029 ± 0.168
0.58MetHis: 0.58 ± 0.123
2.48MetIle: 2.48 ± 0.247
1.424MetLys: 1.424 ± 0.236
2.084MetLeu: 2.084 ± 0.254
1.187MetMet: 1.187 ± 0.25
2.242MetAsn: 2.242 ± 0.229
1.53MetPro: 1.53 ± 0.204
0.818MetGln: 0.818 ± 0.149
1.055MetArg: 1.055 ± 0.158
2.321MetSer: 2.321 ± 0.225
1.583MetThr: 1.583 ± 0.225
1.108MetVal: 1.108 ± 0.171
0.106MetTrp: 0.106 ± 0.052
1.583MetTyr: 1.583 ± 0.228
0.0MetXaa: 0.0 ± 0.0
Asn
4.484AsnAla: 4.484 ± 0.306
1.741AsnCys: 1.741 ± 0.212
8.362AsnAsp: 8.362 ± 0.608
5.144AsnGlu: 5.144 ± 0.392
3.034AsnPhe: 3.034 ± 0.306
4.062AsnGly: 4.062 ± 0.382
1.741AsnHis: 1.741 ± 0.218
8.362AsnIle: 8.362 ± 0.476
5.223AsnLys: 5.223 ± 0.362
6.436AsnLeu: 6.436 ± 0.414
2.295AsnMet: 2.295 ± 0.232
12.899AsnAsn: 12.899 ± 1.157
2.4AsnPro: 2.4 ± 0.262
2.77AsnGln: 2.77 ± 0.29
3.508AsnArg: 3.508 ± 0.319
6.964AsnSer: 6.964 ± 0.546
5.777AsnThr: 5.777 ± 0.474
5.909AsnVal: 5.909 ± 0.465
0.396AsnTrp: 0.396 ± 0.123
3.007AsnTyr: 3.007 ± 0.322
0.0AsnXaa: 0.0 ± 0.0
Pro
1.161ProAla: 1.161 ± 0.185
0.528ProCys: 0.528 ± 0.133
1.741ProAsp: 1.741 ± 0.198
2.005ProGlu: 2.005 ± 0.23
1.266ProPhe: 1.266 ± 0.187
1.134ProGly: 1.134 ± 0.19
0.818ProHis: 0.818 ± 0.14
3.798ProIle: 3.798 ± 0.34
3.007ProLys: 3.007 ± 0.261
2.928ProLeu: 2.928 ± 0.365
0.976ProMet: 0.976 ± 0.174
3.482ProAsn: 3.482 ± 0.304
2.559ProPro: 2.559 ± 0.589
2.058ProGln: 2.058 ± 0.261
1.187ProArg: 1.187 ± 0.187
3.429ProSer: 3.429 ± 0.283
3.535ProThr: 3.535 ± 0.338
1.504ProVal: 1.504 ± 0.2
0.132ProTrp: 0.132 ± 0.053
1.926ProTyr: 1.926 ± 0.24
0.0ProXaa: 0.0 ± 0.0
Gln
1.161GlnAla: 1.161 ± 0.192
0.712GlnCys: 0.712 ± 0.136
1.345GlnAsp: 1.345 ± 0.189
0.923GlnGlu: 0.923 ± 0.154
1.846GlnPhe: 1.846 ± 0.226
0.448GlnGly: 0.448 ± 0.164
1.187GlnHis: 1.187 ± 0.246
2.981GlnIle: 2.981 ± 0.275
2.717GlnLys: 2.717 ± 0.296
3.456GlnLeu: 3.456 ± 0.341
1.398GlnMet: 1.398 ± 0.195
3.746GlnAsn: 3.746 ± 0.397
1.978GlnPro: 1.978 ± 0.304
5.144GlnGln: 5.144 ± 1.225
1.293GlnArg: 1.293 ± 0.172
3.113GlnSer: 3.113 ± 0.346
2.137GlnThr: 2.137 ± 0.247
1.161GlnVal: 1.161 ± 0.182
0.264GlnTrp: 0.264 ± 0.09
2.242GlnTyr: 2.242 ± 0.267
0.0GlnXaa: 0.0 ± 0.0
Arg
1.794ArgAla: 1.794 ± 0.207
0.739ArgCys: 0.739 ± 0.11
1.794ArgAsp: 1.794 ± 0.22
1.662ArgGlu: 1.662 ± 0.198
1.767ArgPhe: 1.767 ± 0.178
1.161ArgGly: 1.161 ± 0.187
0.976ArgHis: 0.976 ± 0.149
3.297ArgIle: 3.297 ± 0.372
3.35ArgLys: 3.35 ± 0.269
2.717ArgLeu: 2.717 ± 0.292
0.87ArgMet: 0.87 ± 0.176
3.64ArgAsn: 3.64 ± 0.246
1.741ArgPro: 1.741 ± 0.243
1.82ArgGln: 1.82 ± 0.209
2.295ArgArg: 2.295 ± 0.301
3.034ArgSer: 3.034 ± 0.431
1.715ArgThr: 1.715 ± 0.141
1.978ArgVal: 1.978 ± 0.201
0.264ArgTrp: 0.264 ± 0.087
1.662ArgTyr: 1.662 ± 0.24
0.0ArgXaa: 0.0 ± 0.0
Ser
3.165SerAla: 3.165 ± 0.247
1.609SerCys: 1.609 ± 0.243
4.062SerAsp: 4.062 ± 0.36
3.851SerGlu: 3.851 ± 0.339
3.113SerPhe: 3.113 ± 0.266
2.321SerGly: 2.321 ± 0.247
1.398SerHis: 1.398 ± 0.18
7.729SerIle: 7.729 ± 0.476
5.197SerLys: 5.197 ± 0.407
5.698SerLeu: 5.698 ± 0.432
2.691SerMet: 2.691 ± 0.259
7.307SerAsn: 7.307 ± 0.392
2.717SerPro: 2.717 ± 0.262
3.218SerGln: 3.218 ± 0.291
3.245SerArg: 3.245 ± 0.42
9.074SerSer: 9.074 ± 0.87
6.542SerThr: 6.542 ± 0.675
3.271SerVal: 3.271 ± 0.291
0.448SerTrp: 0.448 ± 0.089
3.007SerTyr: 3.007 ± 0.323
0.0SerXaa: 0.0 ± 0.0
Thr
2.875ThrAla: 2.875 ± 0.333
1.266ThrCys: 1.266 ± 0.181
3.165ThrAsp: 3.165 ± 0.316
2.743ThrGlu: 2.743 ± 0.269
2.928ThrPhe: 2.928 ± 0.314
2.453ThrGly: 2.453 ± 0.283
1.345ThrHis: 1.345 ± 0.171
7.069ThrIle: 7.069 ± 0.397
4.511ThrLys: 4.511 ± 0.296
5.724ThrLeu: 5.724 ± 0.325
1.873ThrMet: 1.873 ± 0.226
6.014ThrAsn: 6.014 ± 0.448
2.954ThrPro: 2.954 ± 0.334
2.295ThrGln: 2.295 ± 0.258
2.822ThrArg: 2.822 ± 0.247
6.12ThrSer: 6.12 ± 0.553
8.758ThrThr: 8.758 ± 1.725
3.034ThrVal: 3.034 ± 0.275
0.369ThrTrp: 0.369 ± 0.091
2.585ThrTyr: 2.585 ± 0.273
0.0ThrXaa: 0.0 ± 0.0
Val
2.453ValAla: 2.453 ± 0.279
0.87ValCys: 0.87 ± 0.124
3.693ValAsp: 3.693 ± 0.297
2.506ValGlu: 2.506 ± 0.249
1.846ValPhe: 1.846 ± 0.212
1.609ValGly: 1.609 ± 0.218
0.87ValHis: 0.87 ± 0.131
4.669ValIle: 4.669 ± 0.384
2.875ValLys: 2.875 ± 0.265
4.432ValLeu: 4.432 ± 0.417
1.187ValMet: 1.187 ± 0.166
3.139ValAsn: 3.139 ± 0.293
1.978ValPro: 1.978 ± 0.233
1.53ValGln: 1.53 ± 0.226
1.926ValArg: 1.926 ± 0.233
3.693ValSer: 3.693 ± 0.284
3.456ValThr: 3.456 ± 0.259
4.458ValVal: 4.458 ± 1.096
0.158ValTrp: 0.158 ± 0.067
2.427ValTyr: 2.427 ± 0.291
0.0ValXaa: 0.0 ± 0.0
Trp
0.343TrpAla: 0.343 ± 0.097
0.132TrpCys: 0.132 ± 0.049
0.264TrpAsp: 0.264 ± 0.083
0.211TrpGlu: 0.211 ± 0.072
0.528TrpPhe: 0.528 ± 0.113
0.211TrpGly: 0.211 ± 0.09
0.079TrpHis: 0.079 ± 0.045
0.369TrpIle: 0.369 ± 0.127
0.369TrpLys: 0.369 ± 0.1
0.607TrpLeu: 0.607 ± 0.152
0.106TrpMet: 0.106 ± 0.055
0.528TrpAsn: 0.528 ± 0.105
0.264TrpPro: 0.264 ± 0.082
0.185TrpGln: 0.185 ± 0.065
0.264TrpArg: 0.264 ± 0.095
0.422TrpSer: 0.422 ± 0.122
0.237TrpThr: 0.237 ± 0.072
0.211TrpVal: 0.211 ± 0.073
0.026TrpTrp: 0.026 ± 0.026
0.422TrpTyr: 0.422 ± 0.107
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.451TyrAla: 1.451 ± 0.211
0.87TyrCys: 0.87 ± 0.134
3.772TyrAsp: 3.772 ± 0.402
2.849TyrGlu: 2.849 ± 0.226
1.899TyrPhe: 1.899 ± 0.259
1.319TyrGly: 1.319 ± 0.191
0.633TyrHis: 0.633 ± 0.145
3.693TyrIle: 3.693 ± 0.352
4.009TyrLys: 4.009 ± 0.384
3.798TyrLeu: 3.798 ± 0.328
1.504TyrMet: 1.504 ± 0.189
4.643TyrAsn: 4.643 ± 0.326
1.978TyrPro: 1.978 ± 0.221
1.266TyrGln: 1.266 ± 0.199
1.82TyrArg: 1.82 ± 0.184
2.902TyrSer: 2.902 ± 0.292
3.456TyrThr: 3.456 ± 0.274
2.005TyrVal: 2.005 ± 0.212
0.264TyrTrp: 0.264 ± 0.076
2.427TyrTyr: 2.427 ± 0.283
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 106 proteins (37911 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski