Amino acid dipepetide frequency for Bombyx mandarina nucleopolyhedrovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.696AlaAla: 3.696 ± 0.42
1.119AlaCys: 1.119 ± 0.213
3.149AlaAsp: 3.149 ± 0.277
2.759AlaGlu: 2.759 ± 0.302
2.342AlaPhe: 2.342 ± 0.243
1.64AlaGly: 1.64 ± 0.227
1.275AlaHis: 1.275 ± 0.19
3.696AlaIle: 3.696 ± 0.341
3.201AlaLys: 3.201 ± 0.273
4.633AlaLeu: 4.633 ± 0.432
1.067AlaMet: 1.067 ± 0.159
3.696AlaAsn: 3.696 ± 0.336
2.733AlaPro: 2.733 ± 0.405
2.108AlaGln: 2.108 ± 0.216
2.212AlaArg: 2.212 ± 0.251
3.331AlaSer: 3.331 ± 0.326
2.941AlaThr: 2.941 ± 0.327
3.409AlaVal: 3.409 ± 0.325
0.286AlaTrp: 0.286 ± 0.101
2.655AlaTyr: 2.655 ± 0.294
0.0AlaXaa: 0.0 ± 0.0
Cys
1.275CysAla: 1.275 ± 0.161
0.677CysCys: 0.677 ± 0.165
1.301CysAsp: 1.301 ± 0.188
1.067CysGlu: 1.067 ± 0.178
1.119CysPhe: 1.119 ± 0.174
0.833CysGly: 0.833 ± 0.152
0.573CysHis: 0.573 ± 0.122
1.978CysIle: 1.978 ± 0.239
1.822CysLys: 1.822 ± 0.229
2.056CysLeu: 2.056 ± 0.264
0.416CysMet: 0.416 ± 0.116
2.16CysAsn: 2.16 ± 0.228
1.119CysPro: 1.119 ± 0.172
0.677CysGln: 0.677 ± 0.131
1.275CysArg: 1.275 ± 0.181
1.249CysSer: 1.249 ± 0.195
1.197CysThr: 1.197 ± 0.174
1.926CysVal: 1.926 ± 0.282
0.234CysTrp: 0.234 ± 0.094
0.963CysTyr: 0.963 ± 0.175
0.0CysXaa: 0.0 ± 0.0
Asp
3.826AspAla: 3.826 ± 0.375
1.509AspCys: 1.509 ± 0.234
5.674AspAsp: 5.674 ± 0.56
4.216AspGlu: 4.216 ± 0.406
2.603AspPhe: 2.603 ± 0.255
2.42AspGly: 2.42 ± 0.26
1.015AspHis: 1.015 ± 0.154
3.175AspIle: 3.175 ± 0.281
4.06AspLys: 4.06 ± 0.33
5.101AspLeu: 5.101 ± 0.394
1.952AspMet: 1.952 ± 0.218
4.893AspAsn: 4.893 ± 0.42
1.848AspPro: 1.848 ± 0.183
1.64AspGln: 1.64 ± 0.246
2.394AspArg: 2.394 ± 0.22
3.435AspSer: 3.435 ± 0.37
3.67AspThr: 3.67 ± 0.339
3.748AspVal: 3.748 ± 0.357
0.599AspTrp: 0.599 ± 0.147
2.941AspTyr: 2.941 ± 0.253
0.0AspXaa: 0.0 ± 0.0
Glu
2.446GluAla: 2.446 ± 0.265
1.353GluCys: 1.353 ± 0.23
3.149GluAsp: 3.149 ± 0.347
2.811GluGlu: 2.811 ± 0.346
2.889GluPhe: 2.889 ± 0.229
1.483GluGly: 1.483 ± 0.219
1.457GluHis: 1.457 ± 0.169
3.826GluIle: 3.826 ± 0.354
3.201GluLys: 3.201 ± 0.3
5.153GluLeu: 5.153 ± 0.45
1.796GluMet: 1.796 ± 0.186
4.606GluAsn: 4.606 ± 0.386
1.562GluPro: 1.562 ± 0.205
2.316GluGln: 2.316 ± 0.243
2.524GluArg: 2.524 ± 0.239
4.06GluSer: 4.06 ± 0.346
3.565GluThr: 3.565 ± 0.293
1.952GluVal: 1.952 ± 0.24
0.468GluTrp: 0.468 ± 0.107
2.655GluTyr: 2.655 ± 0.294
0.0GluXaa: 0.0 ± 0.0
Phe
2.55PheAla: 2.55 ± 0.222
1.483PheCys: 1.483 ± 0.185
4.112PheAsp: 4.112 ± 0.354
3.644PheGlu: 3.644 ± 0.3
1.796PhePhe: 1.796 ± 0.227
1.718PheGly: 1.718 ± 0.22
0.833PheHis: 0.833 ± 0.143
3.227PheIle: 3.227 ± 0.305
4.034PheLys: 4.034 ± 0.323
4.138PheLeu: 4.138 ± 0.347
1.405PheMet: 1.405 ± 0.177
4.45PheAsn: 4.45 ± 0.367
1.431PhePro: 1.431 ± 0.202
1.223PheGln: 1.223 ± 0.198
1.614PheArg: 1.614 ± 0.162
2.785PheSer: 2.785 ± 0.29
2.29PheThr: 2.29 ± 0.243
4.164PheVal: 4.164 ± 0.345
0.208PheTrp: 0.208 ± 0.077
2.264PheTyr: 2.264 ± 0.215
0.0PheXaa: 0.0 ± 0.0
Gly
1.666GlyAla: 1.666 ± 0.229
0.755GlyCys: 0.755 ± 0.147
2.134GlyAsp: 2.134 ± 0.291
2.03GlyGlu: 2.03 ± 0.235
1.666GlyPhe: 1.666 ± 0.245
2.577GlyGly: 2.577 ± 0.528
0.833GlyHis: 0.833 ± 0.132
1.692GlyIle: 1.692 ± 0.201
1.926GlyLys: 1.926 ± 0.233
2.446GlyLeu: 2.446 ± 0.252
0.651GlyMet: 0.651 ± 0.12
2.42GlyAsn: 2.42 ± 0.228
0.963GlyPro: 0.963 ± 0.156
1.197GlyGln: 1.197 ± 0.166
1.952GlyArg: 1.952 ± 0.266
1.692GlySer: 1.692 ± 0.176
1.9GlyThr: 1.9 ± 0.249
2.837GlyVal: 2.837 ± 0.309
0.312GlyTrp: 0.312 ± 0.092
1.588GlyTyr: 1.588 ± 0.188
0.0GlyXaa: 0.0 ± 0.0
His
1.041HisAla: 1.041 ± 0.186
0.234HisCys: 0.234 ± 0.072
1.171HisAsp: 1.171 ± 0.172
1.145HisGlu: 1.145 ± 0.216
1.249HisPhe: 1.249 ± 0.162
0.677HisGly: 0.677 ± 0.167
0.677HisHis: 0.677 ± 0.148
1.457HisIle: 1.457 ± 0.194
1.535HisLys: 1.535 ± 0.182
2.004HisLeu: 2.004 ± 0.275
0.599HisMet: 0.599 ± 0.124
2.004HisAsn: 2.004 ± 0.252
1.067HisPro: 1.067 ± 0.19
0.521HisGln: 0.521 ± 0.127
0.937HisArg: 0.937 ± 0.167
1.353HisSer: 1.353 ± 0.172
1.353HisThr: 1.353 ± 0.177
1.692HisVal: 1.692 ± 0.245
0.234HisTrp: 0.234 ± 0.104
1.145HisTyr: 1.145 ± 0.152
0.0HisXaa: 0.0 ± 0.0
Ile
3.201IleAla: 3.201 ± 0.288
1.353IleCys: 1.353 ± 0.215
4.659IleAsp: 4.659 ± 0.321
3.956IleGlu: 3.956 ± 0.328
3.279IlePhe: 3.279 ± 0.318
1.614IleGly: 1.614 ± 0.224
1.119IleHis: 1.119 ± 0.152
4.893IleIle: 4.893 ± 0.388
6.35IleLys: 6.35 ± 0.429
5.361IleLeu: 5.361 ± 0.39
2.186IleMet: 2.186 ± 0.284
6.064IleAsn: 6.064 ± 0.353
1.926IlePro: 1.926 ± 0.231
2.134IleGln: 2.134 ± 0.218
2.446IleArg: 2.446 ± 0.291
3.513IleSer: 3.513 ± 0.365
3.644IleThr: 3.644 ± 0.293
5.205IleVal: 5.205 ± 0.362
0.286IleTrp: 0.286 ± 0.077
2.577IleTyr: 2.577 ± 0.276
0.0IleXaa: 0.0 ± 0.0
Lys
2.186LysAla: 2.186 ± 0.235
1.952LysCys: 1.952 ± 0.26
2.733LysAsp: 2.733 ± 0.314
3.071LysGlu: 3.071 ± 0.292
3.748LysPhe: 3.748 ± 0.288
1.535LysGly: 1.535 ± 0.194
2.342LysHis: 2.342 ± 0.272
5.986LysIle: 5.986 ± 0.433
5.153LysLys: 5.153 ± 0.501
7.704LysLeu: 7.704 ± 0.464
2.577LysMet: 2.577 ± 0.255
6.402LysAsn: 6.402 ± 0.397
2.603LysPro: 2.603 ± 0.257
2.967LysGln: 2.967 ± 0.292
4.346LysArg: 4.346 ± 0.393
4.476LysSer: 4.476 ± 0.394
4.346LysThr: 4.346 ± 0.374
3.253LysVal: 3.253 ± 0.298
0.468LysTrp: 0.468 ± 0.115
4.112LysTyr: 4.112 ± 0.365
0.0LysXaa: 0.0 ± 0.0
Leu
4.19LeuAla: 4.19 ± 0.313
2.29LeuCys: 2.29 ± 0.234
4.528LeuAsp: 4.528 ± 0.3
5.075LeuGlu: 5.075 ± 0.453
4.893LeuPhe: 4.893 ± 0.344
2.681LeuGly: 2.681 ± 0.258
1.744LeuHis: 1.744 ± 0.211
6.845LeuIle: 6.845 ± 0.472
7.756LeuLys: 7.756 ± 0.43
9.213LeuLeu: 9.213 ± 0.532
2.603LeuMet: 2.603 ± 0.283
8.302LeuAsn: 8.302 ± 0.503
3.305LeuPro: 3.305 ± 0.335
5.023LeuGln: 5.023 ± 0.368
3.956LeuArg: 3.956 ± 0.335
5.361LeuSer: 5.361 ± 0.392
4.971LeuThr: 4.971 ± 0.368
5.101LeuVal: 5.101 ± 0.352
0.625LeuTrp: 0.625 ± 0.13
4.398LeuTyr: 4.398 ± 0.311
0.0LeuXaa: 0.0 ± 0.0
Met
1.666MetAla: 1.666 ± 0.188
1.067MetCys: 1.067 ± 0.189
1.119MetAsp: 1.119 ± 0.153
1.041MetGlu: 1.041 ± 0.179
1.535MetPhe: 1.535 ± 0.207
0.885MetGly: 0.885 ± 0.173
0.703MetHis: 0.703 ± 0.145
1.666MetIle: 1.666 ± 0.175
1.353MetLys: 1.353 ± 0.182
3.435MetLeu: 3.435 ± 0.248
0.573MetMet: 0.573 ± 0.133
2.108MetAsn: 2.108 ± 0.237
1.119MetPro: 1.119 ± 0.182
1.197MetGln: 1.197 ± 0.231
1.483MetArg: 1.483 ± 0.22
2.056MetSer: 2.056 ± 0.197
1.614MetThr: 1.614 ± 0.177
1.171MetVal: 1.171 ± 0.152
0.312MetTrp: 0.312 ± 0.083
1.64MetTyr: 1.64 ± 0.193
0.0MetXaa: 0.0 ± 0.0
Asn
5.101AsnAla: 5.101 ± 0.369
1.848AsnCys: 1.848 ± 0.245
4.711AsnAsp: 4.711 ± 0.457
4.58AsnGlu: 4.58 ± 0.342
4.216AsnPhe: 4.216 ± 0.375
3.279AsnGly: 3.279 ± 0.33
1.327AsnHis: 1.327 ± 0.211
4.815AsnIle: 4.815 ± 0.382
6.402AsnLys: 6.402 ± 0.504
6.793AsnLeu: 6.793 ± 0.408
2.108AsnMet: 2.108 ± 0.198
7.131AsnAsn: 7.131 ± 0.546
2.004AsnPro: 2.004 ± 0.262
2.056AsnGln: 2.056 ± 0.237
4.034AsnArg: 4.034 ± 0.351
5.439AsnSer: 5.439 ± 0.364
4.711AsnThr: 4.711 ± 0.401
6.767AsnVal: 6.767 ± 0.452
0.521AsnTrp: 0.521 ± 0.118
4.502AsnTyr: 4.502 ± 0.359
0.0AsnXaa: 0.0 ± 0.0
Pro
2.785ProAla: 2.785 ± 0.4
0.599ProCys: 0.599 ± 0.133
2.394ProAsp: 2.394 ± 0.219
1.588ProGlu: 1.588 ± 0.23
1.77ProPhe: 1.77 ± 0.198
1.275ProGly: 1.275 ± 0.194
0.885ProHis: 0.885 ± 0.157
1.874ProIle: 1.874 ± 0.221
1.978ProLys: 1.978 ± 0.239
3.722ProLeu: 3.722 ± 0.306
0.651ProMet: 0.651 ± 0.124
2.681ProAsn: 2.681 ± 0.236
3.8ProPro: 3.8 ± 1.012
1.353ProGln: 1.353 ± 0.188
1.822ProArg: 1.822 ± 0.27
2.577ProSer: 2.577 ± 0.329
2.472ProThr: 2.472 ± 0.279
2.212ProVal: 2.212 ± 0.256
0.286ProTrp: 0.286 ± 0.1
1.718ProTyr: 1.718 ± 0.218
0.0ProXaa: 0.0 ± 0.0
Gln
1.483GlnAla: 1.483 ± 0.237
1.015GlnCys: 1.015 ± 0.178
1.405GlnAsp: 1.405 ± 0.209
2.082GlnGlu: 2.082 ± 0.24
1.978GlnPhe: 1.978 ± 0.205
0.729GlnGly: 0.729 ± 0.125
1.015GlnHis: 1.015 ± 0.146
2.707GlnIle: 2.707 ± 0.252
2.733GlnLys: 2.733 ± 0.319
3.904GlnLeu: 3.904 ± 0.329
0.963GlnMet: 0.963 ± 0.152
2.681GlnAsn: 2.681 ± 0.264
1.588GlnPro: 1.588 ± 0.23
2.316GlnGln: 2.316 ± 0.296
1.848GlnArg: 1.848 ± 0.244
2.655GlnSer: 2.655 ± 0.277
2.264GlnThr: 2.264 ± 0.241
1.796GlnVal: 1.796 ± 0.272
0.364GlnTrp: 0.364 ± 0.1
1.718GlnTyr: 1.718 ± 0.209
0.0GlnXaa: 0.0 ± 0.0
Arg
2.16ArgAla: 2.16 ± 0.225
1.145ArgCys: 1.145 ± 0.186
2.993ArgAsp: 2.993 ± 0.285
2.264ArgGlu: 2.264 ± 0.197
2.212ArgPhe: 2.212 ± 0.244
1.535ArgGly: 1.535 ± 0.216
1.327ArgHis: 1.327 ± 0.214
3.357ArgIle: 3.357 ± 0.329
2.915ArgLys: 2.915 ± 0.304
4.45ArgLeu: 4.45 ± 0.35
1.093ArgMet: 1.093 ± 0.154
3.305ArgAsn: 3.305 ± 0.331
2.082ArgPro: 2.082 ± 0.259
2.186ArgGln: 2.186 ± 0.299
3.696ArgArg: 3.696 ± 0.609
3.175ArgSer: 3.175 ± 0.469
2.108ArgThr: 2.108 ± 0.239
2.863ArgVal: 2.863 ± 0.246
0.494ArgTrp: 0.494 ± 0.132
1.848ArgTyr: 1.848 ± 0.217
0.0ArgXaa: 0.0 ± 0.0
Ser
3.227SerAla: 3.227 ± 0.324
1.379SerCys: 1.379 ± 0.217
4.502SerAsp: 4.502 ± 0.366
3.305SerGlu: 3.305 ± 0.296
2.915SerPhe: 2.915 ± 0.237
2.29SerGly: 2.29 ± 0.245
1.093SerHis: 1.093 ± 0.155
3.982SerIle: 3.982 ± 0.324
3.878SerLys: 3.878 ± 0.33
5.96SerLeu: 5.96 ± 0.388
1.483SerMet: 1.483 ± 0.193
4.841SerAsn: 4.841 ± 0.391
2.342SerPro: 2.342 ± 0.283
1.978SerGln: 1.978 ± 0.383
2.629SerArg: 2.629 ± 0.378
5.127SerSer: 5.127 ± 0.567
3.8SerThr: 3.8 ± 0.336
5.205SerVal: 5.205 ± 0.384
0.364SerTrp: 0.364 ± 0.093
2.004SerTyr: 2.004 ± 0.272
0.0SerXaa: 0.0 ± 0.0
Thr
3.435ThrAla: 3.435 ± 0.297
1.067ThrCys: 1.067 ± 0.15
3.565ThrAsp: 3.565 ± 0.343
2.811ThrGlu: 2.811 ± 0.25
3.045ThrPhe: 3.045 ± 0.289
2.342ThrGly: 2.342 ± 0.313
1.171ThrHis: 1.171 ± 0.188
3.67ThrIle: 3.67 ± 0.299
3.253ThrLys: 3.253 ± 0.294
5.491ThrLeu: 5.491 ± 0.368
1.509ThrMet: 1.509 ± 0.234
4.32ThrAsn: 4.32 ± 0.396
2.681ThrPro: 2.681 ± 0.297
1.9ThrGln: 1.9 ± 0.222
3.097ThrArg: 3.097 ± 0.264
3.592ThrSer: 3.592 ± 0.343
4.606ThrThr: 4.606 ± 0.47
3.774ThrVal: 3.774 ± 0.427
0.416ThrTrp: 0.416 ± 0.118
2.446ThrTyr: 2.446 ± 0.249
0.0ThrXaa: 0.0 ± 0.0
Val
3.331ValAla: 3.331 ± 0.242
1.926ValCys: 1.926 ± 0.237
4.528ValAsp: 4.528 ± 0.409
3.175ValGlu: 3.175 ± 0.318
3.461ValPhe: 3.461 ± 0.299
1.926ValGly: 1.926 ± 0.233
1.379ValHis: 1.379 ± 0.22
3.956ValIle: 3.956 ± 0.344
5.205ValLys: 5.205 ± 0.423
6.454ValLeu: 6.454 ± 0.437
2.004ValMet: 2.004 ± 0.206
4.971ValAsn: 4.971 ± 0.382
2.811ValPro: 2.811 ± 0.296
2.811ValGln: 2.811 ± 0.297
2.629ValArg: 2.629 ± 0.259
3.461ValSer: 3.461 ± 0.347
3.279ValThr: 3.279 ± 0.279
4.659ValVal: 4.659 ± 0.418
0.468ValTrp: 0.468 ± 0.139
3.592ValTyr: 3.592 ± 0.337
0.0ValXaa: 0.0 ± 0.0
Trp
0.208TrpAla: 0.208 ± 0.09
0.13TrpCys: 0.13 ± 0.054
0.39TrpAsp: 0.39 ± 0.104
0.442TrpGlu: 0.442 ± 0.13
0.312TrpPhe: 0.312 ± 0.086
0.234TrpGly: 0.234 ± 0.088
0.208TrpHis: 0.208 ± 0.074
0.39TrpIle: 0.39 ± 0.101
0.703TrpLys: 0.703 ± 0.169
0.703TrpLeu: 0.703 ± 0.114
0.234TrpMet: 0.234 ± 0.078
0.781TrpAsn: 0.781 ± 0.167
0.364TrpPro: 0.364 ± 0.095
0.234TrpGln: 0.234 ± 0.085
0.416TrpArg: 0.416 ± 0.104
0.416TrpSer: 0.416 ± 0.111
0.599TrpThr: 0.599 ± 0.129
0.364TrpVal: 0.364 ± 0.094
0.104TrpTrp: 0.104 ± 0.054
0.234TrpTyr: 0.234 ± 0.069
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.472TyrAla: 2.472 ± 0.271
1.015TyrCys: 1.015 ± 0.137
2.498TyrAsp: 2.498 ± 0.299
2.212TyrGlu: 2.212 ± 0.252
2.342TyrPhe: 2.342 ± 0.227
1.562TyrGly: 1.562 ± 0.172
1.119TyrHis: 1.119 ± 0.166
2.577TyrIle: 2.577 ± 0.235
4.502TyrLys: 4.502 ± 0.386
4.138TyrLeu: 4.138 ± 0.369
1.848TyrMet: 1.848 ± 0.185
4.528TyrAsn: 4.528 ± 0.388
1.093TyrPro: 1.093 ± 0.181
1.405TyrGln: 1.405 ± 0.186
1.9TyrArg: 1.9 ± 0.246
2.55TyrSer: 2.55 ± 0.236
2.863TyrThr: 2.863 ± 0.284
3.956TyrVal: 3.956 ± 0.269
0.416TyrTrp: 0.416 ± 0.095
2.889TyrTyr: 2.889 ± 0.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 141 proteins (38425 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski