Amino acid dipepetide frequency for Invertebrate iridescent virus 30

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.099AlaAla: 2.099 ± 0.359
0.597AlaCys: 0.597 ± 0.094
1.365AlaAsp: 1.365 ± 0.147
1.536AlaGlu: 1.536 ± 0.183
1.621AlaPhe: 1.621 ± 0.179
1.963AlaGly: 1.963 ± 0.35
0.444AlaHis: 0.444 ± 0.087
3.158AlaIle: 3.158 ± 0.3
2.885AlaLys: 2.885 ± 0.259
3.106AlaLeu: 3.106 ± 0.312
0.683AlaMet: 0.683 ± 0.12
2.731AlaAsn: 2.731 ± 0.39
2.611AlaPro: 2.611 ± 0.425
1.741AlaGln: 1.741 ± 0.183
1.075AlaArg: 1.075 ± 0.139
3.021AlaSer: 3.021 ± 0.345
2.919AlaThr: 2.919 ± 0.548
2.253AlaVal: 2.253 ± 0.337
0.171AlaTrp: 0.171 ± 0.064
1.536AlaTyr: 1.536 ± 0.203
0.0AlaXaa: 0.0 ± 0.0
Cys
0.649CysAla: 0.649 ± 0.103
0.546CysCys: 0.546 ± 0.133
0.939CysAsp: 0.939 ± 0.14
1.212CysGlu: 1.212 ± 0.155
0.734CysPhe: 0.734 ± 0.121
1.127CysGly: 1.127 ± 0.167
0.393CysHis: 0.393 ± 0.1
1.468CysIle: 1.468 ± 0.18
1.417CysLys: 1.417 ± 0.214
1.502CysLeu: 1.502 ± 0.193
0.358CysMet: 0.358 ± 0.071
1.263CysAsn: 1.263 ± 0.161
0.683CysPro: 0.683 ± 0.105
0.717CysGln: 0.717 ± 0.113
0.683CysArg: 0.683 ± 0.123
0.888CysSer: 0.888 ± 0.127
1.075CysThr: 1.075 ± 0.155
1.024CysVal: 1.024 ± 0.141
0.137CysTrp: 0.137 ± 0.061
0.58CysTyr: 0.58 ± 0.105
0.0CysXaa: 0.0 ± 0.0
Asp
2.134AspAla: 2.134 ± 0.329
0.87AspCys: 0.87 ± 0.127
3.397AspAsp: 3.397 ± 0.33
3.738AspGlu: 3.738 ± 0.335
2.765AspPhe: 2.765 ± 0.211
2.253AspGly: 2.253 ± 0.23
1.007AspHis: 1.007 ± 0.16
4.591AspIle: 4.591 ± 0.448
4.387AspLys: 4.387 ± 0.339
6.554AspLeu: 6.554 ± 0.349
0.785AspMet: 0.785 ± 0.121
3.294AspAsn: 3.294 ± 0.23
2.202AspPro: 2.202 ± 0.193
1.604AspGln: 1.604 ± 0.17
1.468AspArg: 1.468 ± 0.175
3.226AspSer: 3.226 ± 0.463
2.867AspThr: 2.867 ± 0.233
3.192AspVal: 3.192 ± 0.231
0.461AspTrp: 0.461 ± 0.089
3.004AspTyr: 3.004 ± 0.295
0.0AspXaa: 0.0 ± 0.0
Glu
1.621GluAla: 1.621 ± 0.167
1.161GluCys: 1.161 ± 0.156
3.875GluAsp: 3.875 ± 0.294
5.513GluGlu: 5.513 ± 0.494
2.953GluPhe: 2.953 ± 0.243
2.321GluGly: 2.321 ± 0.252
0.717GluHis: 0.717 ± 0.138
5.308GluIle: 5.308 ± 0.411
5.53GluLys: 5.53 ± 0.433
5.735GluLeu: 5.735 ± 0.403
1.348GluMet: 1.348 ± 0.18
4.762GluAsn: 4.762 ± 0.347
1.912GluPro: 1.912 ± 0.198
2.116GluGln: 2.116 ± 0.264
2.151GluArg: 2.151 ± 0.202
3.653GluSer: 3.653 ± 0.282
3.38GluThr: 3.38 ± 0.567
3.397GluVal: 3.397 ± 0.284
0.922GluTrp: 0.922 ± 0.118
2.56GluTyr: 2.56 ± 0.234
0.0GluXaa: 0.0 ± 0.0
Phe
1.348PheAla: 1.348 ± 0.175
0.751PheCys: 0.751 ± 0.121
3.294PheAsp: 3.294 ± 0.249
2.919PheGlu: 2.919 ± 0.284
2.014PhePhe: 2.014 ± 0.247
2.526PheGly: 2.526 ± 0.243
0.7PheHis: 0.7 ± 0.121
4.404PheIle: 4.404 ± 0.417
5.496PheLys: 5.496 ± 0.388
4.284PheLeu: 4.284 ± 0.288
1.4PheMet: 1.4 ± 0.179
3.789PheAsn: 3.789 ± 0.264
1.417PhePro: 1.417 ± 0.15
1.468PheGln: 1.468 ± 0.149
1.28PheArg: 1.28 ± 0.159
2.594PheSer: 2.594 ± 0.231
2.39PheThr: 2.39 ± 0.202
2.663PheVal: 2.663 ± 0.234
0.376PheTrp: 0.376 ± 0.079
1.792PheTyr: 1.792 ± 0.218
0.0PheXaa: 0.0 ± 0.0
Gly
1.963GlyAla: 1.963 ± 0.295
0.683GlyCys: 0.683 ± 0.09
2.56GlyAsp: 2.56 ± 0.279
2.441GlyGlu: 2.441 ± 0.205
1.878GlyPhe: 1.878 ± 0.199
4.199GlyGly: 4.199 ± 0.66
0.563GlyHis: 0.563 ± 0.084
3.943GlyIle: 3.943 ± 0.298
2.867GlyLys: 2.867 ± 0.265
4.062GlyLeu: 4.062 ± 0.279
0.7GlyMet: 0.7 ± 0.151
2.782GlyAsn: 2.782 ± 0.33
1.639GlyPro: 1.639 ± 0.218
1.69GlyGln: 1.69 ± 0.177
1.348GlyArg: 1.348 ± 0.148
4.591GlySer: 4.591 ± 0.647
4.165GlyThr: 4.165 ± 0.654
3.345GlyVal: 3.345 ± 0.311
0.512GlyTrp: 0.512 ± 0.097
2.492GlyTyr: 2.492 ± 0.22
0.0GlyXaa: 0.0 ± 0.0
His
0.512HisAla: 0.512 ± 0.101
0.324HisCys: 0.324 ± 0.082
0.495HisAsp: 0.495 ± 0.095
0.87HisGlu: 0.87 ± 0.131
1.024HisPhe: 1.024 ± 0.15
0.58HisGly: 0.58 ± 0.106
0.666HisHis: 0.666 ± 0.153
1.536HisIle: 1.536 ± 0.169
1.161HisLys: 1.161 ± 0.137
1.775HisLeu: 1.775 ± 0.191
0.154HisMet: 0.154 ± 0.045
1.058HisAsn: 1.058 ± 0.153
0.939HisPro: 0.939 ± 0.263
0.802HisGln: 0.802 ± 0.137
0.751HisArg: 0.751 ± 0.118
0.99HisSer: 0.99 ± 0.131
1.109HisThr: 1.109 ± 0.142
0.529HisVal: 0.529 ± 0.101
0.137HisTrp: 0.137 ± 0.049
0.836HisTyr: 0.836 ± 0.148
0.0HisXaa: 0.0 ± 0.0
Ile
2.816IleAla: 2.816 ± 0.211
1.297IleCys: 1.297 ± 0.155
4.66IleAsp: 4.66 ± 0.319
5.223IleGlu: 5.223 ± 0.399
4.352IlePhe: 4.352 ± 0.37
3.26IleGly: 3.26 ± 0.212
1.417IleHis: 1.417 ± 0.201
5.974IleIle: 5.974 ± 0.366
8.227IleLys: 8.227 ± 0.523
7.561IleLeu: 7.561 ± 0.438
2.134IleMet: 2.134 ± 0.219
6.52IleAsn: 6.52 ± 0.43
2.987IlePro: 2.987 ± 0.247
3.55IleGln: 3.55 ± 0.27
2.765IleArg: 2.765 ± 0.293
6.401IleSer: 6.401 ± 0.457
5.496IleThr: 5.496 ± 0.512
4.165IleVal: 4.165 ± 0.313
0.341IleTrp: 0.341 ± 0.076
2.646IleTyr: 2.646 ± 0.224
0.0IleXaa: 0.0 ± 0.0
Lys
3.141LysAla: 3.141 ± 0.317
1.741LysCys: 1.741 ± 0.21
5.172LysAsp: 5.172 ± 0.474
5.94LysGlu: 5.94 ± 0.493
4.608LysPhe: 4.608 ± 0.361
3.67LysGly: 3.67 ± 0.385
1.485LysHis: 1.485 ± 0.148
8.978LysIle: 8.978 ± 0.585
9.985LysLys: 9.985 ± 0.736
8.329LysLeu: 8.329 ± 0.5
2.253LysMet: 2.253 ± 0.222
7.203LysAsn: 7.203 ± 0.416
3.175LysPro: 3.175 ± 0.3
3.106LysGln: 3.106 ± 0.249
3.055LysArg: 3.055 ± 0.298
5.786LysSer: 5.786 ± 0.49
4.574LysThr: 4.574 ± 0.289
5.735LysVal: 5.735 ± 0.355
1.024LysTrp: 1.024 ± 0.142
4.438LysTyr: 4.438 ± 0.436
0.0LysXaa: 0.0 ± 0.0
Leu
3.96LeuAla: 3.96 ± 0.667
1.229LeuCys: 1.229 ± 0.185
4.455LeuAsp: 4.455 ± 0.358
6.23LeuGlu: 6.23 ± 0.522
3.55LeuPhe: 3.55 ± 0.317
4.011LeuGly: 4.011 ± 0.295
1.587LeuHis: 1.587 ± 0.197
7.322LeuIle: 7.322 ± 0.388
11.811LeuLys: 11.811 ± 0.583
8.261LeuLeu: 8.261 ± 0.551
1.741LeuMet: 1.741 ± 0.195
7.664LeuAsn: 7.664 ± 0.445
4.062LeuPro: 4.062 ± 0.234
3.636LeuGln: 3.636 ± 0.329
2.577LeuArg: 2.577 ± 0.234
7.408LeuSer: 7.408 ± 0.815
6.23LeuThr: 6.23 ± 0.381
4.318LeuVal: 4.318 ± 0.283
0.853LeuTrp: 0.853 ± 0.122
3.943LeuTyr: 3.943 ± 0.301
0.0LeuXaa: 0.0 ± 0.0
Met
1.383MetAla: 1.383 ± 0.147
0.597MetCys: 0.597 ± 0.098
1.195MetAsp: 1.195 ± 0.159
1.604MetGlu: 1.604 ± 0.159
0.87MetPhe: 0.87 ± 0.122
1.127MetGly: 1.127 ± 0.18
0.119MetHis: 0.119 ± 0.043
1.075MetIle: 1.075 ± 0.159
1.485MetLys: 1.485 ± 0.195
1.331MetLeu: 1.331 ± 0.143
0.632MetMet: 0.632 ± 0.111
1.058MetAsn: 1.058 ± 0.136
0.393MetPro: 0.393 ± 0.083
0.597MetGln: 0.597 ± 0.119
0.614MetArg: 0.614 ± 0.114
2.065MetSer: 2.065 ± 0.365
0.99MetThr: 0.99 ± 0.125
1.792MetVal: 1.792 ± 0.208
0.188MetTrp: 0.188 ± 0.056
0.973MetTyr: 0.973 ± 0.118
0.0MetXaa: 0.0 ± 0.0
Asn
2.372AsnAla: 2.372 ± 0.268
1.434AsnCys: 1.434 ± 0.163
3.311AsnAsp: 3.311 ± 0.237
3.772AsnGlu: 3.772 ± 0.282
4.489AsnPhe: 4.489 ± 0.3
3.465AsnGly: 3.465 ± 0.304
1.724AsnHis: 1.724 ± 0.197
6.947AsnIle: 6.947 ± 0.39
7.305AsnLys: 7.305 ± 0.358
8.159AsnLeu: 8.159 ± 0.463
1.434AsnMet: 1.434 ± 0.142
5.223AsnAsn: 5.223 ± 0.321
2.936AsnPro: 2.936 ± 0.219
2.287AsnGln: 2.287 ± 0.164
1.741AsnArg: 1.741 ± 0.216
4.387AsnSer: 4.387 ± 0.354
4.847AsnThr: 4.847 ± 0.747
3.431AsnVal: 3.431 ± 0.251
0.427AsnTrp: 0.427 ± 0.083
2.987AsnTyr: 2.987 ± 0.286
0.0AsnXaa: 0.0 ± 0.0
Pro
1.434ProAla: 1.434 ± 0.251
0.597ProCys: 0.597 ± 0.113
2.134ProAsp: 2.134 ± 0.18
2.134ProGlu: 2.134 ± 0.214
1.86ProPhe: 1.86 ± 0.196
1.656ProGly: 1.656 ± 0.301
0.853ProHis: 0.853 ± 0.223
2.936ProIle: 2.936 ± 0.258
3.328ProLys: 3.328 ± 0.352
4.694ProLeu: 4.694 ± 0.437
0.58ProMet: 0.58 ± 0.12
2.885ProAsn: 2.885 ± 0.239
2.526ProPro: 2.526 ± 0.51
1.929ProGln: 1.929 ± 0.303
1.161ProArg: 1.161 ± 0.23
4.182ProSer: 4.182 ± 0.999
2.885ProThr: 2.885 ± 0.376
2.782ProVal: 2.782 ± 0.455
0.188ProTrp: 0.188 ± 0.056
1.246ProTyr: 1.246 ± 0.118
0.0ProXaa: 0.0 ± 0.0
Gln
1.297GlnAla: 1.297 ± 0.18
0.597GlnCys: 0.597 ± 0.108
1.724GlnAsp: 1.724 ± 0.169
2.577GlnGlu: 2.577 ± 0.247
1.741GlnPhe: 1.741 ± 0.177
1.195GlnGly: 1.195 ± 0.149
0.768GlnHis: 0.768 ± 0.121
3.345GlnIle: 3.345 ± 0.216
4.011GlnLys: 4.011 ± 0.284
4.421GlnLeu: 4.421 ± 0.336
0.666GlnMet: 0.666 ± 0.107
3.004GlnAsn: 3.004 ± 0.34
2.048GlnPro: 2.048 ± 0.343
1.707GlnGln: 1.707 ± 0.189
1.417GlnArg: 1.417 ± 0.173
2.338GlnSer: 2.338 ± 0.219
2.134GlnThr: 2.134 ± 0.188
1.878GlnVal: 1.878 ± 0.204
0.273GlnTrp: 0.273 ± 0.069
1.604GlnTyr: 1.604 ± 0.2
0.0GlnXaa: 0.0 ± 0.0
Arg
1.127ArgAla: 1.127 ± 0.179
0.7ArgCys: 0.7 ± 0.125
1.502ArgAsp: 1.502 ± 0.168
2.27ArgGlu: 2.27 ± 0.235
1.468ArgPhe: 1.468 ± 0.174
1.007ArgGly: 1.007 ± 0.128
0.239ArgHis: 0.239 ± 0.074
2.099ArgIle: 2.099 ± 0.224
3.362ArgLys: 3.362 ± 0.286
2.509ArgLeu: 2.509 ± 0.238
0.495ArgMet: 0.495 ± 0.094
2.031ArgAsn: 2.031 ± 0.191
1.673ArgPro: 1.673 ± 0.314
1.348ArgGln: 1.348 ± 0.167
1.417ArgArg: 1.417 ± 0.25
2.543ArgSer: 2.543 ± 0.448
1.348ArgThr: 1.348 ± 0.196
1.741ArgVal: 1.741 ± 0.168
0.307ArgTrp: 0.307 ± 0.081
1.348ArgTyr: 1.348 ± 0.163
0.0ArgXaa: 0.0 ± 0.0
Ser
2.919SerAla: 2.919 ± 0.457
1.058SerCys: 1.058 ± 0.147
4.062SerAsp: 4.062 ± 0.629
3.636SerGlu: 3.636 ± 0.313
2.936SerPhe: 2.936 ± 0.209
5.513SerGly: 5.513 ± 1.071
1.058SerHis: 1.058 ± 0.124
5.325SerIle: 5.325 ± 0.404
6.298SerLys: 6.298 ± 0.404
6.605SerLeu: 6.605 ± 0.343
1.314SerMet: 1.314 ± 0.191
5.325SerAsn: 5.325 ± 0.43
3.277SerPro: 3.277 ± 0.569
3.243SerGln: 3.243 ± 0.278
2.116SerArg: 2.116 ± 0.307
6.776SerSer: 6.776 ± 0.886
5.24SerThr: 5.24 ± 0.574
4.267SerVal: 4.267 ± 0.607
0.444SerTrp: 0.444 ± 0.083
2.099SerTyr: 2.099 ± 0.216
0.0SerXaa: 0.0 ± 0.0
Thr
2.099ThrAla: 2.099 ± 0.395
1.195ThrCys: 1.195 ± 0.177
3.328ThrAsp: 3.328 ± 0.296
3.601ThrGlu: 3.601 ± 0.633
3.192ThrPhe: 3.192 ± 0.2
3.55ThrGly: 3.55 ± 0.491
0.973ThrHis: 0.973 ± 0.119
5.513ThrIle: 5.513 ± 0.339
4.899ThrLys: 4.899 ± 0.325
6.623ThrLeu: 6.623 ± 0.476
1.365ThrMet: 1.365 ± 0.204
5.018ThrAsn: 5.018 ± 0.615
2.56ThrPro: 2.56 ± 0.348
2.833ThrGln: 2.833 ± 0.319
2.031ThrArg: 2.031 ± 0.215
5.359ThrSer: 5.359 ± 0.649
6.52ThrThr: 6.52 ± 1.095
2.611ThrVal: 2.611 ± 0.212
0.358ThrTrp: 0.358 ± 0.092
1.792ThrTyr: 1.792 ± 0.157
0.0ThrXaa: 0.0 ± 0.0
Val
2.902ValAla: 2.902 ± 0.378
1.007ValCys: 1.007 ± 0.125
3.533ValAsp: 3.533 ± 0.264
3.857ValGlu: 3.857 ± 0.268
1.86ValPhe: 1.86 ± 0.175
2.56ValGly: 2.56 ± 0.21
0.632ValHis: 0.632 ± 0.109
3.789ValIle: 3.789 ± 0.285
5.53ValLys: 5.53 ± 0.278
4.352ValLeu: 4.352 ± 0.264
1.075ValMet: 1.075 ± 0.149
3.311ValAsn: 3.311 ± 0.255
3.004ValPro: 3.004 ± 0.472
2.594ValGln: 2.594 ± 0.322
1.587ValArg: 1.587 ± 0.236
3.909ValSer: 3.909 ± 0.439
3.106ValThr: 3.106 ± 0.312
4.591ValVal: 4.591 ± 0.506
0.683ValTrp: 0.683 ± 0.099
2.458ValTyr: 2.458 ± 0.225
0.0ValXaa: 0.0 ± 0.0
Trp
0.341TrpAla: 0.341 ± 0.08
0.256TrpCys: 0.256 ± 0.091
0.546TrpAsp: 0.546 ± 0.109
0.307TrpGlu: 0.307 ± 0.071
0.768TrpPhe: 0.768 ± 0.103
0.29TrpGly: 0.29 ± 0.073
0.068TrpHis: 0.068 ± 0.032
0.546TrpIle: 0.546 ± 0.1
0.461TrpLys: 0.461 ± 0.093
0.802TrpLeu: 0.802 ± 0.115
0.154TrpMet: 0.154 ± 0.053
0.717TrpAsn: 0.717 ± 0.12
0.256TrpPro: 0.256 ± 0.063
0.324TrpGln: 0.324 ± 0.08
0.102TrpArg: 0.102 ± 0.055
0.734TrpSer: 0.734 ± 0.141
0.307TrpThr: 0.307 ± 0.084
0.495TrpVal: 0.495 ± 0.073
0.119TrpTrp: 0.119 ± 0.051
0.546TrpTyr: 0.546 ± 0.113
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.4TyrAla: 1.4 ± 0.185
0.751TyrCys: 0.751 ± 0.125
2.27TyrAsp: 2.27 ± 0.208
1.553TyrGlu: 1.553 ± 0.187
2.219TyrPhe: 2.219 ± 0.195
2.151TyrGly: 2.151 ± 0.26
0.836TyrHis: 0.836 ± 0.149
3.533TyrIle: 3.533 ± 0.246
2.953TyrLys: 2.953 ± 0.351
4.182TyrLeu: 4.182 ± 0.332
0.802TyrMet: 0.802 ± 0.113
3.038TyrAsn: 3.038 ± 0.255
1.69TyrPro: 1.69 ± 0.176
1.451TyrGln: 1.451 ± 0.157
1.195TyrArg: 1.195 ± 0.144
2.663TyrSer: 2.663 ± 0.251
3.84TyrThr: 3.84 ± 0.365
2.134TyrVal: 2.134 ± 0.243
0.222TyrTrp: 0.222 ± 0.063
2.407TyrTyr: 2.407 ± 0.241
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 177 proteins (58589 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski