Amino acid dipepetide frequency for Shrimp hemocyte iridescent virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.69AlaAla: 1.69 ± 0.262
0.407AlaCys: 0.407 ± 0.075
1.873AlaAsp: 1.873 ± 0.194
2.892AlaGlu: 2.892 ± 0.363
1.751AlaPhe: 1.751 ± 0.183
1.466AlaGly: 1.466 ± 0.186
0.57AlaHis: 0.57 ± 0.108
2.953AlaIle: 2.953 ± 0.224
3.604AlaLys: 3.604 ± 0.311
2.749AlaLeu: 2.749 ± 0.216
1.039AlaMet: 1.039 ± 0.136
2.403AlaAsn: 2.403 ± 0.314
1.507AlaPro: 1.507 ± 0.207
1.548AlaGln: 1.548 ± 0.246
1.975AlaArg: 1.975 ± 0.264
1.873AlaSer: 1.873 ± 0.186
1.914AlaThr: 1.914 ± 0.224
2.505AlaVal: 2.505 ± 0.223
0.224AlaTrp: 0.224 ± 0.082
1.181AlaTyr: 1.181 ± 0.176
0.0AlaXaa: 0.0 ± 0.0
Cys
0.652CysAla: 0.652 ± 0.11
0.224CysCys: 0.224 ± 0.072
1.161CysAsp: 1.161 ± 0.177
1.446CysGlu: 1.446 ± 0.187
0.529CysPhe: 0.529 ± 0.095
0.815CysGly: 0.815 ± 0.14
0.224CysHis: 0.224 ± 0.064
1.405CysIle: 1.405 ± 0.191
2.016CysLys: 2.016 ± 0.328
1.385CysLeu: 1.385 ± 0.167
0.753CysMet: 0.753 ± 0.115
0.998CysAsn: 0.998 ± 0.144
0.529CysPro: 0.529 ± 0.136
0.407CysGln: 0.407 ± 0.095
0.774CysArg: 0.774 ± 0.141
0.835CysSer: 0.835 ± 0.147
0.815CysThr: 0.815 ± 0.127
1.201CysVal: 1.201 ± 0.165
0.204CysTrp: 0.204 ± 0.07
0.529CysTyr: 0.529 ± 0.101
0.0CysXaa: 0.0 ± 0.0
Asp
2.342AspAla: 2.342 ± 0.247
1.222AspCys: 1.222 ± 0.157
4.948AspAsp: 4.948 ± 0.411
6.964AspGlu: 6.964 ± 0.418
4.439AspPhe: 4.439 ± 0.316
3.238AspGly: 3.238 ± 0.259
0.896AspHis: 0.896 ± 0.119
4.948AspIle: 4.948 ± 0.378
4.846AspLys: 4.846 ± 0.301
5.233AspLeu: 5.233 ± 0.341
1.609AspMet: 1.609 ± 0.177
2.79AspAsn: 2.79 ± 0.254
2.138AspPro: 2.138 ± 0.229
1.425AspGln: 1.425 ± 0.176
2.097AspArg: 2.097 ± 0.263
3.91AspSer: 3.91 ± 0.332
1.853AspThr: 1.853 ± 0.212
4.073AspVal: 4.073 ± 0.271
0.733AspTrp: 0.733 ± 0.147
2.708AspTyr: 2.708 ± 0.276
0.0AspXaa: 0.0 ± 0.0
Glu
2.83GluAla: 2.83 ± 0.322
1.466GluCys: 1.466 ± 0.157
5.987GluAsp: 5.987 ± 0.459
7.412GluGlu: 7.412 ± 0.678
3.543GluPhe: 3.543 ± 0.263
2.342GluGly: 2.342 ± 0.25
1.303GluHis: 1.303 ± 0.143
7.921GluIle: 7.921 ± 0.439
8.939GluLys: 8.939 ± 0.592
5.193GluLeu: 5.193 ± 0.349
3.136GluMet: 3.136 ± 0.291
7.514GluAsn: 7.514 ± 0.416
2.423GluPro: 2.423 ± 0.476
2.484GluGln: 2.484 ± 0.244
4.215GluArg: 4.215 ± 0.406
5.03GluSer: 5.03 ± 0.349
4.235GluThr: 4.235 ± 0.332
2.871GluVal: 2.871 ± 0.267
0.713GluTrp: 0.713 ± 0.102
4.582GluTyr: 4.582 ± 0.365
0.0GluXaa: 0.0 ± 0.0
Phe
1.934PheAla: 1.934 ± 0.272
0.896PheCys: 0.896 ± 0.128
3.849PheAsp: 3.849 ± 0.284
4.459PheGlu: 4.459 ± 0.315
2.484PhePhe: 2.484 ± 0.259
2.647PheGly: 2.647 ± 0.271
1.14PheHis: 1.14 ± 0.154
4.663PheIle: 4.663 ± 0.31
4.378PheLys: 4.378 ± 0.289
3.95PheLeu: 3.95 ± 0.352
1.548PheMet: 1.548 ± 0.18
3.767PheAsn: 3.767 ± 0.315
1.792PhePro: 1.792 ± 0.227
1.955PheGln: 1.955 ± 0.198
1.69PheArg: 1.69 ± 0.191
4.011PheSer: 4.011 ± 0.299
2.892PheThr: 2.892 ± 0.276
2.993PheVal: 2.993 ± 0.251
0.611PheTrp: 0.611 ± 0.101
1.772PheTyr: 1.772 ± 0.217
0.0PheXaa: 0.0 ± 0.0
Gly
1.425GlyAla: 1.425 ± 0.18
0.977GlyCys: 0.977 ± 0.149
2.627GlyAsp: 2.627 ± 0.28
2.729GlyGlu: 2.729 ± 0.259
2.22GlyPhe: 2.22 ± 0.245
2.179GlyGly: 2.179 ± 0.332
0.652GlyHis: 0.652 ± 0.12
3.36GlyIle: 3.36 ± 0.295
4.867GlyLys: 4.867 ± 0.335
3.747GlyLeu: 3.747 ± 0.331
1.14GlyMet: 1.14 ± 0.154
3.075GlyAsn: 3.075 ± 0.221
0.916GlyPro: 0.916 ± 0.138
1.385GlyGln: 1.385 ± 0.323
1.181GlyArg: 1.181 ± 0.162
2.83GlySer: 2.83 ± 0.288
2.342GlyThr: 2.342 ± 0.249
2.586GlyVal: 2.586 ± 0.245
0.428GlyTrp: 0.428 ± 0.096
2.26GlyTyr: 2.26 ± 0.26
0.0GlyXaa: 0.0 ± 0.0
His
0.692HisAla: 0.692 ± 0.134
0.163HisCys: 0.163 ± 0.052
0.977HisAsp: 0.977 ± 0.166
1.629HisGlu: 1.629 ± 0.19
0.998HisPhe: 0.998 ± 0.167
0.733HisGly: 0.733 ± 0.135
0.448HisHis: 0.448 ± 0.089
1.609HisIle: 1.609 ± 0.186
1.324HisLys: 1.324 ± 0.158
1.751HisLeu: 1.751 ± 0.211
0.509HisMet: 0.509 ± 0.105
1.14HisAsn: 1.14 ± 0.165
0.672HisPro: 0.672 ± 0.133
0.855HisGln: 0.855 ± 0.178
0.672HisArg: 0.672 ± 0.125
1.018HisSer: 1.018 ± 0.13
0.611HisThr: 0.611 ± 0.107
1.059HisVal: 1.059 ± 0.153
0.102HisTrp: 0.102 ± 0.049
0.692HisTyr: 0.692 ± 0.103
0.0HisXaa: 0.0 ± 0.0
Ile
2.668IleAla: 2.668 ± 0.246
1.262IleCys: 1.262 ± 0.186
4.785IleAsp: 4.785 ± 0.277
7.371IleGlu: 7.371 ± 0.362
4.561IlePhe: 4.561 ± 0.351
3.645IleGly: 3.645 ± 0.361
1.507IleHis: 1.507 ± 0.178
4.928IleIle: 4.928 ± 0.329
6.984IleLys: 6.984 ± 0.467
7.371IleLeu: 7.371 ± 0.415
1.568IleMet: 1.568 ± 0.207
4.643IleAsn: 4.643 ± 0.328
4.195IlePro: 4.195 ± 0.382
2.993IleGln: 2.993 ± 0.244
3.584IleArg: 3.584 ± 0.353
6.068IleSer: 6.068 ± 0.393
3.787IleThr: 3.787 ± 0.279
5.009IleVal: 5.009 ± 0.268
0.509IleTrp: 0.509 ± 0.098
3.014IleTyr: 3.014 ± 0.301
0.0IleXaa: 0.0 ± 0.0
Lys
3.238LysAla: 3.238 ± 0.333
1.792LysCys: 1.792 ± 0.274
4.969LysAsp: 4.969 ± 0.357
6.292LysGlu: 6.292 ± 0.469
5.396LysPhe: 5.396 ± 0.358
2.708LysGly: 2.708 ± 0.271
1.71LysHis: 1.71 ± 0.169
9.265LysIle: 9.265 ± 0.609
9.041LysLys: 9.041 ± 0.534
7.208LysLeu: 7.208 ± 0.467
3.706LysMet: 3.706 ± 0.268
6.944LysAsn: 6.944 ± 0.378
4.704LysPro: 4.704 ± 0.472
3.93LysGln: 3.93 ± 0.336
4.622LysArg: 4.622 ± 0.286
6.394LysSer: 6.394 ± 0.471
5.417LysThr: 5.417 ± 0.441
3.849LysVal: 3.849 ± 0.332
0.672LysTrp: 0.672 ± 0.112
4.5LysTyr: 4.5 ± 0.351
0.0LysXaa: 0.0 ± 0.0
Leu
2.729LeuAla: 2.729 ± 0.224
1.324LeuCys: 1.324 ± 0.19
4.846LeuAsp: 4.846 ± 0.259
6.374LeuGlu: 6.374 ± 0.437
4.378LeuPhe: 4.378 ± 0.311
3.889LeuGly: 3.889 ± 0.418
1.548LeuHis: 1.548 ± 0.162
5.987LeuIle: 5.987 ± 0.428
8.756LeuLys: 8.756 ± 0.517
6.455LeuLeu: 6.455 ± 0.414
1.71LeuMet: 1.71 ± 0.169
6.048LeuAsn: 6.048 ± 0.418
2.729LeuPro: 2.729 ± 0.278
2.301LeuGln: 2.301 ± 0.195
3.584LeuArg: 3.584 ± 0.35
4.745LeuSer: 4.745 ± 0.349
3.014LeuThr: 3.014 ± 0.247
4.215LeuVal: 4.215 ± 0.337
0.529LeuTrp: 0.529 ± 0.118
2.892LeuTyr: 2.892 ± 0.299
0.0LeuXaa: 0.0 ± 0.0
Met
1.344MetAla: 1.344 ± 0.159
0.794MetCys: 0.794 ± 0.124
1.548MetAsp: 1.548 ± 0.162
2.382MetGlu: 2.382 ± 0.336
1.405MetPhe: 1.405 ± 0.166
1.568MetGly: 1.568 ± 0.173
0.367MetHis: 0.367 ± 0.075
1.894MetIle: 1.894 ± 0.177
2.871MetLys: 2.871 ± 0.257
1.996MetLeu: 1.996 ± 0.204
1.079MetMet: 1.079 ± 0.145
2.423MetAsn: 2.423 ± 0.243
0.591MetPro: 0.591 ± 0.099
0.509MetGln: 0.509 ± 0.097
0.794MetArg: 0.794 ± 0.124
2.138MetSer: 2.138 ± 0.199
1.548MetThr: 1.548 ± 0.198
1.344MetVal: 1.344 ± 0.155
0.244MetTrp: 0.244 ± 0.066
1.079MetTyr: 1.079 ± 0.145
0.0MetXaa: 0.0 ± 0.0
Asn
1.934AsnAla: 1.934 ± 0.194
0.998AsnCys: 0.998 ± 0.166
3.319AsnAsp: 3.319 ± 0.258
5.539AsnGlu: 5.539 ± 0.364
4.439AsnPhe: 4.439 ± 0.346
3.564AsnGly: 3.564 ± 0.324
1.181AsnHis: 1.181 ± 0.162
6.272AsnIle: 6.272 ± 0.41
5.905AsnLys: 5.905 ± 0.327
6.74AsnLeu: 6.74 ± 0.405
1.772AsnMet: 1.772 ± 0.19
3.136AsnAsn: 3.136 ± 0.323
3.217AsnPro: 3.217 ± 0.3
2.097AsnGln: 2.097 ± 0.226
2.484AsnArg: 2.484 ± 0.21
3.564AsnSer: 3.564 ± 0.282
3.034AsnThr: 3.034 ± 0.272
5.193AsnVal: 5.193 ± 0.38
0.631AsnTrp: 0.631 ± 0.116
2.26AsnTyr: 2.26 ± 0.173
0.0AsnXaa: 0.0 ± 0.0
Pro
1.059ProAla: 1.059 ± 0.165
0.57ProCys: 0.57 ± 0.11
2.382ProAsp: 2.382 ± 0.224
4.439ProGlu: 4.439 ± 0.315
1.873ProPhe: 1.873 ± 0.213
1.486ProGly: 1.486 ± 0.182
0.529ProHis: 0.529 ± 0.102
2.892ProIle: 2.892 ± 0.279
4.582ProLys: 4.582 ± 0.479
2.016ProLeu: 2.016 ± 0.197
0.835ProMet: 0.835 ± 0.177
2.525ProAsn: 2.525 ± 0.25
1.588ProPro: 1.588 ± 0.312
1.751ProGln: 1.751 ± 0.379
1.71ProArg: 1.71 ± 0.241
2.749ProSer: 2.749 ± 0.291
2.403ProThr: 2.403 ± 0.585
2.668ProVal: 2.668 ± 0.217
0.163ProTrp: 0.163 ± 0.056
1.222ProTyr: 1.222 ± 0.166
0.0ProXaa: 0.0 ± 0.0
Gln
1.425GlnAla: 1.425 ± 0.207
0.591GlnCys: 0.591 ± 0.122
2.321GlnAsp: 2.321 ± 0.205
2.138GlnGlu: 2.138 ± 0.256
1.486GlnPhe: 1.486 ± 0.193
0.977GlnGly: 0.977 ± 0.154
0.652GlnHis: 0.652 ± 0.093
3.258GlnIle: 3.258 ± 0.262
3.421GlnLys: 3.421 ± 0.256
2.525GlnLeu: 2.525 ± 0.28
0.916GlnMet: 0.916 ± 0.145
2.708GlnAsn: 2.708 ± 0.251
1.772GlnPro: 1.772 ± 0.42
1.446GlnGln: 1.446 ± 0.203
1.527GlnArg: 1.527 ± 0.22
1.833GlnSer: 1.833 ± 0.212
2.179GlnThr: 2.179 ± 0.236
1.568GlnVal: 1.568 ± 0.182
0.305GlnTrp: 0.305 ± 0.087
1.324GlnTyr: 1.324 ± 0.176
0.0GlnXaa: 0.0 ± 0.0
Arg
2.158ArgAla: 2.158 ± 0.245
0.489ArgCys: 0.489 ± 0.118
2.973ArgAsp: 2.973 ± 0.3
4.5ArgGlu: 4.5 ± 0.54
1.527ArgPhe: 1.527 ± 0.194
1.812ArgGly: 1.812 ± 0.168
0.509ArgHis: 0.509 ± 0.107
2.749ArgIle: 2.749 ± 0.235
4.134ArgLys: 4.134 ± 0.29
3.38ArgLeu: 3.38 ± 0.3
1.324ArgMet: 1.324 ± 0.19
3.217ArgAsn: 3.217 ± 0.22
1.14ArgPro: 1.14 ± 0.174
1.649ArgGln: 1.649 ± 0.226
2.627ArgArg: 2.627 ± 0.324
2.321ArgSer: 2.321 ± 0.333
1.792ArgThr: 1.792 ± 0.195
2.444ArgVal: 2.444 ± 0.218
0.407ArgTrp: 0.407 ± 0.103
1.955ArgTyr: 1.955 ± 0.214
0.0ArgXaa: 0.0 ± 0.0
Ser
2.281SerAla: 2.281 ± 0.253
1.018SerCys: 1.018 ± 0.156
4.724SerAsp: 4.724 ± 0.339
5.579SerGlu: 5.579 ± 0.532
3.543SerPhe: 3.543 ± 0.28
2.708SerGly: 2.708 ± 0.263
1.344SerHis: 1.344 ± 0.18
4.174SerIle: 4.174 ± 0.283
5.905SerLys: 5.905 ± 0.416
4.622SerLeu: 4.622 ± 0.329
1.303SerMet: 1.303 ± 0.145
3.36SerAsn: 3.36 ± 0.274
2.464SerPro: 2.464 ± 0.264
2.606SerGln: 2.606 ± 0.219
3.258SerArg: 3.258 ± 0.392
4.745SerSer: 4.745 ± 0.463
3.564SerThr: 3.564 ± 0.301
3.686SerVal: 3.686 ± 0.276
0.509SerTrp: 0.509 ± 0.09
2.342SerTyr: 2.342 ± 0.218
0.0SerXaa: 0.0 ± 0.0
Thr
1.731ThrAla: 1.731 ± 0.187
0.753ThrCys: 0.753 ± 0.129
3.034ThrAsp: 3.034 ± 0.269
4.358ThrGlu: 4.358 ± 0.658
2.851ThrPhe: 2.851 ± 0.266
2.179ThrGly: 2.179 ± 0.175
0.998ThrHis: 0.998 ± 0.18
3.706ThrIle: 3.706 ± 0.343
4.48ThrLys: 4.48 ± 0.296
3.401ThrLeu: 3.401 ± 0.27
1.039ThrMet: 1.039 ± 0.151
3.421ThrAsn: 3.421 ± 0.223
2.729ThrPro: 2.729 ± 0.311
1.772ThrGln: 1.772 ± 0.181
2.016ThrArg: 2.016 ± 0.246
3.441ThrSer: 3.441 ± 0.293
3.584ThrThr: 3.584 ± 0.85
3.197ThrVal: 3.197 ± 0.244
0.448ThrTrp: 0.448 ± 0.087
1.405ThrTyr: 1.405 ± 0.193
0.0ThrXaa: 0.0 ± 0.0
Val
2.423ValAla: 2.423 ± 0.207
0.977ValCys: 0.977 ± 0.167
3.36ValAsp: 3.36 ± 0.274
4.032ValGlu: 4.032 ± 0.315
2.993ValPhe: 2.993 ± 0.267
2.688ValGly: 2.688 ± 0.265
1.12ValHis: 1.12 ± 0.169
4.276ValIle: 4.276 ± 0.36
6.048ValLys: 6.048 ± 0.356
4.582ValLeu: 4.582 ± 0.278
1.71ValMet: 1.71 ± 0.183
3.706ValAsn: 3.706 ± 0.26
2.281ValPro: 2.281 ± 0.216
1.507ValGln: 1.507 ± 0.183
2.362ValArg: 2.362 ± 0.231
3.584ValSer: 3.584 ± 0.292
2.973ValThr: 2.973 ± 0.339
3.36ValVal: 3.36 ± 0.267
0.428ValTrp: 0.428 ± 0.1
2.199ValTyr: 2.199 ± 0.216
0.0ValXaa: 0.0 ± 0.0
Trp
0.346TrpAla: 0.346 ± 0.093
0.163TrpCys: 0.163 ± 0.059
0.285TrpAsp: 0.285 ± 0.076
0.591TrpGlu: 0.591 ± 0.124
0.407TrpPhe: 0.407 ± 0.111
0.326TrpGly: 0.326 ± 0.108
0.102TrpHis: 0.102 ± 0.043
0.672TrpIle: 0.672 ± 0.143
0.733TrpLys: 0.733 ± 0.123
0.794TrpLeu: 0.794 ± 0.117
0.305TrpMet: 0.305 ± 0.086
0.896TrpAsn: 0.896 ± 0.134
0.102TrpPro: 0.102 ± 0.042
0.448TrpGln: 0.448 ± 0.084
0.163TrpArg: 0.163 ± 0.056
0.387TrpSer: 0.387 ± 0.097
0.489TrpThr: 0.489 ± 0.102
0.611TrpVal: 0.611 ± 0.103
0.081TrpTrp: 0.081 ± 0.037
0.387TrpTyr: 0.387 ± 0.105
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.222TyrAla: 1.222 ± 0.141
0.815TyrCys: 0.815 ± 0.12
2.444TyrAsp: 2.444 ± 0.224
3.014TyrGlu: 3.014 ± 0.249
2.606TyrPhe: 2.606 ± 0.197
1.955TyrGly: 1.955 ± 0.21
0.855TyrHis: 0.855 ± 0.13
3.523TyrIle: 3.523 ± 0.269
3.523TyrLys: 3.523 ± 0.278
2.973TyrLeu: 2.973 ± 0.221
0.896TyrMet: 0.896 ± 0.136
2.566TyrAsn: 2.566 ± 0.205
1.955TyrPro: 1.955 ± 0.19
1.303TyrGln: 1.303 ± 0.209
1.772TyrArg: 1.772 ± 0.245
2.26TyrSer: 2.26 ± 0.196
2.097TyrThr: 2.097 ± 0.225
2.179TyrVal: 2.179 ± 0.22
0.346TyrTrp: 0.346 ± 0.086
1.955TyrTyr: 1.955 ± 0.22
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 170 proteins (49110 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski