Amino acid dipepetide frequency for Bohle iridovirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.518AlaAla: 10.518 ± 0.934
1.816AlaCys: 1.816 ± 0.253
4.283AlaAsp: 4.283 ± 0.396
5.756AlaGlu: 5.756 ± 0.51
2.741AlaPhe: 2.741 ± 0.308
7.537AlaGly: 7.537 ± 0.632
1.919AlaHis: 1.919 ± 0.267
2.433AlaIle: 2.433 ± 0.281
4.454AlaLys: 4.454 ± 0.454
7.674AlaLeu: 7.674 ± 0.732
3.083AlaMet: 3.083 ± 0.307
1.816AlaAsn: 1.816 ± 0.316
4.488AlaPro: 4.488 ± 0.449
2.638AlaGln: 2.638 ± 0.333
5.893AlaArg: 5.893 ± 0.787
6.304AlaSer: 6.304 ± 0.596
4.351AlaThr: 4.351 ± 0.364
8.497AlaVal: 8.497 ± 0.578
1.405AlaTrp: 1.405 ± 0.222
2.672AlaTyr: 2.672 ± 0.263
0.0AlaXaa: 0.0 ± 0.0
Cys
1.884CysAla: 1.884 ± 0.269
0.754CysCys: 0.754 ± 0.163
1.268CysAsp: 1.268 ± 0.197
1.165CysGlu: 1.165 ± 0.217
0.582CysPhe: 0.582 ± 0.137
1.713CysGly: 1.713 ± 0.262
0.445CysHis: 0.445 ± 0.166
0.582CysIle: 0.582 ± 0.17
1.405CysLys: 1.405 ± 0.239
1.61CysLeu: 1.61 ± 0.327
0.685CysMet: 0.685 ± 0.138
0.514CysAsn: 0.514 ± 0.163
1.542CysPro: 1.542 ± 0.294
0.548CysGln: 0.548 ± 0.119
1.61CysArg: 1.61 ± 0.254
1.473CysSer: 1.473 ± 0.24
0.754CysThr: 0.754 ± 0.129
1.576CysVal: 1.576 ± 0.224
0.514CysTrp: 0.514 ± 0.122
0.651CysTyr: 0.651 ± 0.162
0.0CysXaa: 0.0 ± 0.0
Asp
5.139AspAla: 5.139 ± 0.42
1.405AspCys: 1.405 ± 0.199
3.426AspAsp: 3.426 ± 0.361
3.015AspGlu: 3.015 ± 0.368
1.953AspPhe: 1.953 ± 0.382
4.659AspGly: 4.659 ± 0.552
0.994AspHis: 0.994 ± 0.155
2.433AspIle: 2.433 ± 0.346
2.741AspLys: 2.741 ± 0.305
5.345AspLeu: 5.345 ± 0.471
2.09AspMet: 2.09 ± 0.274
1.713AspAsn: 1.713 ± 0.331
4.214AspPro: 4.214 ± 0.42
1.439AspGln: 1.439 ± 0.243
4.077AspArg: 4.077 ± 0.445
4.557AspSer: 4.557 ± 0.428
2.604AspThr: 2.604 ± 0.272
4.899AspVal: 4.899 ± 0.366
0.925AspTrp: 0.925 ± 0.203
2.398AspTyr: 2.398 ± 0.319
0.0AspXaa: 0.0 ± 0.0
Glu
5.961GluAla: 5.961 ± 0.436
1.507GluCys: 1.507 ± 0.252
3.7GluAsp: 3.7 ± 0.481
3.495GluGlu: 3.495 ± 0.423
1.747GluPhe: 1.747 ± 0.232
3.666GluGly: 3.666 ± 0.405
0.857GluHis: 0.857 ± 0.182
1.816GluIle: 1.816 ± 0.222
3.186GluLys: 3.186 ± 0.339
3.46GluLeu: 3.46 ± 0.365
2.158GluMet: 2.158 ± 0.323
1.199GluAsn: 1.199 ± 0.234
2.638GluPro: 2.638 ± 0.37
1.919GluGln: 1.919 ± 0.498
4.043GluArg: 4.043 ± 0.355
3.7GluSer: 3.7 ± 0.422
3.563GluThr: 3.563 ± 0.342
3.529GluVal: 3.529 ± 0.398
1.233GluTrp: 1.233 ± 0.208
2.056GluTyr: 2.056 ± 0.281
0.0GluXaa: 0.0 ± 0.0
Phe
2.912PheAla: 2.912 ± 0.396
0.617PheCys: 0.617 ± 0.144
1.507PheAsp: 1.507 ± 0.24
1.884PheGlu: 1.884 ± 0.262
1.199PhePhe: 1.199 ± 0.23
2.467PheGly: 2.467 ± 0.279
0.685PheHis: 0.685 ± 0.155
0.994PheIle: 0.994 ± 0.206
1.473PheLys: 1.473 ± 0.202
3.221PheLeu: 3.221 ± 0.391
0.891PheMet: 0.891 ± 0.15
1.268PheAsn: 1.268 ± 0.169
1.919PhePro: 1.919 ± 0.257
0.685PheGln: 0.685 ± 0.151
2.295PheArg: 2.295 ± 0.265
2.672PheSer: 2.672 ± 0.3
1.884PheThr: 1.884 ± 0.305
2.912PheVal: 2.912 ± 0.321
0.308PheTrp: 0.308 ± 0.117
0.994PheTyr: 0.994 ± 0.158
0.0PheXaa: 0.0 ± 0.0
Gly
6.612GlyAla: 6.612 ± 0.598
1.679GlyCys: 1.679 ± 0.27
4.317GlyAsp: 4.317 ± 0.45
3.46GlyGlu: 3.46 ± 0.324
2.741GlyPhe: 2.741 ± 0.316
5.653GlyGly: 5.653 ± 0.541
1.919GlyHis: 1.919 ± 0.302
2.158GlyIle: 2.158 ± 0.269
4.317GlyLys: 4.317 ± 0.498
5.584GlyLeu: 5.584 ± 0.472
1.953GlyMet: 1.953 ± 0.254
1.37GlyAsn: 1.37 ± 0.227
4.214GlyPro: 4.214 ± 0.414
1.85GlyGln: 1.85 ± 0.245
5.516GlyArg: 5.516 ± 0.516
5.927GlySer: 5.927 ± 0.438
5.002GlyThr: 5.002 ± 0.659
5.584GlyVal: 5.584 ± 0.532
1.439GlyTrp: 1.439 ± 0.244
2.467GlyTyr: 2.467 ± 0.291
0.0GlyXaa: 0.0 ± 0.0
His
1.61HisAla: 1.61 ± 0.201
0.411HisCys: 0.411 ± 0.106
1.165HisAsp: 1.165 ± 0.186
0.754HisGlu: 0.754 ± 0.167
0.548HisPhe: 0.548 ± 0.124
1.884HisGly: 1.884 ± 0.287
0.651HisHis: 0.651 ± 0.195
1.028HisIle: 1.028 ± 0.22
0.891HisLys: 0.891 ± 0.165
2.158HisLeu: 2.158 ± 0.277
0.617HisMet: 0.617 ± 0.137
0.548HisAsn: 0.548 ± 0.16
1.473HisPro: 1.473 ± 0.252
0.719HisGln: 0.719 ± 0.16
1.37HisArg: 1.37 ± 0.235
1.199HisSer: 1.199 ± 0.207
1.233HisThr: 1.233 ± 0.208
2.09HisVal: 2.09 ± 0.297
0.24HisTrp: 0.24 ± 0.071
0.857HisTyr: 0.857 ± 0.163
0.0HisXaa: 0.0 ± 0.0
Ile
2.398IleAla: 2.398 ± 0.35
0.582IleCys: 0.582 ± 0.133
1.953IleAsp: 1.953 ± 0.265
1.679IleGlu: 1.679 ± 0.24
1.096IlePhe: 1.096 ± 0.185
1.782IleGly: 1.782 ± 0.273
0.891IleHis: 0.891 ± 0.184
1.028IleIle: 1.028 ± 0.173
2.364IleLys: 2.364 ± 0.234
3.323IleLeu: 3.323 ± 0.333
1.233IleMet: 1.233 ± 0.147
0.925IleAsn: 0.925 ± 0.18
2.158IlePro: 2.158 ± 0.254
0.788IleGln: 0.788 ± 0.188
2.809IleArg: 2.809 ± 0.292
2.398IleSer: 2.398 ± 0.254
1.507IleThr: 1.507 ± 0.285
2.912IleVal: 2.912 ± 0.335
0.137IleTrp: 0.137 ± 0.084
1.028IleTyr: 1.028 ± 0.234
0.0IleXaa: 0.0 ± 0.0
Lys
4.728LysAla: 4.728 ± 0.518
0.857LysCys: 0.857 ± 0.185
3.015LysAsp: 3.015 ± 0.393
3.152LysGlu: 3.152 ± 0.336
1.405LysPhe: 1.405 ± 0.244
4.214LysGly: 4.214 ± 0.365
0.719LysHis: 0.719 ± 0.129
2.398LysIle: 2.398 ± 0.274
3.837LysLys: 3.837 ± 0.442
4.077LysLeu: 4.077 ± 0.487
1.919LysMet: 1.919 ± 0.246
1.713LysAsn: 1.713 ± 0.225
3.392LysPro: 3.392 ± 0.53
1.542LysGln: 1.542 ± 0.257
5.345LysArg: 5.345 ± 0.575
4.42LysSer: 4.42 ± 0.961
3.7LysThr: 3.7 ± 0.361
3.426LysVal: 3.426 ± 0.307
0.685LysTrp: 0.685 ± 0.139
1.919LysTyr: 1.919 ± 0.246
0.0LysXaa: 0.0 ± 0.0
Leu
6.955LeuAla: 6.955 ± 0.51
2.021LeuCys: 2.021 ± 0.347
5.105LeuAsp: 5.105 ± 0.389
5.105LeuGlu: 5.105 ± 0.43
2.844LeuPhe: 2.844 ± 0.296
5.447LeuGly: 5.447 ± 0.496
1.713LeuHis: 1.713 ± 0.27
2.638LeuIle: 2.638 ± 0.339
4.968LeuLys: 4.968 ± 0.369
6.681LeuLeu: 6.681 ± 0.522
2.193LeuMet: 2.193 ± 0.268
2.707LeuAsn: 2.707 ± 0.266
4.18LeuPro: 4.18 ± 0.52
1.645LeuGln: 1.645 ± 0.259
6.372LeuArg: 6.372 ± 0.507
6.372LeuSer: 6.372 ± 0.486
5.139LeuThr: 5.139 ± 0.462
5.893LeuVal: 5.893 ± 0.451
0.994LeuTrp: 0.994 ± 0.179
2.227LeuTyr: 2.227 ± 0.204
0.0LeuXaa: 0.0 ± 0.0
Met
3.152MetAla: 3.152 ± 0.402
0.925MetCys: 0.925 ± 0.216
1.953MetAsp: 1.953 ± 0.213
1.987MetGlu: 1.987 ± 0.227
1.165MetPhe: 1.165 ± 0.195
2.501MetGly: 2.501 ± 0.27
0.788MetHis: 0.788 ± 0.162
0.582MetIle: 0.582 ± 0.127
0.822MetLys: 0.822 ± 0.157
2.227MetLeu: 2.227 ± 0.268
0.754MetMet: 0.754 ± 0.171
0.445MetAsn: 0.445 ± 0.134
1.507MetPro: 1.507 ± 0.221
0.617MetGln: 0.617 ± 0.159
2.124MetArg: 2.124 ± 0.319
2.946MetSer: 2.946 ± 0.39
1.953MetThr: 1.953 ± 0.24
2.193MetVal: 2.193 ± 0.323
0.377MetTrp: 0.377 ± 0.117
0.651MetTyr: 0.651 ± 0.169
0.0MetXaa: 0.0 ± 0.0
Asn
2.295AsnAla: 2.295 ± 0.278
0.48AsnCys: 0.48 ± 0.122
1.028AsnAsp: 1.028 ± 0.17
0.959AsnGlu: 0.959 ± 0.181
0.754AsnPhe: 0.754 ± 0.139
1.576AsnGly: 1.576 ± 0.18
0.48AsnHis: 0.48 ± 0.127
1.336AsnIle: 1.336 ± 0.255
1.096AsnLys: 1.096 ± 0.216
2.672AsnLeu: 2.672 ± 0.353
0.959AsnMet: 0.959 ± 0.212
0.891AsnAsn: 0.891 ± 0.22
2.398AsnPro: 2.398 ± 0.365
0.719AsnGln: 0.719 ± 0.116
1.542AsnArg: 1.542 ± 0.224
1.747AsnSer: 1.747 ± 0.229
1.302AsnThr: 1.302 ± 0.264
2.741AsnVal: 2.741 ± 0.279
0.445AsnTrp: 0.445 ± 0.13
0.857AsnTyr: 0.857 ± 0.157
0.0AsnXaa: 0.0 ± 0.0
Pro
6.235ProAla: 6.235 ± 0.857
1.131ProCys: 1.131 ± 0.222
3.7ProAsp: 3.7 ± 0.325
4.488ProGlu: 4.488 ± 0.589
1.953ProPhe: 1.953 ± 0.243
4.043ProGly: 4.043 ± 0.457
1.884ProHis: 1.884 ± 0.27
1.747ProIle: 1.747 ± 0.244
3.529ProLys: 3.529 ± 0.597
4.317ProLeu: 4.317 ± 0.567
1.199ProMet: 1.199 ± 0.175
1.439ProAsn: 1.439 ± 0.227
4.111ProPro: 4.111 ± 0.636
1.919ProGln: 1.919 ± 0.278
3.906ProArg: 3.906 ± 0.871
4.728ProSer: 4.728 ± 0.4
3.083ProThr: 3.083 ± 0.463
6.784ProVal: 6.784 ± 0.746
0.822ProTrp: 0.822 ± 0.171
1.542ProTyr: 1.542 ± 0.273
0.0ProXaa: 0.0 ± 0.0
Gln
2.398GlnAla: 2.398 ± 0.245
0.685GlnCys: 0.685 ± 0.143
1.987GlnAsp: 1.987 ± 0.279
1.782GlnGlu: 1.782 ± 0.293
0.651GlnPhe: 0.651 ± 0.125
1.953GlnGly: 1.953 ± 0.259
0.651GlnHis: 0.651 ± 0.182
1.028GlnIle: 1.028 ± 0.184
1.268GlnLys: 1.268 ± 0.199
1.85GlnLeu: 1.85 ± 0.304
0.685GlnMet: 0.685 ± 0.17
0.617GlnAsn: 0.617 ± 0.097
1.679GlnPro: 1.679 ± 0.453
1.542GlnGln: 1.542 ± 0.498
2.227GlnArg: 2.227 ± 0.375
2.056GlnSer: 2.056 ± 0.252
2.056GlnThr: 2.056 ± 0.369
2.193GlnVal: 2.193 ± 0.283
0.343GlnTrp: 0.343 ± 0.125
0.651GlnTyr: 0.651 ± 0.127
0.0GlnXaa: 0.0 ± 0.0
Arg
5.345ArgAla: 5.345 ± 0.514
1.062ArgCys: 1.062 ± 0.2
4.728ArgAsp: 4.728 ± 0.543
4.248ArgGlu: 4.248 ± 0.386
2.193ArgPhe: 2.193 ± 0.227
5.447ArgGly: 5.447 ± 0.536
1.85ArgHis: 1.85 ± 0.267
2.193ArgIle: 2.193 ± 0.299
5.276ArgLys: 5.276 ± 1.042
5.893ArgLeu: 5.893 ± 0.452
2.056ArgMet: 2.056 ± 0.321
2.227ArgAsn: 2.227 ± 0.329
4.454ArgPro: 4.454 ± 0.479
2.295ArgGln: 2.295 ± 0.288
6.098ArgArg: 6.098 ± 0.618
4.283ArgSer: 4.283 ± 0.872
3.871ArgThr: 3.871 ± 0.45
5.722ArgVal: 5.722 ± 0.498
0.891ArgTrp: 0.891 ± 0.157
2.193ArgTyr: 2.193 ± 0.24
0.0ArgXaa: 0.0 ± 0.0
Ser
6.475SerAla: 6.475 ± 0.592
1.439SerCys: 1.439 ± 0.209
4.968SerAsp: 4.968 ± 0.397
3.837SerGlu: 3.837 ± 0.438
2.741SerPhe: 2.741 ± 0.31
5.893SerGly: 5.893 ± 0.78
1.507SerHis: 1.507 ± 0.258
2.261SerIle: 2.261 ± 0.287
3.392SerLys: 3.392 ± 0.399
6.27SerLeu: 6.27 ± 0.6
1.713SerMet: 1.713 ± 0.264
1.782SerAsn: 1.782 ± 0.247
6.784SerPro: 6.784 ± 1.795
2.295SerGln: 2.295 ± 0.326
4.454SerArg: 4.454 ± 0.443
5.55SerSer: 5.55 ± 0.48
2.981SerThr: 2.981 ± 0.332
6.201SerVal: 6.201 ± 0.53
1.199SerTrp: 1.199 ± 0.189
1.782SerTyr: 1.782 ± 0.218
0.0SerXaa: 0.0 ± 0.0
Thr
5.79ThrAla: 5.79 ± 0.518
1.062ThrCys: 1.062 ± 0.199
3.495ThrAsp: 3.495 ± 0.282
2.364ThrGlu: 2.364 ± 0.267
2.33ThrPhe: 2.33 ± 0.257
4.968ThrGly: 4.968 ± 0.466
0.719ThrHis: 0.719 ± 0.133
2.227ThrIle: 2.227 ± 0.239
2.638ThrLys: 2.638 ± 0.342
4.557ThrLeu: 4.557 ± 0.455
1.679ThrMet: 1.679 ± 0.241
1.062ThrAsn: 1.062 ± 0.154
3.734ThrPro: 3.734 ± 0.348
1.645ThrGln: 1.645 ± 0.289
3.289ThrArg: 3.289 ± 0.315
3.323ThrSer: 3.323 ± 0.669
1.919ThrThr: 1.919 ± 0.332
5.927ThrVal: 5.927 ± 0.446
0.411ThrTrp: 0.411 ± 0.174
1.336ThrTyr: 1.336 ± 0.196
0.0ThrXaa: 0.0 ± 0.0
Val
6.27ValAla: 6.27 ± 0.508
1.816ValCys: 1.816 ± 0.28
5.071ValAsp: 5.071 ± 0.381
4.146ValGlu: 4.146 ± 0.425
2.775ValPhe: 2.775 ± 0.308
5.002ValGly: 5.002 ± 0.577
2.09ValHis: 2.09 ± 0.275
2.227ValIle: 2.227 ± 0.23
6.235ValLys: 6.235 ± 0.69
6.921ValLeu: 6.921 ± 0.598
2.57ValMet: 2.57 ± 0.33
2.467ValAsn: 2.467 ± 0.282
5.105ValPro: 5.105 ± 0.603
2.364ValGln: 2.364 ± 0.266
6.818ValArg: 6.818 ± 0.684
6.612ValSer: 6.612 ± 0.565
4.625ValThr: 4.625 ± 0.501
6.475ValVal: 6.475 ± 0.552
1.233ValTrp: 1.233 ± 0.193
2.433ValTyr: 2.433 ± 0.305
0.0ValXaa: 0.0 ± 0.0
Trp
0.994TrpAla: 0.994 ± 0.213
0.308TrpCys: 0.308 ± 0.095
1.131TrpAsp: 1.131 ± 0.177
0.788TrpGlu: 0.788 ± 0.161
0.548TrpPhe: 0.548 ± 0.117
0.891TrpGly: 0.891 ± 0.165
0.274TrpHis: 0.274 ± 0.1
0.548TrpIle: 0.548 ± 0.154
0.925TrpLys: 0.925 ± 0.154
1.37TrpLeu: 1.37 ± 0.182
0.445TrpMet: 0.445 ± 0.111
0.548TrpAsn: 0.548 ± 0.156
0.617TrpPro: 0.617 ± 0.156
0.206TrpGln: 0.206 ± 0.082
0.857TrpArg: 0.857 ± 0.159
0.788TrpSer: 0.788 ± 0.141
1.37TrpThr: 1.37 ± 0.217
0.822TrpVal: 0.822 ± 0.203
0.171TrpTrp: 0.171 ± 0.07
0.411TrpTyr: 0.411 ± 0.135
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.604TyrAla: 2.604 ± 0.305
0.719TyrCys: 0.719 ± 0.163
2.227TyrAsp: 2.227 ± 0.289
1.37TyrGlu: 1.37 ± 0.206
0.822TyrPhe: 0.822 ± 0.161
2.467TyrGly: 2.467 ± 0.311
0.411TyrHis: 0.411 ± 0.099
1.336TyrIle: 1.336 ± 0.217
1.645TyrLys: 1.645 ± 0.193
1.987TyrLeu: 1.987 ± 0.274
0.754TyrMet: 0.754 ± 0.153
1.028TyrAsn: 1.028 ± 0.212
1.953TyrPro: 1.953 ± 0.199
0.925TyrGln: 0.925 ± 0.172
1.679TyrArg: 1.679 ± 0.229
2.467TyrSer: 2.467 ± 0.26
1.576TyrThr: 1.576 ± 0.204
2.912TyrVal: 2.912 ± 0.359
0.24TyrTrp: 0.24 ± 0.104
0.822TyrTyr: 0.822 ± 0.156
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 100 proteins (29189 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski