Amino acid dipeptide frequency for Halorubrum virus Hardycor2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
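As a rough illustration of what such a calculation involves, the sketch below counts overlapping adjacent pairs in each protein and reports them in per mille. It assumes one-letter sequences, overlapping windows, and normalization by the total number of pairs; the page does not state these details, and the function name and toy sequences are hypothetical.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted with overlapping windows inside each
    protein and never span two proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the proteome.
toy = ["MAAG", "AAGE"]
freqs = dipeptide_per_mille(toy)
```

With the toy input there are six pairs in total, two of which are AA, so AA accounts for one third of all pairs.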

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
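The expected value and the over/under comparison can be sketched as follows. The three observed values are taken from the table; the plain comparison against the uniform expectation is an assumption, since the page does not say whether its coloring also takes the error estimate into account.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation: 1 / 400 of all dipeptides, expressed in per mille.
expected_per_mille = 1000.0 / STANDARD_AMINO_ACIDS ** 2


def classify(observed, expected=expected_per_mille):
    """Label a per mille value relative to the uniform expectation."""
    if observed > expected:
        return "over"
    if observed < expected:
        return "under"
    return "expected"


# Observed values (per mille) copied from the table.
examples = {"AspGlu": 9.376, "CysMet": 0.043, "AlaAla": 4.347}
labels = {pair: classify(value) for pair, value in examples.items()}
```

Under this simple rule AspGlu and AlaAla come out as overrepresented and CysMet as underrepresented.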

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 4.347‰ corresponds to 0.004347).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.347 ± 0.498
AlaCys: 0.682 ± 0.174
AlaAsp: 5.029 ± 0.476
AlaGlu: 6.564 ± 0.589
AlaPhe: 2.728 ± 0.344
AlaGly: 5.285 ± 0.562
AlaHis: 1.449 ± 0.214
AlaIle: 2.728 ± 0.376
AlaLys: 3.58 ± 0.399
AlaLeu: 5.328 ± 0.527
AlaMet: 1.321 ± 0.275
AlaAsn: 2.983 ± 0.366
AlaPro: 3.197 ± 0.37
AlaGln: 2.174 ± 0.357
AlaArg: 3.58 ± 0.387
AlaSer: 4.39 ± 0.554
AlaThr: 4.049 ± 0.424
AlaVal: 4.433 ± 0.527
AlaTrp: 0.639 ± 0.153
AlaTyr: 2.557 ± 0.285
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.384 ± 0.129
CysCys: 0.128 ± 0.103
CysAsp: 0.767 ± 0.198
CysGlu: 0.98 ± 0.252
CysPhe: 0.256 ± 0.101
CysGly: 1.236 ± 0.253
CysHis: 0.298 ± 0.128
CysIle: 0.128 ± 0.07
CysLys: 0.426 ± 0.121
CysLeu: 0.597 ± 0.158
CysMet: 0.043 ± 0.037
CysAsn: 0.426 ± 0.141
CysPro: 0.852 ± 0.226
CysGln: 0.298 ± 0.111
CysArg: 0.682 ± 0.169
CysSer: 0.554 ± 0.143
CysThr: 0.256 ± 0.084
CysVal: 0.341 ± 0.115
CysTrp: 0.17 ± 0.076
CysTyr: 0.17 ± 0.085
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.137 ± 0.54
AspCys: 0.98 ± 0.187
AspAsp: 7.714 ± 0.708
AspGlu: 9.376 ± 0.741
AspPhe: 4.347 ± 0.432
AspGly: 8.013 ± 0.663
AspHis: 1.534 ± 0.275
AspIle: 3.623 ± 0.343
AspLys: 2.515 ± 0.34
AspLeu: 6.819 ± 0.563
AspMet: 1.108 ± 0.193
AspAsn: 2.6 ± 0.426
AspPro: 4.177 ± 0.494
AspGln: 1.406 ± 0.222
AspArg: 3.878 ± 0.421
AspSer: 6.223 ± 0.476
AspThr: 4.944 ± 0.572
AspVal: 6.99 ± 0.454
AspTrp: 1.236 ± 0.214
AspTyr: 4.177 ± 0.419
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.075 ± 0.598
GluCys: 0.639 ± 0.206
GluAsp: 9.547 ± 0.673
GluGlu: 8.268 ± 0.702
GluPhe: 3.665 ± 0.427
GluGly: 6.904 ± 0.511
GluHis: 1.833 ± 0.244
GluIle: 5.796 ± 0.473
GluLys: 4.347 ± 0.461
GluLeu: 7.544 ± 0.627
GluMet: 3.111 ± 0.423
GluAsn: 4.56 ± 0.401
GluPro: 4.092 ± 0.429
GluGln: 3.367 ± 0.373
GluArg: 5.583 ± 0.486
GluSer: 5.285 ± 0.524
GluThr: 4.944 ± 0.449
GluVal: 6.095 ± 0.531
GluTrp: 1.534 ± 0.251
GluTyr: 3.665 ± 0.391
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.918 ± 0.256
PheCys: 0.341 ± 0.123
PheAsp: 4.006 ± 0.461
PheGlu: 3.878 ± 0.295
PhePhe: 1.066 ± 0.205
PheGly: 3.324 ± 0.394
PheHis: 0.852 ± 0.2
PheIle: 1.961 ± 0.23
PheLys: 1.279 ± 0.202
PheLeu: 2.259 ± 0.345
PheMet: 0.938 ± 0.18
PheAsn: 2.174 ± 0.329
PhePro: 1.705 ± 0.245
PheGln: 0.98 ± 0.186
PheArg: 1.918 ± 0.285
PheSer: 2.856 ± 0.351
PheThr: 1.833 ± 0.264
PheVal: 1.918 ± 0.34
PheTrp: 0.298 ± 0.105
PheTyr: 1.534 ± 0.273
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.944 ± 0.64
GlyCys: 0.554 ± 0.155
GlyAsp: 7.544 ± 0.578
GlyGlu: 7.501 ± 0.515
GlyPhe: 3.111 ± 0.295
GlyGly: 8.481 ± 1.614
GlyHis: 1.833 ± 0.27
GlyIle: 3.069 ± 0.359
GlyLys: 3.239 ± 0.296
GlyLeu: 4.347 ± 0.466
GlyMet: 1.577 ± 0.219
GlyAsn: 3.751 ± 0.517
GlyPro: 2.088 ± 0.27
GlyGln: 2.429 ± 0.274
GlyArg: 4.646 ± 0.559
GlySer: 5.626 ± 0.783
GlyThr: 3.964 ± 0.441
GlyVal: 4.987 ± 0.418
GlyTrp: 1.236 ± 0.178
GlyTyr: 3.111 ± 0.4
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.236 ± 0.33
HisCys: 0.085 ± 0.063
HisAsp: 2.131 ± 0.261
HisGlu: 1.747 ± 0.286
HisPhe: 0.81 ± 0.209
HisGly: 2.088 ± 0.308
HisHis: 0.767 ± 0.181
HisIle: 1.577 ± 0.294
HisLys: 0.725 ± 0.193
HisLeu: 1.62 ± 0.262
HisMet: 0.298 ± 0.115
HisAsn: 1.151 ± 0.197
HisPro: 1.279 ± 0.241
HisGln: 0.469 ± 0.132
HisArg: 0.895 ± 0.198
HisSer: 1.449 ± 0.23
HisThr: 1.108 ± 0.216
HisVal: 1.279 ± 0.229
HisTrp: 0.256 ± 0.094
HisTyr: 1.193 ± 0.231
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.856 ± 0.311
IleCys: 0.298 ± 0.118
IleAsp: 4.39 ± 0.46
IleGlu: 7.459 ± 0.504
IlePhe: 0.895 ± 0.247
IleGly: 2.898 ± 0.317
IleHis: 1.279 ± 0.271
IleIle: 2.515 ± 0.311
IleLys: 1.833 ± 0.255
IleLeu: 3.197 ± 0.433
IleMet: 1.023 ± 0.206
IleAsn: 2.003 ± 0.329
IlePro: 2.216 ± 0.287
IleGln: 1.364 ± 0.214
IleArg: 3.026 ± 0.409
IleSer: 3.495 ± 0.512
IleThr: 2.856 ± 0.305
IleVal: 3.026 ± 0.348
IleTrp: 0.298 ± 0.094
IleTyr: 1.066 ± 0.181
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.898 ± 0.414
LysCys: 0.384 ± 0.159
LysAsp: 2.77 ± 0.367
LysGlu: 3.921 ± 0.444
LysPhe: 1.875 ± 0.295
LysGly: 2.728 ± 0.31
LysHis: 0.938 ± 0.214
LysIle: 1.577 ± 0.276
LysLys: 2.088 ± 0.37
LysLeu: 3.452 ± 0.378
LysMet: 0.98 ± 0.182
LysAsn: 2.131 ± 0.307
LysPro: 1.79 ± 0.282
LysGln: 1.449 ± 0.261
LysArg: 3.069 ± 0.308
LysSer: 2.898 ± 0.328
LysThr: 2.77 ± 0.314
LysVal: 2.813 ± 0.359
LysTrp: 0.639 ± 0.18
LysTyr: 1.875 ± 0.286
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.498 ± 0.509
LeuCys: 0.426 ± 0.15
LeuAsp: 6.095 ± 0.499
LeuGlu: 7.288 ± 0.456
LeuPhe: 2.387 ± 0.372
LeuGly: 4.603 ± 0.435
LeuHis: 1.62 ± 0.284
LeuIle: 3.239 ± 0.279
LeuLys: 3.324 ± 0.392
LeuLeu: 4.433 ± 0.476
LeuMet: 2.046 ± 0.328
LeuAsn: 3.239 ± 0.418
LeuPro: 3.111 ± 0.327
LeuGln: 1.79 ± 0.244
LeuArg: 4.347 ± 0.503
LeuSer: 5.2 ± 0.464
LeuThr: 4.347 ± 0.49
LeuVal: 4.475 ± 0.469
LeuTrp: 0.852 ± 0.171
LeuTyr: 2.515 ± 0.389
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.088 ± 0.433
MetCys: 0.256 ± 0.105
MetAsp: 1.833 ± 0.271
MetGlu: 2.216 ± 0.285
MetPhe: 0.725 ± 0.216
MetGly: 1.492 ± 0.244
MetHis: 0.298 ± 0.101
MetIle: 0.852 ± 0.181
MetLys: 1.534 ± 0.259
MetLeu: 1.364 ± 0.283
MetMet: 0.767 ± 0.169
MetAsn: 1.62 ± 0.274
MetPro: 0.767 ± 0.175
MetGln: 0.384 ± 0.109
MetArg: 1.151 ± 0.22
MetSer: 2.472 ± 0.353
MetThr: 1.193 ± 0.269
MetVal: 1.236 ± 0.271
MetTrp: 0.341 ± 0.12
MetTyr: 0.511 ± 0.13
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.983 ± 0.306
AsnCys: 0.554 ± 0.181
AsnAsp: 3.239 ± 0.349
AsnGlu: 4.006 ± 0.391
AsnPhe: 1.961 ± 0.297
AsnGly: 4.305 ± 0.543
AsnHis: 1.108 ± 0.221
AsnIle: 2.642 ± 0.391
AsnLys: 1.79 ± 0.327
AsnLeu: 2.856 ± 0.333
AsnMet: 0.426 ± 0.136
AsnAsn: 2.131 ± 0.385
AsnPro: 2.77 ± 0.26
AsnGln: 1.577 ± 0.268
AsnArg: 3.239 ± 0.361
AsnSer: 3.154 ± 0.418
AsnThr: 2.429 ± 0.396
AsnVal: 2.429 ± 0.333
AsnTrp: 0.554 ± 0.128
AsnTyr: 1.577 ± 0.306
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.239 ± 0.378
ProCys: 0.511 ± 0.16
ProAsp: 5.328 ± 0.536
ProGlu: 5.114 ± 0.496
ProPhe: 1.577 ± 0.303
ProGly: 2.515 ± 0.287
ProHis: 0.98 ± 0.247
ProIle: 2.131 ± 0.357
ProLys: 2.046 ± 0.367
ProLeu: 2.6 ± 0.279
ProMet: 1.066 ± 0.174
ProAsn: 1.662 ± 0.256
ProPro: 2.216 ± 0.324
ProGln: 0.725 ± 0.179
ProArg: 1.961 ± 0.271
ProSer: 2.515 ± 0.329
ProThr: 2.983 ± 0.379
ProVal: 3.452 ± 0.355
ProTrp: 0.81 ± 0.201
ProTyr: 0.938 ± 0.238
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.046 ± 0.272
GlnCys: 0.213 ± 0.103
GlnAsp: 1.492 ± 0.303
GlnGlu: 2.301 ± 0.279
GlnPhe: 0.98 ± 0.185
GlnGly: 1.492 ± 0.257
GlnHis: 1.023 ± 0.204
GlnIle: 1.449 ± 0.25
GlnLys: 1.961 ± 0.283
GlnLeu: 2.983 ± 0.348
GlnMet: 1.236 ± 0.335
GlnAsn: 1.705 ± 0.282
GlnPro: 0.938 ± 0.201
GlnGln: 1.023 ± 0.226
GlnArg: 1.62 ± 0.254
GlnSer: 1.875 ± 0.265
GlnThr: 2.088 ± 0.218
GlnVal: 2.216 ± 0.282
GlnTrp: 0.298 ± 0.114
GlnTyr: 0.895 ± 0.182
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.282 ± 0.284
ArgCys: 0.554 ± 0.156
ArgAsp: 4.262 ± 0.471
ArgGlu: 5.029 ± 0.535
ArgPhe: 2.046 ± 0.275
ArgGly: 4.134 ± 0.486
ArgHis: 0.98 ± 0.199
ArgIle: 3.197 ± 0.445
ArgLys: 2.642 ± 0.324
ArgLeu: 3.878 ± 0.384
ArgMet: 1.577 ± 0.258
ArgAsn: 3.239 ± 0.328
ArgPro: 2.429 ± 0.365
ArgGln: 2.088 ± 0.29
ArgArg: 3.708 ± 0.661
ArgSer: 3.41 ± 0.268
ArgThr: 2.301 ± 0.347
ArgVal: 4.134 ± 0.413
ArgTrp: 0.895 ± 0.199
ArgTyr: 2.301 ± 0.329
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.305 ± 0.417
SerCys: 0.554 ± 0.169
SerAsp: 5.796 ± 0.681
SerGlu: 6.436 ± 0.533
SerPhe: 2.344 ± 0.25
SerGly: 6.564 ± 0.869
SerHis: 1.279 ± 0.228
SerIle: 3.282 ± 0.296
SerLys: 2.301 ± 0.282
SerLeu: 4.944 ± 0.405
SerMet: 1.875 ± 0.321
SerAsn: 2.983 ± 0.313
SerPro: 2.131 ± 0.306
SerGln: 1.833 ± 0.287
SerArg: 3.751 ± 0.391
SerSer: 4.262 ± 0.563
SerThr: 3.58 ± 0.421
SerVal: 4.56 ± 0.494
SerTrp: 1.151 ± 0.207
SerTyr: 2.429 ± 0.309
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.324 ± 0.383
ThrCys: 0.384 ± 0.14
ThrAsp: 4.944 ± 0.55
ThrGlu: 4.816 ± 0.462
ThrPhe: 2.174 ± 0.32
ThrGly: 4.475 ± 0.429
ThrHis: 1.62 ± 0.269
ThrIle: 3.367 ± 0.359
ThrLys: 2.557 ± 0.332
ThrLeu: 4.901 ± 0.518
ThrMet: 1.066 ± 0.223
ThrAsn: 2.003 ± 0.375
ThrPro: 3.026 ± 0.374
ThrGln: 1.918 ± 0.321
ThrArg: 2.387 ± 0.325
ThrSer: 3.069 ± 0.439
ThrThr: 3.878 ± 0.481
ThrVal: 4.433 ± 0.396
ThrTrp: 0.725 ± 0.176
ThrTyr: 1.833 ± 0.257
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.901 ± 0.509
ValCys: 0.682 ± 0.197
ValAsp: 6.223 ± 0.456
ValGlu: 6.052 ± 0.584
ValPhe: 2.131 ± 0.256
ValGly: 3.964 ± 0.291
ValHis: 1.364 ± 0.286
ValIle: 3.282 ± 0.397
ValLys: 2.983 ± 0.304
ValLeu: 3.921 ± 0.333
ValMet: 1.492 ± 0.267
ValAsn: 2.983 ± 0.378
ValPro: 3.282 ± 0.363
ValGln: 2.813 ± 0.284
ValArg: 3.58 ± 0.348
ValSer: 4.816 ± 0.47
ValThr: 4.901 ± 0.562
ValVal: 5.157 ± 0.498
ValTrp: 0.852 ± 0.183
ValTyr: 1.747 ± 0.267
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.98 ± 0.165
TrpCys: 0.213 ± 0.093
TrpAsp: 0.938 ± 0.164
TrpGlu: 1.321 ± 0.235
TrpPhe: 0.725 ± 0.168
TrpGly: 0.597 ± 0.163
TrpHis: 0.128 ± 0.077
TrpIle: 0.341 ± 0.118
TrpLys: 0.852 ± 0.216
TrpLeu: 1.321 ± 0.316
TrpMet: 0.597 ± 0.197
TrpAsn: 0.725 ± 0.169
TrpPro: 0.554 ± 0.171
TrpGln: 0.384 ± 0.124
TrpArg: 1.066 ± 0.239
TrpSer: 0.938 ± 0.201
TrpThr: 0.554 ± 0.16
TrpVal: 0.767 ± 0.172
TrpTrp: 0.213 ± 0.089
TrpTyr: 0.469 ± 0.134
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.6 ± 0.352
TyrCys: 0.554 ± 0.154
TyrAsp: 3.751 ± 0.388
TyrGlu: 3.58 ± 0.37
TyrPhe: 1.236 ± 0.223
TyrGly: 2.941 ± 0.319
TyrHis: 0.98 ± 0.232
TyrIle: 1.236 ± 0.216
TyrLys: 1.066 ± 0.21
TyrLeu: 2.472 ± 0.38
TyrMet: 0.511 ± 0.127
TyrAsn: 1.662 ± 0.246
TyrPro: 1.747 ± 0.262
TyrGln: 1.364 ± 0.216
TyrArg: 2.003 ± 0.321
TyrSer: 1.875 ± 0.277
TyrThr: 1.918 ± 0.299
TyrVal: 2.387 ± 0.295
TyrTrp: 0.682 ± 0.156
TyrTyr: 1.449 ± 0.285
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 125 proteins (23464 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
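One way such a protein-level bootstrap could look is sketched below: whole proteins are resampled with replacement, the per mille frequency is recomputed for each replicate, and the spread of the replicates is reported. Taking the sample standard deviation as the ± value, the fixed seed, and the function name are assumptions; the page only states that bootstrapping with 100 replicates was done at the protein level.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Estimate the spread (per mille) of one dipeptide's frequency.

    Assumption: the error is the standard deviation across replicates,
    each built by resampling whole proteins with replacement.
    """
    rng = random.Random(seed)
    # Count overlapping pairs once per protein, then resample the counts.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1))
        for seq in sequences
    ]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteins, not taken from the proteome.
error = bootstrap_error(["MAAG", "GEGE", "AAAA"], "AA")
```

Resampling proteins rather than individual residues keeps the adjacent pairs within each sequence intact, which is presumably why the estimate is made at the protein level.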

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski