Amino acid dipepetide frequency for Halobacterium virus ChaoS9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.767AlaAla: 10.767 ± 1.44
0.468AlaCys: 0.468 ± 0.231
8.953AlaAsp: 8.953 ± 0.758
8.777AlaGlu: 8.777 ± 0.813
2.048AlaPhe: 2.048 ± 0.344
9.304AlaGly: 9.304 ± 1.204
2.458AlaHis: 2.458 ± 0.463
3.452AlaIle: 3.452 ± 0.532
1.697AlaLys: 1.697 ± 0.355
7.782AlaLeu: 7.782 ± 0.831
3.511AlaMet: 3.511 ± 0.99
2.399AlaAsn: 2.399 ± 0.351
3.745AlaPro: 3.745 ± 0.461
3.101AlaGln: 3.101 ± 0.49
6.612AlaArg: 6.612 ± 0.809
5.442AlaSer: 5.442 ± 0.683
7.49AlaThr: 7.49 ± 0.719
7.548AlaVal: 7.548 ± 1.061
1.58AlaTrp: 1.58 ± 0.235
1.697AlaTyr: 1.697 ± 0.376
0.0AlaXaa: 0.0 ± 0.0
Cys
0.351CysAla: 0.351 ± 0.16
0.0CysCys: 0.0 ± 0.0
0.878CysAsp: 0.878 ± 0.255
0.702CysGlu: 0.702 ± 0.186
0.117CysPhe: 0.117 ± 0.073
1.229CysGly: 1.229 ± 0.362
0.41CysHis: 0.41 ± 0.152
0.293CysIle: 0.293 ± 0.135
0.351CysLys: 0.351 ± 0.146
0.468CysLeu: 0.468 ± 0.159
0.059CysMet: 0.059 ± 0.054
0.059CysAsn: 0.059 ± 0.056
0.644CysPro: 0.644 ± 0.226
0.293CysGln: 0.293 ± 0.153
1.17CysArg: 1.17 ± 0.281
1.053CysSer: 1.053 ± 0.262
0.293CysThr: 0.293 ± 0.14
0.176CysVal: 0.176 ± 0.102
0.234CysTrp: 0.234 ± 0.105
0.176CysTyr: 0.176 ± 0.093
0.0CysXaa: 0.0 ± 0.0
Asp
9.713AspAla: 9.713 ± 0.946
0.41AspCys: 0.41 ± 0.133
11.527AspAsp: 11.527 ± 1.031
12.288AspGlu: 12.288 ± 0.957
1.872AspPhe: 1.872 ± 0.344
8.836AspGly: 8.836 ± 0.961
2.75AspHis: 2.75 ± 0.382
4.389AspIle: 4.389 ± 0.555
1.287AspLys: 1.287 ± 0.287
7.724AspLeu: 7.724 ± 0.572
1.346AspMet: 1.346 ± 0.243
1.346AspAsn: 1.346 ± 0.236
5.383AspPro: 5.383 ± 0.562
2.75AspGln: 2.75 ± 0.394
6.554AspArg: 6.554 ± 0.49
3.745AspSer: 3.745 ± 0.483
3.862AspThr: 3.862 ± 0.56
10.474AspVal: 10.474 ± 0.784
1.638AspTrp: 1.638 ± 0.317
2.165AspTyr: 2.165 ± 0.341
0.0AspXaa: 0.0 ± 0.0
Glu
10.415GluAla: 10.415 ± 0.692
1.053GluCys: 1.053 ± 0.368
10.591GluAsp: 10.591 ± 0.845
8.426GluGlu: 8.426 ± 0.916
3.628GluPhe: 3.628 ± 0.459
6.671GluGly: 6.671 ± 0.874
1.697GluHis: 1.697 ± 0.335
4.857GluIle: 4.857 ± 0.541
2.516GluLys: 2.516 ± 0.432
7.548GluLeu: 7.548 ± 0.792
2.048GluMet: 2.048 ± 0.344
2.809GluAsn: 2.809 ± 0.48
5.091GluPro: 5.091 ± 0.671
4.915GluGln: 4.915 ± 0.721
7.08GluArg: 7.08 ± 0.829
4.74GluSer: 4.74 ± 0.444
7.899GluThr: 7.899 ± 0.817
7.139GluVal: 7.139 ± 0.839
1.17GluTrp: 1.17 ± 0.21
2.282GluTyr: 2.282 ± 0.368
0.0GluXaa: 0.0 ± 0.0
Phe
2.224PheAla: 2.224 ± 0.354
0.234PheCys: 0.234 ± 0.111
2.926PheAsp: 2.926 ± 0.352
3.92PheGlu: 3.92 ± 0.399
0.527PhePhe: 0.527 ± 0.161
1.463PheGly: 1.463 ± 0.279
0.468PheHis: 0.468 ± 0.16
0.878PheIle: 0.878 ± 0.254
0.878PheLys: 0.878 ± 0.276
1.346PheLeu: 1.346 ± 0.242
0.176PheMet: 0.176 ± 0.098
0.585PheAsn: 0.585 ± 0.194
1.112PhePro: 1.112 ± 0.238
1.17PheGln: 1.17 ± 0.254
1.638PheArg: 1.638 ± 0.263
1.58PheSer: 1.58 ± 0.3
1.521PheThr: 1.521 ± 0.227
1.755PheVal: 1.755 ± 0.292
0.585PheTrp: 0.585 ± 0.183
0.468PheTyr: 0.468 ± 0.168
0.0PheXaa: 0.0 ± 0.0
Gly
8.133GlyAla: 8.133 ± 1.208
0.936GlyCys: 0.936 ± 0.24
8.192GlyAsp: 8.192 ± 0.519
8.016GlyGlu: 8.016 ± 0.76
1.931GlyPhe: 1.931 ± 0.354
5.676GlyGly: 5.676 ± 0.821
1.463GlyHis: 1.463 ± 0.326
3.335GlyIle: 3.335 ± 0.485
1.931GlyLys: 1.931 ± 0.32
4.857GlyLeu: 4.857 ± 0.547
1.404GlyMet: 1.404 ± 0.323
1.931GlyAsn: 1.931 ± 0.306
3.16GlyPro: 3.16 ± 0.357
2.575GlyGln: 2.575 ± 0.393
4.74GlyArg: 4.74 ± 0.615
4.213GlySer: 4.213 ± 0.409
4.74GlyThr: 4.74 ± 0.626
5.032GlyVal: 5.032 ± 0.528
1.287GlyTrp: 1.287 ± 0.256
2.106GlyTyr: 2.106 ± 0.38
0.0GlyXaa: 0.0 ± 0.0
His
1.521HisAla: 1.521 ± 0.313
0.176HisCys: 0.176 ± 0.083
2.75HisAsp: 2.75 ± 0.5
2.165HisGlu: 2.165 ± 0.345
0.644HisPhe: 0.644 ± 0.192
1.638HisGly: 1.638 ± 0.329
0.644HisHis: 0.644 ± 0.2
0.819HisIle: 0.819 ± 0.208
0.585HisLys: 0.585 ± 0.178
1.463HisLeu: 1.463 ± 0.346
0.293HisMet: 0.293 ± 0.157
0.761HisAsn: 0.761 ± 0.178
1.346HisPro: 1.346 ± 0.305
0.819HisGln: 0.819 ± 0.214
1.697HisArg: 1.697 ± 0.433
1.346HisSer: 1.346 ± 0.263
0.527HisThr: 0.527 ± 0.189
1.404HisVal: 1.404 ± 0.243
0.41HisTrp: 0.41 ± 0.161
0.761HisTyr: 0.761 ± 0.232
0.0HisXaa: 0.0 ± 0.0
Ile
4.154IleAla: 4.154 ± 0.456
0.293IleCys: 0.293 ± 0.121
5.266IleAsp: 5.266 ± 0.568
5.208IleGlu: 5.208 ± 0.521
1.463IlePhe: 1.463 ± 0.332
3.394IleGly: 3.394 ± 0.432
1.404IleHis: 1.404 ± 0.302
1.463IleIle: 1.463 ± 0.314
0.995IleLys: 0.995 ± 0.22
3.335IleLeu: 3.335 ± 0.37
0.585IleMet: 0.585 ± 0.144
1.17IleAsn: 1.17 ± 0.285
1.931IlePro: 1.931 ± 0.314
1.872IleGln: 1.872 ± 0.35
2.575IleArg: 2.575 ± 0.373
2.458IleSer: 2.458 ± 0.359
2.165IleThr: 2.165 ± 0.37
2.75IleVal: 2.75 ± 0.402
0.41IleTrp: 0.41 ± 0.162
0.702IleTyr: 0.702 ± 0.185
0.0IleXaa: 0.0 ± 0.0
Lys
1.931LysAla: 1.931 ± 0.308
0.059LysCys: 0.059 ± 0.059
1.463LysAsp: 1.463 ± 0.31
1.755LysGlu: 1.755 ± 0.383
0.819LysPhe: 0.819 ± 0.228
1.112LysGly: 1.112 ± 0.283
0.644LysHis: 0.644 ± 0.183
0.936LysIle: 0.936 ± 0.234
0.527LysLys: 0.527 ± 0.183
1.872LysLeu: 1.872 ± 0.365
0.41LysMet: 0.41 ± 0.132
0.936LysAsn: 0.936 ± 0.179
1.287LysPro: 1.287 ± 0.296
0.936LysGln: 0.936 ± 0.193
2.224LysArg: 2.224 ± 0.419
1.814LysSer: 1.814 ± 0.352
1.521LysThr: 1.521 ± 0.237
1.346LysVal: 1.346 ± 0.326
0.293LysTrp: 0.293 ± 0.13
1.053LysTyr: 1.053 ± 0.289
0.0LysXaa: 0.0 ± 0.0
Leu
7.841LeuAla: 7.841 ± 0.727
0.761LeuCys: 0.761 ± 0.236
7.607LeuAsp: 7.607 ± 0.656
8.602LeuGlu: 8.602 ± 0.581
1.931LeuPhe: 1.931 ± 0.385
5.325LeuGly: 5.325 ± 0.609
1.346LeuHis: 1.346 ± 0.288
2.867LeuIle: 2.867 ± 0.356
2.106LeuLys: 2.106 ± 0.376
4.798LeuLeu: 4.798 ± 0.57
1.17LeuMet: 1.17 ± 0.267
2.399LeuAsn: 2.399 ± 0.44
3.16LeuPro: 3.16 ± 0.403
2.867LeuGln: 2.867 ± 0.397
5.617LeuArg: 5.617 ± 0.653
4.681LeuSer: 4.681 ± 0.494
4.096LeuThr: 4.096 ± 0.591
6.085LeuVal: 6.085 ± 0.567
0.819LeuTrp: 0.819 ± 0.238
1.404LeuTyr: 1.404 ± 0.293
0.0LeuXaa: 0.0 ± 0.0
Met
1.697MetAla: 1.697 ± 0.322
0.293MetCys: 0.293 ± 0.126
1.989MetAsp: 1.989 ± 0.759
1.346MetGlu: 1.346 ± 0.26
0.41MetPhe: 0.41 ± 0.149
0.995MetGly: 0.995 ± 0.252
0.351MetHis: 0.351 ± 0.146
0.761MetIle: 0.761 ± 0.209
1.053MetLys: 1.053 ± 0.274
1.287MetLeu: 1.287 ± 0.255
0.41MetMet: 0.41 ± 0.146
0.468MetAsn: 0.468 ± 0.164
0.995MetPro: 0.995 ± 0.287
0.644MetGln: 0.644 ± 0.166
0.644MetArg: 0.644 ± 0.193
1.989MetSer: 1.989 ± 0.34
1.931MetThr: 1.931 ± 0.352
1.229MetVal: 1.229 ± 0.253
0.176MetTrp: 0.176 ± 0.103
0.468MetTyr: 0.468 ± 0.142
0.0MetXaa: 0.0 ± 0.0
Asn
2.341AsnAla: 2.341 ± 0.379
0.41AsnCys: 0.41 ± 0.149
1.638AsnAsp: 1.638 ± 0.364
1.989AsnGlu: 1.989 ± 0.302
0.527AsnPhe: 0.527 ± 0.151
2.341AsnGly: 2.341 ± 0.308
0.351AsnHis: 0.351 ± 0.121
0.936AsnIle: 0.936 ± 0.216
0.41AsnLys: 0.41 ± 0.179
2.224AsnLeu: 2.224 ± 0.372
0.585AsnMet: 0.585 ± 0.172
0.644AsnAsn: 0.644 ± 0.18
1.931AsnPro: 1.931 ± 0.341
1.17AsnGln: 1.17 ± 0.26
1.463AsnArg: 1.463 ± 0.366
1.287AsnSer: 1.287 ± 0.222
1.346AsnThr: 1.346 ± 0.268
2.165AsnVal: 2.165 ± 0.501
0.468AsnTrp: 0.468 ± 0.163
0.819AsnTyr: 0.819 ± 0.229
0.0AsnXaa: 0.0 ± 0.0
Pro
3.628ProAla: 3.628 ± 0.477
0.176ProCys: 0.176 ± 0.102
5.208ProAsp: 5.208 ± 0.628
5.383ProGlu: 5.383 ± 0.742
1.521ProPhe: 1.521 ± 0.364
3.92ProGly: 3.92 ± 0.467
0.585ProHis: 0.585 ± 0.189
1.872ProIle: 1.872 ± 0.291
1.17ProLys: 1.17 ± 0.289
3.101ProLeu: 3.101 ± 0.47
0.468ProMet: 0.468 ± 0.17
1.229ProAsn: 1.229 ± 0.273
1.58ProPro: 1.58 ± 0.383
1.229ProGln: 1.229 ± 0.243
3.335ProArg: 3.335 ± 0.431
2.399ProSer: 2.399 ± 0.392
2.867ProThr: 2.867 ± 0.409
3.569ProVal: 3.569 ± 0.525
0.819ProTrp: 0.819 ± 0.246
1.17ProTyr: 1.17 ± 0.321
0.0ProXaa: 0.0 ± 0.0
Gln
3.862GlnAla: 3.862 ± 0.502
0.585GlnCys: 0.585 ± 0.239
1.872GlnAsp: 1.872 ± 0.324
3.335GlnGlu: 3.335 ± 0.519
0.761GlnPhe: 0.761 ± 0.193
1.58GlnGly: 1.58 ± 0.301
0.644GlnHis: 0.644 ± 0.169
1.404GlnIle: 1.404 ± 0.306
0.761GlnLys: 0.761 ± 0.236
4.974GlnLeu: 4.974 ± 0.556
0.702GlnMet: 0.702 ± 0.179
1.229GlnAsn: 1.229 ± 0.304
1.346GlnPro: 1.346 ± 0.311
1.404GlnGln: 1.404 ± 0.294
2.926GlnArg: 2.926 ± 0.481
1.755GlnSer: 1.755 ± 0.304
2.867GlnThr: 2.867 ± 0.513
1.638GlnVal: 1.638 ± 0.265
0.527GlnTrp: 0.527 ± 0.179
1.17GlnTyr: 1.17 ± 0.226
0.0GlnXaa: 0.0 ± 0.0
Arg
5.676ArgAla: 5.676 ± 0.549
0.936ArgCys: 0.936 ± 0.204
5.91ArgAsp: 5.91 ± 0.578
6.963ArgGlu: 6.963 ± 0.587
2.106ArgPhe: 2.106 ± 0.288
4.33ArgGly: 4.33 ± 0.425
1.404ArgHis: 1.404 ± 0.353
4.037ArgIle: 4.037 ± 0.481
1.638ArgLys: 1.638 ± 0.277
5.5ArgLeu: 5.5 ± 0.766
1.755ArgMet: 1.755 ± 0.26
1.697ArgAsn: 1.697 ± 0.302
2.516ArgPro: 2.516 ± 0.355
2.692ArgGln: 2.692 ± 0.289
5.383ArgArg: 5.383 ± 0.621
3.686ArgSer: 3.686 ± 0.496
3.569ArgThr: 3.569 ± 0.463
4.798ArgVal: 4.798 ± 0.499
0.878ArgTrp: 0.878 ± 0.233
1.931ArgTyr: 1.931 ± 0.338
0.0ArgXaa: 0.0 ± 0.0
Ser
5.442SerAla: 5.442 ± 0.612
0.41SerCys: 0.41 ± 0.18
5.091SerAsp: 5.091 ± 0.495
5.5SerGlu: 5.5 ± 0.687
1.463SerPhe: 1.463 ± 0.292
5.5SerGly: 5.5 ± 0.539
1.229SerHis: 1.229 ± 0.263
3.452SerIle: 3.452 ± 0.402
1.521SerLys: 1.521 ± 0.311
3.569SerLeu: 3.569 ± 0.37
1.346SerMet: 1.346 ± 0.22
1.521SerAsn: 1.521 ± 0.263
2.809SerPro: 2.809 ± 0.394
1.872SerGln: 1.872 ± 0.513
3.218SerArg: 3.218 ± 0.546
3.16SerSer: 3.16 ± 0.377
3.862SerThr: 3.862 ± 0.613
3.628SerVal: 3.628 ± 0.554
0.761SerTrp: 0.761 ± 0.201
1.58SerTyr: 1.58 ± 0.322
0.0SerXaa: 0.0 ± 0.0
Thr
7.314ThrAla: 7.314 ± 1.117
0.468ThrCys: 0.468 ± 0.179
4.915ThrAsp: 4.915 ± 0.424
5.734ThrGlu: 5.734 ± 0.721
1.521ThrPhe: 1.521 ± 0.244
3.862ThrGly: 3.862 ± 0.5
0.995ThrHis: 0.995 ± 0.244
3.335ThrIle: 3.335 ± 0.549
1.229ThrLys: 1.229 ± 0.253
5.676ThrLeu: 5.676 ± 0.513
1.17ThrMet: 1.17 ± 0.244
1.346ThrAsn: 1.346 ± 0.348
3.394ThrPro: 3.394 ± 0.479
0.995ThrGln: 0.995 ± 0.211
2.692ThrArg: 2.692 ± 0.495
4.623ThrSer: 4.623 ± 0.655
5.091ThrThr: 5.091 ± 0.721
5.266ThrVal: 5.266 ± 0.46
0.819ThrTrp: 0.819 ± 0.206
1.638ThrTyr: 1.638 ± 0.277
0.0ThrXaa: 0.0 ± 0.0
Val
8.133ValAla: 8.133 ± 0.906
0.644ValCys: 0.644 ± 0.206
9.538ValAsp: 9.538 ± 0.939
8.426ValGlu: 8.426 ± 0.684
1.229ValPhe: 1.229 ± 0.23
5.442ValGly: 5.442 ± 0.638
1.638ValHis: 1.638 ± 0.271
2.75ValIle: 2.75 ± 0.315
1.521ValLys: 1.521 ± 0.323
5.091ValLeu: 5.091 ± 0.518
1.229ValMet: 1.229 ± 0.28
1.521ValAsn: 1.521 ± 0.28
2.458ValPro: 2.458 ± 0.489
2.575ValGln: 2.575 ± 0.358
4.74ValArg: 4.74 ± 0.533
4.857ValSer: 4.857 ± 0.726
4.564ValThr: 4.564 ± 0.536
5.851ValVal: 5.851 ± 0.716
0.995ValTrp: 0.995 ± 0.262
1.638ValTyr: 1.638 ± 0.314
0.0ValXaa: 0.0 ± 0.0
Trp
1.287TrpAla: 1.287 ± 0.267
0.117TrpCys: 0.117 ± 0.088
1.17TrpAsp: 1.17 ± 0.301
1.17TrpGlu: 1.17 ± 0.251
0.468TrpPhe: 0.468 ± 0.149
0.878TrpGly: 0.878 ± 0.2
0.527TrpHis: 0.527 ± 0.197
0.702TrpIle: 0.702 ± 0.168
0.293TrpLys: 0.293 ± 0.133
1.463TrpLeu: 1.463 ± 0.256
0.234TrpMet: 0.234 ± 0.098
0.41TrpAsn: 0.41 ± 0.138
0.468TrpPro: 0.468 ± 0.137
0.351TrpGln: 0.351 ± 0.124
1.17TrpArg: 1.17 ± 0.245
0.995TrpSer: 0.995 ± 0.275
1.112TrpThr: 1.112 ± 0.247
1.053TrpVal: 1.053 ± 0.247
0.176TrpTrp: 0.176 ± 0.092
0.293TrpTyr: 0.293 ± 0.136
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.399TyrAla: 2.399 ± 0.29
0.527TyrCys: 0.527 ± 0.163
2.575TyrAsp: 2.575 ± 0.354
2.867TyrGlu: 2.867 ± 0.445
0.468TyrPhe: 0.468 ± 0.161
2.165TyrGly: 2.165 ± 0.38
0.761TyrHis: 0.761 ± 0.204
1.053TyrIle: 1.053 ± 0.246
0.468TyrLys: 0.468 ± 0.176
1.404TyrLeu: 1.404 ± 0.287
0.176TyrMet: 0.176 ± 0.091
0.585TyrAsn: 0.585 ± 0.234
0.936TyrPro: 0.936 ± 0.198
1.112TyrGln: 1.112 ± 0.296
1.931TyrArg: 1.931 ± 0.373
1.112TyrSer: 1.112 ± 0.236
0.761TyrThr: 0.761 ± 0.211
1.931TyrVal: 1.931 ± 0.415
0.293TyrTrp: 0.293 ± 0.12
0.819TyrTyr: 0.819 ± 0.237
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 83 proteins (17091 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski