Amino acid dipepetide frequency for Haloferax tailed virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.086AlaAla: 6.086 ± 0.738
0.882AlaCys: 0.882 ± 0.276
7.321AlaAsp: 7.321 ± 0.846
6.792AlaGlu: 6.792 ± 1.062
2.558AlaPhe: 2.558 ± 0.362
5.028AlaGly: 5.028 ± 0.762
1.764AlaHis: 1.764 ± 0.321
4.587AlaIle: 4.587 ± 0.705
4.322AlaLys: 4.322 ± 0.683
6.792AlaLeu: 6.792 ± 0.84
1.5AlaMet: 1.5 ± 0.436
3.087AlaAsn: 3.087 ± 0.662
2.823AlaPro: 2.823 ± 0.523
2.558AlaGln: 2.558 ± 0.521
3.705AlaArg: 3.705 ± 0.598
4.851AlaSer: 4.851 ± 0.707
5.204AlaThr: 5.204 ± 0.782
4.763AlaVal: 4.763 ± 0.516
1.676AlaTrp: 1.676 ± 0.406
2.646AlaTyr: 2.646 ± 0.456
0.0AlaXaa: 0.0 ± 0.0
Cys
0.529CysAla: 0.529 ± 0.224
0.176CysCys: 0.176 ± 0.165
0.882CysAsp: 0.882 ± 0.404
1.058CysGlu: 1.058 ± 0.401
0.265CysPhe: 0.265 ± 0.206
1.235CysGly: 1.235 ± 0.444
0.088CysHis: 0.088 ± 0.083
0.265CysIle: 0.265 ± 0.15
0.617CysLys: 0.617 ± 0.238
0.353CysLeu: 0.353 ± 0.187
0.176CysMet: 0.176 ± 0.13
0.353CysAsn: 0.353 ± 0.173
0.97CysPro: 0.97 ± 0.387
0.706CysGln: 0.706 ± 0.262
0.794CysArg: 0.794 ± 0.241
1.058CysSer: 1.058 ± 0.445
0.441CysThr: 0.441 ± 0.196
0.617CysVal: 0.617 ± 0.281
0.176CysTrp: 0.176 ± 0.126
0.176CysTyr: 0.176 ± 0.133
0.0CysXaa: 0.0 ± 0.0
Asp
8.291AspAla: 8.291 ± 0.565
1.323AspCys: 1.323 ± 0.374
8.909AspAsp: 8.909 ± 0.731
8.644AspGlu: 8.644 ± 0.962
3.175AspPhe: 3.175 ± 0.518
7.674AspGly: 7.674 ± 0.78
1.235AspHis: 1.235 ± 0.327
4.675AspIle: 4.675 ± 0.86
3.264AspLys: 3.264 ± 0.687
6.086AspLeu: 6.086 ± 0.754
2.117AspMet: 2.117 ± 0.532
3.264AspAsn: 3.264 ± 0.608
4.058AspPro: 4.058 ± 0.55
1.941AspGln: 1.941 ± 0.443
3.793AspArg: 3.793 ± 0.769
6.351AspSer: 6.351 ± 0.789
6.174AspThr: 6.174 ± 1.054
7.85AspVal: 7.85 ± 0.82
0.97AspTrp: 0.97 ± 0.276
3.352AspTyr: 3.352 ± 0.601
0.0AspXaa: 0.0 ± 0.0
Glu
6.792GluAla: 6.792 ± 0.992
1.676GluCys: 1.676 ± 0.532
5.381GluAsp: 5.381 ± 0.733
6.616GluGlu: 6.616 ± 1.08
3.881GluPhe: 3.881 ± 0.559
4.94GluGly: 4.94 ± 0.729
1.411GluHis: 1.411 ± 0.317
4.763GluIle: 4.763 ± 0.635
3.793GluLys: 3.793 ± 0.756
9.085GluLeu: 9.085 ± 0.926
2.911GluMet: 2.911 ± 0.524
3.793GluAsn: 3.793 ± 0.596
2.999GluPro: 2.999 ± 0.459
3.881GluGln: 3.881 ± 0.56
4.499GluArg: 4.499 ± 0.738
4.587GluSer: 4.587 ± 0.584
5.645GluThr: 5.645 ± 0.715
7.321GluVal: 7.321 ± 0.951
2.558GluTrp: 2.558 ± 0.618
2.382GluTyr: 2.382 ± 0.446
0.0GluXaa: 0.0 ± 0.0
Phe
2.47PheAla: 2.47 ± 0.486
0.441PheCys: 0.441 ± 0.211
3.616PheAsp: 3.616 ± 0.652
3.528PheGlu: 3.528 ± 0.519
0.706PhePhe: 0.706 ± 0.317
3.44PheGly: 3.44 ± 0.491
0.617PheHis: 0.617 ± 0.258
0.97PheIle: 0.97 ± 0.279
1.411PheLys: 1.411 ± 0.302
1.852PheLeu: 1.852 ± 0.414
0.265PheMet: 0.265 ± 0.135
1.235PheAsn: 1.235 ± 0.328
1.235PhePro: 1.235 ± 0.309
0.529PheGln: 0.529 ± 0.217
2.646PheArg: 2.646 ± 0.515
2.823PheSer: 2.823 ± 0.575
2.029PheThr: 2.029 ± 0.463
1.941PheVal: 1.941 ± 0.402
0.176PheTrp: 0.176 ± 0.153
0.97PheTyr: 0.97 ± 0.246
0.0PheXaa: 0.0 ± 0.0
Gly
4.763GlyAla: 4.763 ± 0.606
0.617GlyCys: 0.617 ± 0.25
7.409GlyAsp: 7.409 ± 0.981
8.38GlyGlu: 8.38 ± 1.118
2.823GlyPhe: 2.823 ± 0.487
7.321GlyGly: 7.321 ± 1.28
0.97GlyHis: 0.97 ± 0.313
2.558GlyIle: 2.558 ± 0.45
2.911GlyLys: 2.911 ± 0.504
4.675GlyLeu: 4.675 ± 0.556
1.323GlyMet: 1.323 ± 0.391
2.646GlyAsn: 2.646 ± 0.486
1.323GlyPro: 1.323 ± 0.29
2.029GlyGln: 2.029 ± 0.411
4.058GlyArg: 4.058 ± 0.676
5.469GlySer: 5.469 ± 1.024
6.351GlyThr: 6.351 ± 0.836
5.822GlyVal: 5.822 ± 0.675
1.5GlyTrp: 1.5 ± 0.36
1.676GlyTyr: 1.676 ± 0.326
0.0GlyXaa: 0.0 ± 0.0
His
2.029HisAla: 2.029 ± 0.33
0.088HisCys: 0.088 ± 0.093
1.058HisAsp: 1.058 ± 0.254
1.764HisGlu: 1.764 ± 0.337
0.441HisPhe: 0.441 ± 0.186
1.852HisGly: 1.852 ± 0.399
0.441HisHis: 0.441 ± 0.27
0.97HisIle: 0.97 ± 0.219
0.794HisLys: 0.794 ± 0.265
0.441HisLeu: 0.441 ± 0.188
0.088HisMet: 0.088 ± 0.083
0.617HisAsn: 0.617 ± 0.271
1.147HisPro: 1.147 ± 0.257
0.882HisGln: 0.882 ± 0.302
1.235HisArg: 1.235 ± 0.378
0.794HisSer: 0.794 ± 0.239
1.676HisThr: 1.676 ± 0.424
2.205HisVal: 2.205 ± 0.45
0.0HisTrp: 0.0 ± 0.0
0.882HisTyr: 0.882 ± 0.366
0.0HisXaa: 0.0 ± 0.0
Ile
3.616IleAla: 3.616 ± 0.593
0.353IleCys: 0.353 ± 0.157
6.174IleAsp: 6.174 ± 0.892
3.881IleGlu: 3.881 ± 0.412
0.97IlePhe: 0.97 ± 0.374
3.087IleGly: 3.087 ± 0.438
1.764IleHis: 1.764 ± 0.387
1.676IleIle: 1.676 ± 0.339
1.764IleLys: 1.764 ± 0.391
2.734IleLeu: 2.734 ± 0.481
0.794IleMet: 0.794 ± 0.265
2.117IleAsn: 2.117 ± 0.355
2.734IlePro: 2.734 ± 0.414
1.676IleGln: 1.676 ± 0.375
2.999IleArg: 2.999 ± 0.59
2.823IleSer: 2.823 ± 0.637
2.293IleThr: 2.293 ± 0.594
2.47IleVal: 2.47 ± 0.381
0.353IleTrp: 0.353 ± 0.17
0.617IleTyr: 0.617 ± 0.25
0.0IleXaa: 0.0 ± 0.0
Lys
3.793LysAla: 3.793 ± 0.842
0.353LysCys: 0.353 ± 0.191
2.999LysAsp: 2.999 ± 0.535
3.352LysGlu: 3.352 ± 0.665
1.058LysPhe: 1.058 ± 0.327
1.764LysGly: 1.764 ± 0.499
0.97LysHis: 0.97 ± 0.332
1.411LysIle: 1.411 ± 0.371
1.411LysLys: 1.411 ± 0.354
2.823LysLeu: 2.823 ± 0.423
0.353LysMet: 0.353 ± 0.174
1.852LysAsn: 1.852 ± 0.445
1.588LysPro: 1.588 ± 0.322
1.411LysGln: 1.411 ± 0.322
2.999LysArg: 2.999 ± 0.539
2.734LysSer: 2.734 ± 0.511
2.823LysThr: 2.823 ± 0.472
3.087LysVal: 3.087 ± 0.619
0.353LysTrp: 0.353 ± 0.211
1.058LysTyr: 1.058 ± 0.285
0.0LysXaa: 0.0 ± 0.0
Leu
5.381LeuAla: 5.381 ± 0.874
0.441LeuCys: 0.441 ± 0.189
7.145LeuAsp: 7.145 ± 0.769
6.968LeuGlu: 6.968 ± 0.663
1.941LeuPhe: 1.941 ± 0.423
4.851LeuGly: 4.851 ± 0.712
1.235LeuHis: 1.235 ± 0.32
2.029LeuIle: 2.029 ± 0.392
3.175LeuLys: 3.175 ± 0.545
3.969LeuLeu: 3.969 ± 0.559
1.588LeuMet: 1.588 ± 0.489
1.852LeuAsn: 1.852 ± 0.328
2.47LeuPro: 2.47 ± 0.472
3.087LeuGln: 3.087 ± 0.441
4.587LeuArg: 4.587 ± 0.756
4.851LeuSer: 4.851 ± 0.706
5.469LeuThr: 5.469 ± 0.679
3.969LeuVal: 3.969 ± 0.58
0.794LeuTrp: 0.794 ± 0.265
1.764LeuTyr: 1.764 ± 0.315
0.0LeuXaa: 0.0 ± 0.0
Met
1.764MetAla: 1.764 ± 0.372
0.088MetCys: 0.088 ± 0.088
2.47MetAsp: 2.47 ± 0.538
1.588MetGlu: 1.588 ± 0.401
0.441MetPhe: 0.441 ± 0.185
1.147MetGly: 1.147 ± 0.398
0.265MetHis: 0.265 ± 0.151
0.617MetIle: 0.617 ± 0.231
0.529MetLys: 0.529 ± 0.211
1.235MetLeu: 1.235 ± 0.349
0.353MetMet: 0.353 ± 0.266
0.617MetAsn: 0.617 ± 0.288
1.235MetPro: 1.235 ± 0.302
0.265MetGln: 0.265 ± 0.133
0.97MetArg: 0.97 ± 0.292
2.646MetSer: 2.646 ± 0.513
2.47MetThr: 2.47 ± 0.456
0.882MetVal: 0.882 ± 0.212
0.088MetTrp: 0.088 ± 0.076
0.441MetTyr: 0.441 ± 0.212
0.0MetXaa: 0.0 ± 0.0
Asn
3.175AsnAla: 3.175 ± 0.55
0.176AsnCys: 0.176 ± 0.165
3.087AsnAsp: 3.087 ± 0.483
3.616AsnGlu: 3.616 ± 0.567
1.323AsnPhe: 1.323 ± 0.353
2.558AsnGly: 2.558 ± 0.436
1.235AsnHis: 1.235 ± 0.403
1.676AsnIle: 1.676 ± 0.335
1.235AsnLys: 1.235 ± 0.384
2.734AsnLeu: 2.734 ± 0.47
0.529AsnMet: 0.529 ± 0.17
1.411AsnAsn: 1.411 ± 0.412
1.588AsnPro: 1.588 ± 0.282
1.323AsnGln: 1.323 ± 0.372
2.293AsnArg: 2.293 ± 0.466
2.47AsnSer: 2.47 ± 0.369
3.175AsnThr: 3.175 ± 0.74
3.705AsnVal: 3.705 ± 0.661
0.882AsnTrp: 0.882 ± 0.259
0.529AsnTyr: 0.529 ± 0.343
0.0AsnXaa: 0.0 ± 0.0
Pro
2.911ProAla: 2.911 ± 0.53
0.353ProCys: 0.353 ± 0.15
4.146ProAsp: 4.146 ± 0.556
3.793ProGlu: 3.793 ± 0.694
1.411ProPhe: 1.411 ± 0.341
2.646ProGly: 2.646 ± 0.542
0.882ProHis: 0.882 ± 0.241
2.47ProIle: 2.47 ± 0.482
2.029ProLys: 2.029 ± 0.403
2.646ProLeu: 2.646 ± 0.432
1.058ProMet: 1.058 ± 0.327
2.117ProAsn: 2.117 ± 0.448
1.676ProPro: 1.676 ± 0.413
1.411ProGln: 1.411 ± 0.371
2.558ProArg: 2.558 ± 0.46
3.087ProSer: 3.087 ± 0.414
3.352ProThr: 3.352 ± 0.567
2.382ProVal: 2.382 ± 0.496
0.529ProTrp: 0.529 ± 0.225
0.794ProTyr: 0.794 ± 0.249
0.0ProXaa: 0.0 ± 0.0
Gln
3.264GlnAla: 3.264 ± 0.646
0.353GlnCys: 0.353 ± 0.189
2.47GlnAsp: 2.47 ± 0.436
2.911GlnGlu: 2.911 ± 0.498
1.235GlnPhe: 1.235 ± 0.289
2.117GlnGly: 2.117 ± 0.574
0.794GlnHis: 0.794 ± 0.292
1.588GlnIle: 1.588 ± 0.552
0.794GlnLys: 0.794 ± 0.318
2.646GlnLeu: 2.646 ± 0.468
1.058GlnMet: 1.058 ± 0.279
1.235GlnAsn: 1.235 ± 0.212
1.764GlnPro: 1.764 ± 0.458
1.5GlnGln: 1.5 ± 0.338
2.029GlnArg: 2.029 ± 0.455
2.823GlnSer: 2.823 ± 0.431
3.528GlnThr: 3.528 ± 0.86
1.411GlnVal: 1.411 ± 0.315
0.617GlnTrp: 0.617 ± 0.222
0.529GlnTyr: 0.529 ± 0.267
0.0GlnXaa: 0.0 ± 0.0
Arg
3.616ArgAla: 3.616 ± 0.608
0.794ArgCys: 0.794 ± 0.287
4.851ArgAsp: 4.851 ± 0.818
5.292ArgGlu: 5.292 ± 0.763
2.205ArgPhe: 2.205 ± 0.419
3.881ArgGly: 3.881 ± 0.59
1.058ArgHis: 1.058 ± 0.248
3.528ArgIle: 3.528 ± 0.47
2.205ArgLys: 2.205 ± 0.429
3.175ArgLeu: 3.175 ± 0.539
1.676ArgMet: 1.676 ± 0.4
1.764ArgAsn: 1.764 ± 0.358
2.382ArgPro: 2.382 ± 0.324
2.382ArgGln: 2.382 ± 0.505
3.616ArgArg: 3.616 ± 0.613
3.087ArgSer: 3.087 ± 0.467
2.999ArgThr: 2.999 ± 0.622
5.028ArgVal: 5.028 ± 0.77
0.882ArgTrp: 0.882 ± 0.317
2.117ArgTyr: 2.117 ± 0.451
0.0ArgXaa: 0.0 ± 0.0
Ser
6.704SerAla: 6.704 ± 0.818
0.441SerCys: 0.441 ± 0.233
7.057SerAsp: 7.057 ± 0.676
6.263SerGlu: 6.263 ± 0.662
2.293SerPhe: 2.293 ± 0.431
6.88SerGly: 6.88 ± 0.953
0.97SerHis: 0.97 ± 0.292
2.558SerIle: 2.558 ± 0.472
2.823SerLys: 2.823 ± 0.523
3.264SerLeu: 3.264 ± 0.608
1.058SerMet: 1.058 ± 0.311
2.47SerAsn: 2.47 ± 0.427
3.264SerPro: 3.264 ± 0.401
2.205SerGln: 2.205 ± 0.416
3.087SerArg: 3.087 ± 0.473
5.381SerSer: 5.381 ± 0.888
4.146SerThr: 4.146 ± 0.91
6.263SerVal: 6.263 ± 0.916
1.147SerTrp: 1.147 ± 0.284
2.205SerTyr: 2.205 ± 0.409
0.0SerXaa: 0.0 ± 0.0
Thr
5.204ThrAla: 5.204 ± 0.861
0.529ThrCys: 0.529 ± 0.239
5.998ThrAsp: 5.998 ± 0.751
3.528ThrGlu: 3.528 ± 0.577
2.382ThrPhe: 2.382 ± 0.5
6.704ThrGly: 6.704 ± 0.653
1.235ThrHis: 1.235 ± 0.431
3.881ThrIle: 3.881 ± 0.599
1.235ThrLys: 1.235 ± 0.342
6.792ThrLeu: 6.792 ± 0.955
0.706ThrMet: 0.706 ± 0.36
3.793ThrAsn: 3.793 ± 0.608
4.146ThrPro: 4.146 ± 0.534
2.558ThrGln: 2.558 ± 0.377
4.234ThrArg: 4.234 ± 0.522
4.146ThrSer: 4.146 ± 0.776
5.292ThrThr: 5.292 ± 0.967
5.645ThrVal: 5.645 ± 1.225
0.882ThrTrp: 0.882 ± 0.327
2.734ThrTyr: 2.734 ± 0.469
0.0ThrXaa: 0.0 ± 0.0
Val
5.381ValAla: 5.381 ± 0.809
0.794ValCys: 0.794 ± 0.324
8.203ValAsp: 8.203 ± 0.859
7.057ValGlu: 7.057 ± 0.869
2.734ValPhe: 2.734 ± 0.539
4.587ValGly: 4.587 ± 0.577
1.058ValHis: 1.058 ± 0.299
3.264ValIle: 3.264 ± 0.536
2.823ValLys: 2.823 ± 0.525
2.911ValLeu: 2.911 ± 0.496
1.764ValMet: 1.764 ± 0.355
2.734ValAsn: 2.734 ± 0.39
2.999ValPro: 2.999 ± 0.555
2.999ValGln: 2.999 ± 0.49
4.322ValArg: 4.322 ± 0.559
6.704ValSer: 6.704 ± 0.788
5.469ValThr: 5.469 ± 0.88
4.234ValVal: 4.234 ± 0.689
1.323ValTrp: 1.323 ± 0.321
1.588ValTyr: 1.588 ± 0.445
0.0ValXaa: 0.0 ± 0.0
Trp
0.794TrpAla: 0.794 ± 0.352
0.265TrpCys: 0.265 ± 0.152
1.235TrpAsp: 1.235 ± 0.309
1.235TrpGlu: 1.235 ± 0.275
0.617TrpPhe: 0.617 ± 0.235
1.235TrpGly: 1.235 ± 0.346
0.882TrpHis: 0.882 ± 0.29
0.617TrpIle: 0.617 ± 0.28
0.441TrpLys: 0.441 ± 0.25
1.676TrpLeu: 1.676 ± 0.392
0.265TrpMet: 0.265 ± 0.138
0.706TrpAsn: 0.706 ± 0.332
0.265TrpPro: 0.265 ± 0.158
0.176TrpGln: 0.176 ± 0.101
0.353TrpArg: 0.353 ± 0.167
1.323TrpSer: 1.323 ± 0.346
1.235TrpThr: 1.235 ± 0.283
1.323TrpVal: 1.323 ± 0.317
0.088TrpTrp: 0.088 ± 0.083
0.706TrpTyr: 0.706 ± 0.275
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.47TyrAla: 2.47 ± 0.473
0.794TyrCys: 0.794 ± 0.226
2.734TyrAsp: 2.734 ± 0.498
2.646TyrGlu: 2.646 ± 0.465
0.441TyrPhe: 0.441 ± 0.185
1.676TyrGly: 1.676 ± 0.368
0.529TyrHis: 0.529 ± 0.226
0.794TyrIle: 0.794 ± 0.28
0.617TyrLys: 0.617 ± 0.173
1.676TyrLeu: 1.676 ± 0.413
0.441TyrMet: 0.441 ± 0.184
1.058TyrAsn: 1.058 ± 0.366
1.588TyrPro: 1.588 ± 0.487
1.147TyrGln: 1.147 ± 0.387
1.764TyrArg: 1.764 ± 0.363
2.47TyrSer: 2.47 ± 0.506
1.852TyrThr: 1.852 ± 0.372
2.029TyrVal: 2.029 ± 0.407
0.441TyrTrp: 0.441 ± 0.218
0.529TyrTyr: 0.529 ± 0.294
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (11338 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski