Amino acid dipepetide frequency for Hawaiian green turtle herpesvirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.818AlaAla: 14.818 ± 2.097
1.735AlaCys: 1.735 ± 0.424
3.738AlaAsp: 3.738 ± 0.647
6.541AlaGlu: 6.541 ± 1.117
4.806AlaPhe: 4.806 ± 0.748
5.874AlaGly: 5.874 ± 1.381
1.735AlaHis: 1.735 ± 0.647
2.536AlaIle: 2.536 ± 0.541
3.204AlaLys: 3.204 ± 0.612
10.412AlaLeu: 10.412 ± 1.287
1.335AlaMet: 1.335 ± 0.381
1.068AlaAsn: 1.068 ± 0.386
8.143AlaPro: 8.143 ± 1.064
3.604AlaGln: 3.604 ± 0.644
7.876AlaArg: 7.876 ± 1.09
5.874AlaSer: 5.874 ± 0.695
4.405AlaThr: 4.405 ± 0.735
6.541AlaVal: 6.541 ± 0.575
0.667AlaTrp: 0.667 ± 0.232
3.07AlaTyr: 3.07 ± 0.536
0.0AlaXaa: 0.0 ± 0.0
Cys
1.735CysAla: 1.735 ± 0.537
0.534CysCys: 0.534 ± 0.309
1.735CysAsp: 1.735 ± 0.538
1.335CysGlu: 1.335 ± 0.382
1.335CysPhe: 1.335 ± 0.454
0.934CysGly: 0.934 ± 0.392
0.267CysHis: 0.267 ± 0.215
0.133CysIle: 0.133 ± 0.129
0.667CysLys: 0.667 ± 0.297
2.403CysLeu: 2.403 ± 0.469
0.267CysMet: 0.267 ± 0.173
0.4CysAsn: 0.4 ± 0.181
1.602CysPro: 1.602 ± 0.588
0.133CysGln: 0.133 ± 0.16
2.136CysArg: 2.136 ± 0.511
1.068CysSer: 1.068 ± 0.532
0.667CysThr: 0.667 ± 0.295
1.335CysVal: 1.335 ± 0.419
0.534CysTrp: 0.534 ± 0.238
0.4CysTyr: 0.4 ± 0.254
0.0CysXaa: 0.0 ± 0.0
Asp
5.874AspAla: 5.874 ± 0.901
1.468AspCys: 1.468 ± 0.409
2.002AspAsp: 2.002 ± 0.266
4.272AspGlu: 4.272 ± 0.923
1.602AspPhe: 1.602 ± 0.422
1.869AspGly: 1.869 ± 0.377
1.068AspHis: 1.068 ± 0.457
1.602AspIle: 1.602 ± 0.391
1.735AspLys: 1.735 ± 0.5
6.408AspLeu: 6.408 ± 1.032
0.934AspMet: 0.934 ± 0.276
1.068AspAsn: 1.068 ± 0.509
3.604AspPro: 3.604 ± 0.516
1.468AspGln: 1.468 ± 0.491
4.272AspArg: 4.272 ± 0.776
2.536AspSer: 2.536 ± 0.557
1.068AspThr: 1.068 ± 0.398
4.405AspVal: 4.405 ± 0.812
0.667AspTrp: 0.667 ± 0.303
1.869AspTyr: 1.869 ± 0.436
0.0AspXaa: 0.0 ± 0.0
Glu
7.075GluAla: 7.075 ± 1.163
0.934GluCys: 0.934 ± 0.401
3.204GluAsp: 3.204 ± 0.512
6.541GluGlu: 6.541 ± 1.507
2.002GluPhe: 2.002 ± 0.62
4.005GluGly: 4.005 ± 0.874
1.201GluHis: 1.201 ± 0.41
2.136GluIle: 2.136 ± 0.59
4.138GluLys: 4.138 ± 0.54
7.209GluLeu: 7.209 ± 0.826
0.801GluMet: 0.801 ± 0.298
2.403GluAsn: 2.403 ± 0.404
2.67GluPro: 2.67 ± 0.468
1.602GluGln: 1.602 ± 0.401
6.675GluArg: 6.675 ± 0.917
4.272GluSer: 4.272 ± 0.579
6.541GluThr: 6.541 ± 1.235
3.471GluVal: 3.471 ± 0.695
0.667GluTrp: 0.667 ± 0.232
1.335GluTyr: 1.335 ± 0.252
0.0GluXaa: 0.0 ± 0.0
Phe
3.871PheAla: 3.871 ± 0.697
0.934PheCys: 0.934 ± 0.262
2.536PheAsp: 2.536 ± 0.526
3.337PheGlu: 3.337 ± 0.529
2.136PhePhe: 2.136 ± 0.6
2.937PheGly: 2.937 ± 0.472
1.201PheHis: 1.201 ± 0.337
1.468PheIle: 1.468 ± 0.358
1.869PheLys: 1.869 ± 0.674
5.206PheLeu: 5.206 ± 0.702
0.934PheMet: 0.934 ± 0.326
1.068PheAsn: 1.068 ± 0.454
2.937PhePro: 2.937 ± 0.577
2.002PheGln: 2.002 ± 0.48
3.604PheArg: 3.604 ± 0.695
3.337PheSer: 3.337 ± 0.492
1.869PheThr: 1.869 ± 0.794
4.138PheVal: 4.138 ± 0.484
0.4PheTrp: 0.4 ± 0.21
2.269PheTyr: 2.269 ± 0.507
0.0PheXaa: 0.0 ± 0.0
Gly
4.138GlyAla: 4.138 ± 0.76
0.267GlyCys: 0.267 ± 0.17
2.803GlyAsp: 2.803 ± 0.433
5.073GlyGlu: 5.073 ± 0.667
4.405GlyPhe: 4.405 ± 0.509
4.272GlyGly: 4.272 ± 0.922
1.068GlyHis: 1.068 ± 0.258
1.335GlyIle: 1.335 ± 0.353
2.403GlyLys: 2.403 ± 0.514
6.408GlyLeu: 6.408 ± 0.697
0.4GlyMet: 0.4 ± 0.193
1.201GlyAsn: 1.201 ± 0.394
4.672GlyPro: 4.672 ± 1.048
2.136GlyGln: 2.136 ± 0.374
7.342GlyArg: 7.342 ± 1.241
3.604GlySer: 3.604 ± 0.836
2.937GlyThr: 2.937 ± 0.733
4.272GlyVal: 4.272 ± 0.812
0.534GlyTrp: 0.534 ± 0.198
1.735GlyTyr: 1.735 ± 0.539
0.0GlyXaa: 0.0 ± 0.0
His
2.002HisAla: 2.002 ± 0.494
0.934HisCys: 0.934 ± 0.3
0.534HisAsp: 0.534 ± 0.235
0.801HisGlu: 0.801 ± 0.297
1.602HisPhe: 1.602 ± 0.708
1.201HisGly: 1.201 ± 0.429
0.133HisHis: 0.133 ± 0.124
0.267HisIle: 0.267 ± 0.169
0.4HisLys: 0.4 ± 0.195
2.002HisLeu: 2.002 ± 0.403
0.267HisMet: 0.267 ± 0.221
1.068HisAsn: 1.068 ± 0.246
0.0HisPro: 0.0 ± 0.0
0.801HisGln: 0.801 ± 0.398
1.468HisArg: 1.468 ± 0.36
1.201HisSer: 1.201 ± 0.409
1.201HisThr: 1.201 ± 0.422
2.403HisVal: 2.403 ± 0.56
0.267HisTrp: 0.267 ± 0.166
0.4HisTyr: 0.4 ± 0.233
0.0HisXaa: 0.0 ± 0.0
Ile
1.602IleAla: 1.602 ± 0.394
0.667IleCys: 0.667 ± 0.321
1.602IleAsp: 1.602 ± 0.465
1.869IleGlu: 1.869 ± 0.664
2.136IlePhe: 2.136 ± 0.749
1.068IleGly: 1.068 ± 0.31
0.667IleHis: 0.667 ± 0.245
0.667IleIle: 0.667 ± 0.294
2.136IleLys: 2.136 ± 0.446
2.269IleLeu: 2.269 ± 0.518
0.4IleMet: 0.4 ± 0.209
1.201IleAsn: 1.201 ± 0.387
1.468IlePro: 1.468 ± 0.77
1.468IleGln: 1.468 ± 0.519
1.335IleArg: 1.335 ± 0.393
1.335IleSer: 1.335 ± 0.385
2.403IleThr: 2.403 ± 0.585
2.002IleVal: 2.002 ± 0.498
0.4IleTrp: 0.4 ± 0.227
1.335IleTyr: 1.335 ± 0.338
0.0IleXaa: 0.0 ± 0.0
Lys
3.738LysAla: 3.738 ± 0.834
0.4LysCys: 0.4 ± 0.263
1.068LysAsp: 1.068 ± 0.34
2.536LysGlu: 2.536 ± 0.525
2.002LysPhe: 2.002 ± 0.35
1.335LysGly: 1.335 ± 0.326
1.068LysHis: 1.068 ± 0.543
1.602LysIle: 1.602 ± 0.472
2.937LysLys: 2.937 ± 0.572
3.871LysLeu: 3.871 ± 1.119
1.468LysMet: 1.468 ± 0.511
2.536LysAsn: 2.536 ± 0.597
1.335LysPro: 1.335 ± 0.368
2.403LysGln: 2.403 ± 0.685
3.738LysArg: 3.738 ± 0.628
2.002LysSer: 2.002 ± 0.353
3.07LysThr: 3.07 ± 0.639
2.269LysVal: 2.269 ± 0.357
0.0LysTrp: 0.0 ± 0.0
0.534LysTyr: 0.534 ± 0.288
0.0LysXaa: 0.0 ± 0.0
Leu
9.211LeuAla: 9.211 ± 0.914
3.07LeuCys: 3.07 ± 0.864
5.073LeuAsp: 5.073 ± 1.164
7.075LeuGlu: 7.075 ± 1.134
6.274LeuPhe: 6.274 ± 0.763
6.541LeuGly: 6.541 ± 0.753
1.335LeuHis: 1.335 ± 0.408
3.738LeuIle: 3.738 ± 0.608
5.34LeuLys: 5.34 ± 0.754
16.019LeuLeu: 16.019 ± 1.44
0.934LeuMet: 0.934 ± 0.358
4.272LeuAsn: 4.272 ± 0.75
4.672LeuPro: 4.672 ± 0.77
3.738LeuGln: 3.738 ± 0.523
8.277LeuArg: 8.277 ± 1.547
6.942LeuSer: 6.942 ± 0.86
6.408LeuThr: 6.408 ± 0.946
6.007LeuVal: 6.007 ± 0.964
0.667LeuTrp: 0.667 ± 0.338
3.604LeuTyr: 3.604 ± 0.633
0.0LeuXaa: 0.0 ± 0.0
Met
0.934MetAla: 0.934 ± 0.453
0.267MetCys: 0.267 ± 0.206
0.801MetAsp: 0.801 ± 0.257
1.602MetGlu: 1.602 ± 0.681
0.534MetPhe: 0.534 ± 0.234
1.468MetGly: 1.468 ± 0.476
0.133MetHis: 0.133 ± 0.133
0.4MetIle: 0.4 ± 0.262
0.534MetLys: 0.534 ± 0.276
2.136MetLeu: 2.136 ± 0.362
0.801MetMet: 0.801 ± 0.286
0.667MetAsn: 0.667 ± 0.288
0.267MetPro: 0.267 ± 0.187
0.534MetGln: 0.534 ± 0.311
0.4MetArg: 0.4 ± 0.282
1.201MetSer: 1.201 ± 0.431
1.068MetThr: 1.068 ± 0.385
0.667MetVal: 0.667 ± 0.433
0.267MetTrp: 0.267 ± 0.15
0.534MetTyr: 0.534 ± 0.394
0.0MetXaa: 0.0 ± 0.0
Asn
4.272AsnAla: 4.272 ± 0.476
0.667AsnCys: 0.667 ± 0.169
1.869AsnAsp: 1.869 ± 0.457
1.735AsnGlu: 1.735 ± 0.569
1.602AsnPhe: 1.602 ± 0.539
2.002AsnGly: 2.002 ± 0.552
0.801AsnHis: 0.801 ± 0.198
0.934AsnIle: 0.934 ± 0.427
1.068AsnLys: 1.068 ± 0.415
2.403AsnLeu: 2.403 ± 0.412
0.267AsnMet: 0.267 ± 0.162
0.934AsnAsn: 0.934 ± 0.307
2.403AsnPro: 2.403 ± 0.435
0.801AsnGln: 0.801 ± 0.28
3.07AsnArg: 3.07 ± 0.519
1.869AsnSer: 1.869 ± 0.667
2.002AsnThr: 2.002 ± 0.498
2.136AsnVal: 2.136 ± 0.58
0.0AsnTrp: 0.0 ± 0.0
1.335AsnTyr: 1.335 ± 0.323
0.0AsnXaa: 0.0 ± 0.0
Pro
5.874ProAla: 5.874 ± 1.116
1.869ProCys: 1.869 ± 0.614
2.67ProAsp: 2.67 ± 0.711
6.274ProGlu: 6.274 ± 1.225
1.602ProPhe: 1.602 ± 0.523
4.939ProGly: 4.939 ± 1.226
1.201ProHis: 1.201 ± 0.355
1.201ProIle: 1.201 ± 0.513
1.335ProLys: 1.335 ± 0.312
5.607ProLeu: 5.607 ± 0.853
0.667ProMet: 0.667 ± 0.255
1.068ProAsn: 1.068 ± 0.449
7.876ProPro: 7.876 ± 2.6
0.934ProGln: 0.934 ± 0.288
4.005ProArg: 4.005 ± 0.898
5.607ProSer: 5.607 ± 0.929
2.937ProThr: 2.937 ± 1.043
4.272ProVal: 4.272 ± 0.958
0.267ProTrp: 0.267 ± 0.164
2.937ProTyr: 2.937 ± 0.719
0.0ProXaa: 0.0 ± 0.0
Gln
2.002GlnAla: 2.002 ± 0.47
0.133GlnCys: 0.133 ± 0.124
1.201GlnAsp: 1.201 ± 0.318
2.269GlnGlu: 2.269 ± 0.647
2.269GlnPhe: 2.269 ± 0.453
1.468GlnGly: 1.468 ± 0.54
0.667GlnHis: 0.667 ± 0.336
1.735GlnIle: 1.735 ± 0.284
1.735GlnLys: 1.735 ± 0.405
4.405GlnLeu: 4.405 ± 0.693
0.667GlnMet: 0.667 ± 0.386
1.201GlnAsn: 1.201 ± 0.42
2.136GlnPro: 2.136 ± 0.535
1.201GlnGln: 1.201 ± 0.335
3.204GlnArg: 3.204 ± 0.696
1.735GlnSer: 1.735 ± 0.408
2.136GlnThr: 2.136 ± 0.695
1.869GlnVal: 1.869 ± 0.265
0.4GlnTrp: 0.4 ± 0.299
0.534GlnTyr: 0.534 ± 0.268
0.0GlnXaa: 0.0 ± 0.0
Arg
9.345ArgAla: 9.345 ± 1.08
1.068ArgCys: 1.068 ± 0.291
5.874ArgAsp: 5.874 ± 0.791
4.806ArgGlu: 4.806 ± 0.688
4.272ArgPhe: 4.272 ± 0.68
5.74ArgGly: 5.74 ± 0.988
1.468ArgHis: 1.468 ± 0.347
2.002ArgIle: 2.002 ± 0.326
2.803ArgLys: 2.803 ± 0.859
10.546ArgLeu: 10.546 ± 1.828
0.534ArgMet: 0.534 ± 0.211
2.536ArgAsn: 2.536 ± 0.672
5.473ArgPro: 5.473 ± 0.969
2.937ArgGln: 2.937 ± 0.429
8.277ArgArg: 8.277 ± 1.7
6.408ArgSer: 6.408 ± 1.4
4.272ArgThr: 4.272 ± 0.699
4.939ArgVal: 4.939 ± 0.608
0.934ArgTrp: 0.934 ± 0.284
2.269ArgTyr: 2.269 ± 0.684
0.0ArgXaa: 0.0 ± 0.0
Ser
7.476SerAla: 7.476 ± 0.88
1.468SerCys: 1.468 ± 0.408
3.204SerAsp: 3.204 ± 0.772
4.138SerGlu: 4.138 ± 0.748
2.937SerPhe: 2.937 ± 0.638
4.405SerGly: 4.405 ± 0.494
0.534SerHis: 0.534 ± 0.33
0.934SerIle: 0.934 ± 0.295
1.602SerLys: 1.602 ± 0.395
6.141SerLeu: 6.141 ± 0.659
0.801SerMet: 0.801 ± 0.226
2.536SerAsn: 2.536 ± 0.493
3.738SerPro: 3.738 ± 0.796
2.136SerGln: 2.136 ± 0.533
6.808SerArg: 6.808 ± 0.638
4.005SerSer: 4.005 ± 1.053
3.871SerThr: 3.871 ± 0.819
5.073SerVal: 5.073 ± 1.14
0.133SerTrp: 0.133 ± 0.125
1.869SerTyr: 1.869 ± 0.416
0.0SerXaa: 0.0 ± 0.0
Thr
5.874ThrAla: 5.874 ± 0.949
1.335ThrCys: 1.335 ± 0.496
3.07ThrAsp: 3.07 ± 0.761
3.871ThrGlu: 3.871 ± 0.736
2.269ThrPhe: 2.269 ± 0.573
3.738ThrGly: 3.738 ± 0.822
0.934ThrHis: 0.934 ± 0.268
0.934ThrIle: 0.934 ± 0.371
2.136ThrLys: 2.136 ± 0.498
4.939ThrLeu: 4.939 ± 1.09
0.801ThrMet: 0.801 ± 0.604
2.002ThrAsn: 2.002 ± 0.631
2.937ThrPro: 2.937 ± 0.66
1.335ThrGln: 1.335 ± 0.441
5.34ThrArg: 5.34 ± 0.922
3.471ThrSer: 3.471 ± 0.931
1.869ThrThr: 1.869 ± 0.637
6.942ThrVal: 6.942 ± 1.033
0.534ThrTrp: 0.534 ± 0.305
1.068ThrTyr: 1.068 ± 0.38
0.0ThrXaa: 0.0 ± 0.0
Val
4.939ValAla: 4.939 ± 0.71
1.201ValCys: 1.201 ± 0.259
4.138ValAsp: 4.138 ± 0.681
2.403ValGlu: 2.403 ± 0.422
2.269ValPhe: 2.269 ± 0.325
4.005ValGly: 4.005 ± 0.691
1.869ValHis: 1.869 ± 0.47
2.937ValIle: 2.937 ± 0.769
2.002ValLys: 2.002 ± 0.334
6.408ValLeu: 6.408 ± 1.077
1.869ValMet: 1.869 ± 0.361
3.738ValAsn: 3.738 ± 0.953
4.672ValPro: 4.672 ± 1.153
2.136ValGln: 2.136 ± 0.532
5.206ValArg: 5.206 ± 0.927
5.74ValSer: 5.74 ± 0.914
5.74ValThr: 5.74 ± 0.771
3.471ValVal: 3.471 ± 0.887
1.201ValTrp: 1.201 ± 0.504
2.803ValTyr: 2.803 ± 0.554
0.0ValXaa: 0.0 ± 0.0
Trp
0.534TrpAla: 0.534 ± 0.324
0.133TrpCys: 0.133 ± 0.122
0.934TrpAsp: 0.934 ± 0.393
0.534TrpGlu: 0.534 ± 0.284
0.267TrpPhe: 0.267 ± 0.236
0.934TrpGly: 0.934 ± 0.369
0.267TrpHis: 0.267 ± 0.186
0.267TrpIle: 0.267 ± 0.186
0.4TrpLys: 0.4 ± 0.266
0.4TrpLeu: 0.4 ± 0.176
0.267TrpMet: 0.267 ± 0.168
0.4TrpAsn: 0.4 ± 0.189
0.667TrpPro: 0.667 ± 0.386
0.534TrpGln: 0.534 ± 0.227
1.201TrpArg: 1.201 ± 0.428
0.534TrpSer: 0.534 ± 0.288
0.0TrpThr: 0.0 ± 0.0
0.267TrpVal: 0.267 ± 0.174
0.267TrpTrp: 0.267 ± 0.206
0.4TrpTyr: 0.4 ± 0.185
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.67TyrAla: 2.67 ± 0.527
0.534TyrCys: 0.534 ± 0.352
1.735TyrAsp: 1.735 ± 0.431
1.869TyrGlu: 1.869 ± 0.317
1.735TyrPhe: 1.735 ± 0.394
2.269TyrGly: 2.269 ± 0.786
1.335TyrHis: 1.335 ± 0.384
1.201TyrIle: 1.201 ± 0.458
1.468TyrLys: 1.468 ± 0.526
4.005TyrLeu: 4.005 ± 0.719
0.667TyrMet: 0.667 ± 0.344
0.801TyrAsn: 0.801 ± 0.28
1.869TyrPro: 1.869 ± 0.416
1.201TyrGln: 1.201 ± 0.342
2.269TyrArg: 2.269 ± 0.716
1.068TyrSer: 1.068 ± 0.422
0.934TyrThr: 0.934 ± 0.292
2.269TyrVal: 2.269 ± 0.544
0.4TyrTrp: 0.4 ± 0.303
1.068TyrTyr: 1.068 ± 0.352
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 15 proteins (7492 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski