Amino acid dipepetide frequency for Hubei myriapoda virus 7

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.227AlaAla: 2.227 ± 0.744
0.81AlaCys: 0.81 ± 0.486
1.417AlaAsp: 1.417 ± 0.601
2.834AlaGlu: 2.834 ± 0.705
1.822AlaPhe: 1.822 ± 0.737
1.822AlaGly: 1.822 ± 0.527
1.012AlaHis: 1.012 ± 0.306
3.239AlaIle: 3.239 ± 1.522
1.417AlaLys: 1.417 ± 0.26
3.239AlaLeu: 3.239 ± 0.716
0.81AlaMet: 0.81 ± 0.39
1.619AlaAsn: 1.619 ± 0.567
0.81AlaPro: 0.81 ± 0.537
1.822AlaGln: 1.822 ± 0.985
3.036AlaArg: 3.036 ± 0.742
3.239AlaSer: 3.239 ± 1.005
3.239AlaThr: 3.239 ± 1.027
1.215AlaVal: 1.215 ± 0.517
0.202AlaTrp: 0.202 ± 0.122
1.822AlaTyr: 1.822 ± 0.342
0.0AlaXaa: 0.0 ± 0.0
Cys
0.81CysAla: 0.81 ± 0.304
0.405CysCys: 0.405 ± 0.223
0.405CysAsp: 0.405 ± 0.223
0.607CysGlu: 0.607 ± 0.25
0.607CysPhe: 0.607 ± 0.299
0.607CysGly: 0.607 ± 0.35
0.202CysHis: 0.202 ± 0.244
1.012CysIle: 1.012 ± 0.612
0.81CysLys: 0.81 ± 0.508
1.417CysLeu: 1.417 ± 0.601
1.012CysMet: 1.012 ± 0.537
0.81CysAsn: 0.81 ± 0.508
0.607CysPro: 0.607 ± 0.549
0.202CysGln: 0.202 ± 0.244
0.81CysArg: 0.81 ± 0.24
1.417CysSer: 1.417 ± 0.662
1.012CysThr: 1.012 ± 0.363
1.215CysVal: 1.215 ± 0.575
0.202CysTrp: 0.202 ± 0.122
1.012CysTyr: 1.012 ± 0.285
0.0CysXaa: 0.0 ± 0.0
Asp
1.417AspAla: 1.417 ± 0.48
0.81AspCys: 0.81 ± 0.486
2.227AspAsp: 2.227 ± 0.599
5.263AspGlu: 5.263 ± 0.611
2.024AspPhe: 2.024 ± 0.501
2.632AspGly: 2.632 ± 0.433
1.822AspHis: 1.822 ± 0.714
5.263AspIle: 5.263 ± 0.736
3.441AspLys: 3.441 ± 0.553
7.895AspLeu: 7.895 ± 1.332
1.619AspMet: 1.619 ± 0.658
3.036AspAsn: 3.036 ± 0.662
1.215AspPro: 1.215 ± 0.612
1.619AspGln: 1.619 ± 0.561
2.429AspArg: 2.429 ± 1.257
3.441AspSer: 3.441 ± 0.835
4.049AspThr: 4.049 ± 1.167
1.822AspVal: 1.822 ± 0.375
0.607AspTrp: 0.607 ± 0.54
2.227AspTyr: 2.227 ± 0.664
0.0AspXaa: 0.0 ± 0.0
Glu
3.036GluAla: 3.036 ± 1.659
0.405GluCys: 0.405 ± 0.488
3.441GluAsp: 3.441 ± 1.412
5.466GluGlu: 5.466 ± 0.727
2.429GluPhe: 2.429 ± 0.375
3.441GluGly: 3.441 ± 1.097
1.417GluHis: 1.417 ± 0.29
4.049GluIle: 4.049 ± 0.461
4.858GluLys: 4.858 ± 0.991
5.87GluLeu: 5.87 ± 1.129
2.834GluMet: 2.834 ± 1.005
2.227GluAsn: 2.227 ± 0.458
1.619GluPro: 1.619 ± 0.334
3.239GluGln: 3.239 ± 1.62
4.251GluArg: 4.251 ± 1.15
5.466GluSer: 5.466 ± 0.445
4.858GluThr: 4.858 ± 0.405
3.441GluVal: 3.441 ± 0.788
0.607GluTrp: 0.607 ± 0.3
1.417GluTyr: 1.417 ± 0.454
0.0GluXaa: 0.0 ± 0.0
Phe
0.607PheAla: 0.607 ± 0.3
0.405PheCys: 0.405 ± 0.223
4.251PheAsp: 4.251 ± 0.847
1.619PheGlu: 1.619 ± 0.467
2.024PhePhe: 2.024 ± 0.912
2.429PheGly: 2.429 ± 0.595
0.81PheHis: 0.81 ± 0.348
3.036PheIle: 3.036 ± 0.632
3.846PheLys: 3.846 ± 1.085
3.441PheLeu: 3.441 ± 0.59
0.405PheMet: 0.405 ± 0.244
2.429PheAsn: 2.429 ± 1.117
3.239PhePro: 3.239 ± 0.729
1.822PheGln: 1.822 ± 0.72
1.215PheArg: 1.215 ± 0.554
4.251PheSer: 4.251 ± 1.272
3.441PheThr: 3.441 ± 0.861
2.024PheVal: 2.024 ± 0.797
0.81PheTrp: 0.81 ± 0.304
1.619PheTyr: 1.619 ± 0.689
0.0PheXaa: 0.0 ± 0.0
Gly
1.417GlyAla: 1.417 ± 0.472
0.405GlyCys: 0.405 ± 0.223
2.632GlyAsp: 2.632 ± 0.649
3.644GlyGlu: 3.644 ± 0.677
2.632GlyPhe: 2.632 ± 0.836
3.036GlyGly: 3.036 ± 0.556
1.215GlyHis: 1.215 ± 0.37
4.251GlyIle: 4.251 ± 0.429
4.251GlyLys: 4.251 ± 1.509
6.68GlyLeu: 6.68 ± 0.842
1.012GlyMet: 1.012 ± 0.484
2.632GlyAsn: 2.632 ± 0.737
1.822GlyPro: 1.822 ± 1.032
1.619GlyGln: 1.619 ± 0.311
3.846GlyArg: 3.846 ± 1.364
3.036GlySer: 3.036 ± 0.276
3.036GlyThr: 3.036 ± 0.963
3.441GlyVal: 3.441 ± 0.931
0.81GlyTrp: 0.81 ± 0.446
2.429GlyTyr: 2.429 ± 1.097
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.202HisAsp: 0.202 ± 0.122
0.607HisGlu: 0.607 ± 0.223
2.227HisPhe: 2.227 ± 0.954
1.417HisGly: 1.417 ± 0.649
1.012HisHis: 1.012 ± 0.397
2.024HisIle: 2.024 ± 0.481
0.81HisLys: 0.81 ± 0.387
2.632HisLeu: 2.632 ± 0.961
0.0HisMet: 0.0 ± 0.0
1.619HisAsn: 1.619 ± 0.742
2.429HisPro: 2.429 ± 0.387
1.012HisGln: 1.012 ± 0.643
1.215HisArg: 1.215 ± 0.432
1.417HisSer: 1.417 ± 0.384
1.012HisThr: 1.012 ± 0.351
0.405HisVal: 0.405 ± 0.312
0.202HisTrp: 0.202 ± 0.253
1.012HisTyr: 1.012 ± 0.423
0.0HisXaa: 0.0 ± 0.0
Ile
3.441IleAla: 3.441 ± 0.835
1.417IleCys: 1.417 ± 0.276
4.656IleAsp: 4.656 ± 0.758
6.68IleGlu: 6.68 ± 1.17
3.239IlePhe: 3.239 ± 0.561
5.263IleGly: 5.263 ± 1.725
1.619IleHis: 1.619 ± 0.445
6.478IleIle: 6.478 ± 0.859
4.858IleLys: 4.858 ± 0.846
4.251IleLeu: 4.251 ± 0.534
3.036IleMet: 3.036 ± 0.598
3.644IleAsn: 3.644 ± 1.004
4.656IlePro: 4.656 ± 1.06
1.619IleGln: 1.619 ± 0.583
2.834IleArg: 2.834 ± 1.056
7.085IleSer: 7.085 ± 0.943
7.287IleThr: 7.287 ± 1.333
4.858IleVal: 4.858 ± 0.835
1.012IleTrp: 1.012 ± 0.363
2.834IleTyr: 2.834 ± 0.8
0.0IleXaa: 0.0 ± 0.0
Lys
2.227LysAla: 2.227 ± 0.69
0.607LysCys: 0.607 ± 0.35
4.251LysAsp: 4.251 ± 1.024
4.858LysGlu: 4.858 ± 1.391
2.429LysPhe: 2.429 ± 0.588
2.834LysGly: 2.834 ± 1.283
1.417LysHis: 1.417 ± 0.385
7.49LysIle: 7.49 ± 1.275
3.846LysLys: 3.846 ± 0.531
6.883LysLeu: 6.883 ± 0.502
2.834LysMet: 2.834 ± 0.551
3.036LysAsn: 3.036 ± 0.59
1.417LysPro: 1.417 ± 0.591
2.632LysGln: 2.632 ± 0.774
2.834LysArg: 2.834 ± 0.313
5.263LysSer: 5.263 ± 0.741
5.263LysThr: 5.263 ± 0.79
4.251LysVal: 4.251 ± 0.947
1.619LysTrp: 1.619 ± 0.437
1.012LysTyr: 1.012 ± 0.423
0.0LysXaa: 0.0 ± 0.0
Leu
3.239LeuAla: 3.239 ± 0.906
2.227LeuCys: 2.227 ± 1.14
4.656LeuAsp: 4.656 ± 0.585
4.251LeuGlu: 4.251 ± 0.628
4.656LeuPhe: 4.656 ± 1.799
5.061LeuGly: 5.061 ± 0.454
1.012LeuHis: 1.012 ± 0.335
7.085LeuIle: 7.085 ± 1.014
6.68LeuLys: 6.68 ± 1.701
9.109LeuLeu: 9.109 ± 1.204
2.429LeuMet: 2.429 ± 0.665
6.478LeuAsn: 6.478 ± 1.12
3.441LeuPro: 3.441 ± 1.047
2.429LeuGln: 2.429 ± 0.57
3.644LeuArg: 3.644 ± 0.756
6.68LeuSer: 6.68 ± 1.132
6.478LeuThr: 6.478 ± 2.36
4.251LeuVal: 4.251 ± 0.964
1.619LeuTrp: 1.619 ± 0.494
4.049LeuTyr: 4.049 ± 0.648
0.0LeuXaa: 0.0 ± 0.0
Met
2.024MetAla: 2.024 ± 0.727
0.81MetCys: 0.81 ± 0.499
1.012MetAsp: 1.012 ± 0.423
1.215MetGlu: 1.215 ± 0.343
1.215MetPhe: 1.215 ± 0.361
1.417MetGly: 1.417 ± 0.586
0.202MetHis: 0.202 ± 0.198
3.036MetIle: 3.036 ± 0.653
3.036MetLys: 3.036 ± 0.518
1.619MetLeu: 1.619 ± 0.578
1.215MetMet: 1.215 ± 0.575
1.619MetAsn: 1.619 ± 0.426
0.607MetPro: 0.607 ± 0.251
1.215MetGln: 1.215 ± 0.37
1.619MetArg: 1.619 ± 0.523
2.429MetSer: 2.429 ± 0.639
2.429MetThr: 2.429 ± 0.332
2.429MetVal: 2.429 ± 0.598
0.202MetTrp: 0.202 ± 0.26
1.417MetTyr: 1.417 ± 0.47
0.0MetXaa: 0.0 ± 0.0
Asn
2.429AsnAla: 2.429 ± 0.439
0.81AsnCys: 0.81 ± 0.508
2.227AsnAsp: 2.227 ± 0.661
2.429AsnGlu: 2.429 ± 0.439
3.036AsnPhe: 3.036 ± 0.767
2.632AsnGly: 2.632 ± 0.864
1.215AsnHis: 1.215 ± 0.537
4.858AsnIle: 4.858 ± 0.836
3.036AsnLys: 3.036 ± 0.615
5.87AsnLeu: 5.87 ± 1.441
1.012AsnMet: 1.012 ± 0.423
2.834AsnAsn: 2.834 ± 0.528
4.049AsnPro: 4.049 ± 0.687
2.834AsnGln: 2.834 ± 1.081
1.822AsnArg: 1.822 ± 0.662
3.846AsnSer: 3.846 ± 0.716
5.87AsnThr: 5.87 ± 0.744
2.834AsnVal: 2.834 ± 0.696
0.607AsnTrp: 0.607 ± 0.299
2.024AsnTyr: 2.024 ± 0.495
0.0AsnXaa: 0.0 ± 0.0
Pro
2.227ProAla: 2.227 ± 0.434
0.202ProCys: 0.202 ± 0.244
1.822ProAsp: 1.822 ± 0.399
3.846ProGlu: 3.846 ± 0.375
1.619ProPhe: 1.619 ± 0.764
1.822ProGly: 1.822 ± 0.512
0.405ProHis: 0.405 ± 0.172
3.644ProIle: 3.644 ± 1.014
3.846ProLys: 3.846 ± 1.129
2.834ProLeu: 2.834 ± 0.494
2.429ProMet: 2.429 ± 0.568
2.429ProAsn: 2.429 ± 0.869
2.227ProPro: 2.227 ± 0.828
2.227ProGln: 2.227 ± 0.396
1.619ProArg: 1.619 ± 0.269
3.239ProSer: 3.239 ± 0.608
5.466ProThr: 5.466 ± 1.367
3.239ProVal: 3.239 ± 0.815
0.607ProTrp: 0.607 ± 0.496
0.81ProTyr: 0.81 ± 0.345
0.0ProXaa: 0.0 ± 0.0
Gln
2.024GlnAla: 2.024 ± 0.859
0.607GlnCys: 0.607 ± 0.536
2.024GlnAsp: 2.024 ± 0.722
1.619GlnGlu: 1.619 ± 0.983
0.202GlnPhe: 0.202 ± 0.244
2.632GlnGly: 2.632 ± 0.593
1.012GlnHis: 1.012 ± 0.468
1.822GlnIle: 1.822 ± 0.562
3.036GlnLys: 3.036 ± 0.625
3.036GlnLeu: 3.036 ± 0.329
1.012GlnMet: 1.012 ± 0.309
2.227GlnAsn: 2.227 ± 0.396
3.036GlnPro: 3.036 ± 0.876
1.619GlnGln: 1.619 ± 0.555
1.822GlnArg: 1.822 ± 0.342
2.834GlnSer: 2.834 ± 0.973
2.632GlnThr: 2.632 ± 1.101
1.619GlnVal: 1.619 ± 0.383
1.215GlnTrp: 1.215 ± 0.644
1.822GlnTyr: 1.822 ± 0.532
0.0GlnXaa: 0.0 ± 0.0
Arg
0.81ArgAla: 0.81 ± 0.344
1.619ArgCys: 1.619 ± 0.627
2.429ArgAsp: 2.429 ± 0.541
4.453ArgGlu: 4.453 ± 0.538
3.036ArgPhe: 3.036 ± 0.885
3.239ArgGly: 3.239 ± 0.941
1.215ArgHis: 1.215 ± 0.391
1.417ArgIle: 1.417 ± 0.327
2.429ArgLys: 2.429 ± 0.562
4.251ArgLeu: 4.251 ± 0.812
2.227ArgMet: 2.227 ± 0.562
3.239ArgAsn: 3.239 ± 0.892
2.429ArgPro: 2.429 ± 0.348
2.632ArgGln: 2.632 ± 0.316
1.619ArgArg: 1.619 ± 0.335
2.834ArgSer: 2.834 ± 0.644
2.632ArgThr: 2.632 ± 1.106
1.417ArgVal: 1.417 ± 0.338
0.607ArgTrp: 0.607 ± 0.451
1.417ArgTyr: 1.417 ± 0.662
0.0ArgXaa: 0.0 ± 0.0
Ser
3.644SerAla: 3.644 ± 0.515
1.012SerCys: 1.012 ± 0.494
3.441SerAsp: 3.441 ± 0.82
5.061SerGlu: 5.061 ± 1.125
2.227SerPhe: 2.227 ± 0.572
3.239SerGly: 3.239 ± 0.567
1.619SerHis: 1.619 ± 0.486
7.692SerIle: 7.692 ± 0.92
4.656SerLys: 4.656 ± 0.993
6.68SerLeu: 6.68 ± 1.452
2.227SerMet: 2.227 ± 0.849
4.049SerAsn: 4.049 ± 0.833
3.239SerPro: 3.239 ± 0.868
3.036SerGln: 3.036 ± 0.788
3.441SerArg: 3.441 ± 1.021
6.883SerSer: 6.883 ± 1.398
5.263SerThr: 5.263 ± 1.0
4.049SerVal: 4.049 ± 1.201
1.012SerTrp: 1.012 ± 0.287
2.227SerTyr: 2.227 ± 0.756
0.0SerXaa: 0.0 ± 0.0
Thr
2.834ThrAla: 2.834 ± 0.638
0.81ThrCys: 0.81 ± 0.335
5.668ThrAsp: 5.668 ± 1.877
5.061ThrGlu: 5.061 ± 0.794
4.049ThrPhe: 4.049 ± 0.403
3.644ThrGly: 3.644 ± 1.183
1.417ThrHis: 1.417 ± 0.653
5.87ThrIle: 5.87 ± 1.224
3.846ThrLys: 3.846 ± 1.794
6.68ThrLeu: 6.68 ± 0.677
1.822ThrMet: 1.822 ± 0.831
4.453ThrAsn: 4.453 ± 0.944
4.858ThrPro: 4.858 ± 0.855
2.632ThrGln: 2.632 ± 0.934
3.441ThrArg: 3.441 ± 0.375
6.478ThrSer: 6.478 ± 0.706
7.895ThrThr: 7.895 ± 1.759
5.061ThrVal: 5.061 ± 0.356
0.81ThrTrp: 0.81 ± 0.387
2.429ThrTyr: 2.429 ± 0.921
0.0ThrXaa: 0.0 ± 0.0
Val
1.215ValAla: 1.215 ± 0.463
0.81ValCys: 0.81 ± 0.304
4.453ValAsp: 4.453 ± 0.107
1.822ValGlu: 1.822 ± 0.675
2.024ValPhe: 2.024 ± 0.578
4.251ValGly: 4.251 ± 1.672
1.417ValHis: 1.417 ± 0.499
3.644ValIle: 3.644 ± 0.621
4.656ValLys: 4.656 ± 0.949
3.441ValLeu: 3.441 ± 0.442
1.012ValMet: 1.012 ± 0.269
5.263ValAsn: 5.263 ± 1.233
2.227ValPro: 2.227 ± 0.996
1.619ValGln: 1.619 ± 0.51
1.619ValArg: 1.619 ± 0.575
2.834ValSer: 2.834 ± 0.476
4.858ValThr: 4.858 ± 1.117
2.429ValVal: 2.429 ± 0.677
0.405ValTrp: 0.405 ± 0.223
1.619ValTyr: 1.619 ± 0.677
0.0ValXaa: 0.0 ± 0.0
Trp
0.81TrpAla: 0.81 ± 0.717
0.607TrpCys: 0.607 ± 0.365
0.81TrpAsp: 0.81 ± 0.487
0.81TrpGlu: 0.81 ± 0.383
1.012TrpPhe: 1.012 ± 0.609
0.81TrpGly: 0.81 ± 0.461
0.202TrpHis: 0.202 ± 0.244
1.417TrpIle: 1.417 ± 0.631
1.012TrpLys: 1.012 ± 0.703
1.215TrpLeu: 1.215 ± 0.731
0.607TrpMet: 0.607 ± 0.578
0.81TrpAsn: 0.81 ± 0.499
0.405TrpPro: 0.405 ± 0.308
0.607TrpGln: 0.607 ± 0.3
0.607TrpArg: 0.607 ± 0.365
0.0TrpSer: 0.0 ± 0.0
1.215TrpThr: 1.215 ± 0.471
0.607TrpVal: 0.607 ± 0.344
0.405TrpTrp: 0.405 ± 0.244
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.215TyrAla: 1.215 ± 0.612
0.405TyrCys: 0.405 ± 0.249
2.834TyrAsp: 2.834 ± 0.946
2.227TyrGlu: 2.227 ± 0.71
1.215TyrPhe: 1.215 ± 0.592
1.619TyrGly: 1.619 ± 0.445
1.012TyrHis: 1.012 ± 0.477
3.441TyrIle: 3.441 ± 0.832
2.834TyrLys: 2.834 ± 0.711
2.429TyrLeu: 2.429 ± 1.042
1.012TyrMet: 1.012 ± 0.68
1.822TyrAsn: 1.822 ± 0.599
2.227TyrPro: 2.227 ± 0.502
1.417TyrGln: 1.417 ± 0.338
2.227TyrArg: 2.227 ± 0.61
2.024TyrSer: 2.024 ± 0.531
1.822TyrThr: 1.822 ± 0.551
1.012TyrVal: 1.012 ± 0.368
0.405TyrTrp: 0.405 ± 0.244
1.215TyrTyr: 1.215 ± 0.188
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (4941 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski