Amino acid dipepetide frequency for Huangpi Tick Virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.389AlaAla: 4.389 ± 1.947
1.291AlaCys: 1.291 ± 0.495
5.939AlaAsp: 5.939 ± 1.775
2.324AlaGlu: 2.324 ± 0.816
3.615AlaPhe: 3.615 ± 1.145
2.84AlaGly: 2.84 ± 0.416
2.066AlaHis: 2.066 ± 1.316
3.098AlaIle: 3.098 ± 0.589
0.775AlaLys: 0.775 ± 0.389
6.971AlaLeu: 6.971 ± 1.377
0.258AlaMet: 0.258 ± 0.33
1.549AlaAsn: 1.549 ± 0.682
4.131AlaPro: 4.131 ± 1.943
3.098AlaGln: 3.098 ± 0.615
4.131AlaArg: 4.131 ± 0.82
5.68AlaSer: 5.68 ± 0.856
5.422AlaThr: 5.422 ± 1.663
3.873AlaVal: 3.873 ± 0.83
0.775AlaTrp: 0.775 ± 0.332
2.066AlaTyr: 2.066 ± 0.604
0.258AlaXaa: 0.258 ± 0.141
Cys
1.033CysAla: 1.033 ± 0.696
0.258CysCys: 0.258 ± 0.141
1.033CysAsp: 1.033 ± 0.575
0.775CysGlu: 0.775 ± 0.311
0.258CysPhe: 0.258 ± 0.141
0.516CysGly: 0.516 ± 0.288
0.516CysHis: 0.516 ± 0.288
1.033CysIle: 1.033 ± 0.388
1.033CysLys: 1.033 ± 0.567
1.549CysLeu: 1.549 ± 0.517
0.516CysMet: 0.516 ± 0.318
0.516CysAsn: 0.516 ± 0.283
1.549CysPro: 1.549 ± 0.89
0.516CysGln: 0.516 ± 0.284
0.775CysArg: 0.775 ± 0.332
2.84CysSer: 2.84 ± 1.022
2.324CysThr: 2.324 ± 1.093
1.549CysVal: 1.549 ± 0.58
0.516CysTrp: 0.516 ± 0.283
0.258CysTyr: 0.258 ± 0.141
0.0CysXaa: 0.0 ± 0.0
Asp
2.84AspAla: 2.84 ± 0.822
0.258AspCys: 0.258 ± 0.33
3.615AspAsp: 3.615 ± 1.16
3.098AspGlu: 3.098 ± 0.452
3.615AspPhe: 3.615 ± 1.103
2.066AspGly: 2.066 ± 0.571
2.84AspHis: 2.84 ± 0.891
2.324AspIle: 2.324 ± 0.709
2.84AspLys: 2.84 ± 1.759
6.455AspLeu: 6.455 ± 0.929
1.549AspMet: 1.549 ± 0.399
1.549AspAsn: 1.549 ± 0.848
6.197AspPro: 6.197 ± 2.28
2.582AspGln: 2.582 ± 0.494
2.582AspArg: 2.582 ± 1.07
3.873AspSer: 3.873 ± 0.523
1.549AspThr: 1.549 ± 0.486
2.066AspVal: 2.066 ± 0.802
1.549AspTrp: 1.549 ± 0.931
3.098AspTyr: 3.098 ± 0.423
0.258AspXaa: 0.258 ± 0.141
Glu
4.389GluAla: 4.389 ± 1.243
0.516GluCys: 0.516 ± 0.288
2.066GluAsp: 2.066 ± 0.566
3.873GluGlu: 3.873 ± 1.308
1.033GluPhe: 1.033 ± 0.388
3.873GluGly: 3.873 ± 1.847
1.033GluHis: 1.033 ± 0.358
2.582GluIle: 2.582 ± 0.568
2.582GluLys: 2.582 ± 0.694
5.422GluLeu: 5.422 ± 0.925
1.033GluMet: 1.033 ± 0.447
1.033GluAsn: 1.033 ± 0.667
3.615GluPro: 3.615 ± 1.752
1.549GluGln: 1.549 ± 0.62
3.098GluArg: 3.098 ± 0.661
4.906GluSer: 4.906 ± 0.542
3.098GluThr: 3.098 ± 0.801
4.389GluVal: 4.389 ± 0.621
2.324GluTrp: 2.324 ± 0.608
1.549GluTyr: 1.549 ± 0.615
0.0GluXaa: 0.0 ± 0.0
Phe
1.807PheAla: 1.807 ± 0.582
0.775PheCys: 0.775 ± 0.332
2.84PheAsp: 2.84 ± 0.728
0.516PheGlu: 0.516 ± 0.284
1.807PhePhe: 1.807 ± 0.457
1.291PheGly: 1.291 ± 0.707
2.84PheHis: 2.84 ± 0.959
1.033PheIle: 1.033 ± 0.696
2.066PheLys: 2.066 ± 0.787
3.615PheLeu: 3.615 ± 1.699
1.033PheMet: 1.033 ± 0.358
1.033PheAsn: 1.033 ± 0.575
1.807PhePro: 1.807 ± 0.457
2.324PheGln: 2.324 ± 0.897
2.324PheArg: 2.324 ± 0.813
3.357PheSer: 3.357 ± 0.671
2.324PheThr: 2.324 ± 0.414
2.84PheVal: 2.84 ± 0.947
1.291PheTrp: 1.291 ± 0.497
1.291PheTyr: 1.291 ± 0.387
0.0PheXaa: 0.0 ± 0.0
Gly
3.357GlyAla: 3.357 ± 1.356
0.516GlyCys: 0.516 ± 0.283
3.098GlyAsp: 3.098 ± 0.723
2.84GlyGlu: 2.84 ± 1.201
2.324GlyPhe: 2.324 ± 0.726
3.098GlyGly: 3.098 ± 0.313
2.066GlyHis: 2.066 ± 0.333
2.066GlyIle: 2.066 ± 0.432
4.131GlyLys: 4.131 ± 0.635
8.262GlyLeu: 8.262 ± 1.143
1.549GlyMet: 1.549 ± 0.528
1.549GlyAsn: 1.549 ± 0.361
2.582GlyPro: 2.582 ± 0.836
2.84GlyGln: 2.84 ± 0.416
2.582GlyArg: 2.582 ± 1.165
4.389GlySer: 4.389 ± 1.214
3.615GlyThr: 3.615 ± 0.549
3.615GlyVal: 3.615 ± 0.787
0.516GlyTrp: 0.516 ± 0.283
1.291GlyTyr: 1.291 ± 0.89
0.0GlyXaa: 0.0 ± 0.0
His
1.549HisAla: 1.549 ± 0.577
0.775HisCys: 0.775 ± 0.424
1.549HisAsp: 1.549 ± 0.615
1.291HisGlu: 1.291 ± 0.72
0.775HisPhe: 0.775 ± 0.445
1.549HisGly: 1.549 ± 0.492
0.775HisHis: 0.775 ± 0.424
1.807HisIle: 1.807 ± 1.171
0.258HisLys: 0.258 ± 0.141
3.873HisLeu: 3.873 ± 1.172
1.291HisMet: 1.291 ± 0.387
1.291HisAsn: 1.291 ± 0.452
2.066HisPro: 2.066 ± 0.847
1.291HisGln: 1.291 ± 0.452
2.582HisArg: 2.582 ± 0.815
2.066HisSer: 2.066 ± 0.716
0.775HisThr: 0.775 ± 0.332
2.84HisVal: 2.84 ± 0.645
1.291HisTrp: 1.291 ± 0.387
0.775HisTyr: 0.775 ± 0.443
0.258HisXaa: 0.258 ± 0.396
Ile
1.549IleAla: 1.549 ± 0.622
1.033IleCys: 1.033 ± 0.388
2.582IleAsp: 2.582 ± 0.895
1.807IleGlu: 1.807 ± 0.377
3.357IlePhe: 3.357 ± 0.458
2.84IleGly: 2.84 ± 0.473
0.775IleHis: 0.775 ± 0.369
2.066IleIle: 2.066 ± 0.571
3.615IleLys: 3.615 ± 0.478
5.164IleLeu: 5.164 ± 0.937
1.807IleMet: 1.807 ± 0.499
1.807IleAsn: 1.807 ± 0.743
3.098IlePro: 3.098 ± 0.951
1.033IleGln: 1.033 ± 0.575
4.131IleArg: 4.131 ± 0.778
4.389IleSer: 4.389 ± 0.734
1.807IleThr: 1.807 ± 1.171
3.098IleVal: 3.098 ± 0.998
0.516IleTrp: 0.516 ± 0.345
1.033IleTyr: 1.033 ± 0.391
0.0IleXaa: 0.0 ± 0.0
Lys
3.615LysAla: 3.615 ± 0.6
1.033LysCys: 1.033 ± 0.647
2.324LysAsp: 2.324 ± 0.354
3.357LysGlu: 3.357 ± 0.65
1.033LysPhe: 1.033 ± 0.948
2.582LysGly: 2.582 ± 0.712
0.775LysHis: 0.775 ± 0.311
1.807LysIle: 1.807 ± 0.516
2.066LysLys: 2.066 ± 0.404
3.873LysLeu: 3.873 ± 1.172
0.516LysMet: 0.516 ± 0.305
1.033LysAsn: 1.033 ± 0.447
1.807LysPro: 1.807 ± 0.714
1.549LysGln: 1.549 ± 1.845
2.582LysArg: 2.582 ± 0.943
2.582LysSer: 2.582 ± 0.755
2.582LysThr: 2.582 ± 0.802
3.357LysVal: 3.357 ± 0.555
0.516LysTrp: 0.516 ± 0.539
0.775LysTyr: 0.775 ± 0.749
0.0LysXaa: 0.0 ± 0.0
Leu
6.971LeuAla: 6.971 ± 0.754
2.324LeuCys: 2.324 ± 0.366
2.582LeuAsp: 2.582 ± 0.816
6.455LeuGlu: 6.455 ± 0.721
3.873LeuPhe: 3.873 ± 1.839
5.68LeuGly: 5.68 ± 0.961
3.357LeuHis: 3.357 ± 0.897
5.68LeuIle: 5.68 ± 1.52
3.098LeuLys: 3.098 ± 0.406
10.07LeuLeu: 10.07 ± 2.935
3.098LeuMet: 3.098 ± 1.12
2.324LeuAsn: 2.324 ± 0.577
4.648LeuPro: 4.648 ± 1.29
4.389LeuGln: 4.389 ± 0.626
8.262LeuArg: 8.262 ± 1.78
9.295LeuSer: 9.295 ± 1.886
8.779LeuThr: 8.779 ± 0.761
5.939LeuVal: 5.939 ± 2.146
1.291LeuTrp: 1.291 ± 0.304
3.098LeuTyr: 3.098 ± 1.421
0.0LeuXaa: 0.0 ± 0.0
Met
1.807MetAla: 1.807 ± 0.864
0.258MetCys: 0.258 ± 0.141
2.324MetAsp: 2.324 ± 0.595
1.549MetGlu: 1.549 ± 0.955
0.516MetPhe: 0.516 ± 0.288
1.033MetGly: 1.033 ± 0.565
0.0MetHis: 0.0 ± 0.0
0.516MetIle: 0.516 ± 0.284
1.033MetLys: 1.033 ± 0.706
1.807MetLeu: 1.807 ± 0.471
0.516MetMet: 0.516 ± 0.283
0.775MetAsn: 0.775 ± 0.389
0.775MetPro: 0.775 ± 0.311
1.549MetGln: 1.549 ± 0.613
2.066MetArg: 2.066 ± 0.521
1.807MetSer: 1.807 ± 0.363
1.033MetThr: 1.033 ± 0.337
1.549MetVal: 1.549 ± 0.848
0.775MetTrp: 0.775 ± 0.445
1.807MetTyr: 1.807 ± 0.549
0.0MetXaa: 0.0 ± 0.0
Asn
1.033AsnAla: 1.033 ± 0.928
0.258AsnCys: 0.258 ± 0.341
1.549AsnAsp: 1.549 ± 0.615
1.291AsnGlu: 1.291 ± 0.304
1.549AsnPhe: 1.549 ± 0.615
1.291AsnGly: 1.291 ± 0.556
1.549AsnHis: 1.549 ± 0.848
1.033AsnIle: 1.033 ± 0.331
0.516AsnLys: 0.516 ± 0.539
4.131AsnLeu: 4.131 ± 1.067
0.516AsnMet: 0.516 ± 0.284
2.582AsnAsn: 2.582 ± 0.921
2.582AsnPro: 2.582 ± 0.442
1.291AsnGln: 1.291 ± 0.634
1.033AsnArg: 1.033 ± 0.447
3.357AsnSer: 3.357 ± 1.524
1.807AsnThr: 1.807 ± 0.363
2.582AsnVal: 2.582 ± 0.348
0.775AsnTrp: 0.775 ± 0.396
0.775AsnTyr: 0.775 ± 0.29
0.0AsnXaa: 0.0 ± 0.0
Pro
6.713ProAla: 6.713 ± 2.711
0.775ProCys: 0.775 ± 0.611
4.906ProAsp: 4.906 ± 1.337
4.906ProGlu: 4.906 ± 2.337
2.324ProPhe: 2.324 ± 0.608
3.357ProGly: 3.357 ± 0.674
0.775ProHis: 0.775 ± 0.332
1.807ProIle: 1.807 ± 0.55
0.775ProLys: 0.775 ± 0.332
7.23ProLeu: 7.23 ± 1.152
0.775ProMet: 0.775 ± 0.424
1.291ProAsn: 1.291 ± 0.327
5.422ProPro: 5.422 ± 2.741
2.066ProGln: 2.066 ± 1.274
3.098ProArg: 3.098 ± 0.559
6.971ProSer: 6.971 ± 2.112
4.131ProThr: 4.131 ± 1.303
3.615ProVal: 3.615 ± 0.866
1.291ProTrp: 1.291 ± 0.757
0.775ProTyr: 0.775 ± 0.311
0.0ProXaa: 0.0 ± 0.0
Gln
3.873GlnAla: 3.873 ± 1.31
1.033GlnCys: 1.033 ± 0.308
3.357GlnAsp: 3.357 ± 1.62
1.291GlnGlu: 1.291 ± 0.585
1.807GlnPhe: 1.807 ± 0.492
2.582GlnGly: 2.582 ± 0.993
0.516GlnHis: 0.516 ± 0.284
2.066GlnIle: 2.066 ± 0.853
2.324GlnLys: 2.324 ± 0.953
2.066GlnLeu: 2.066 ± 0.522
1.549GlnMet: 1.549 ± 0.589
1.549GlnAsn: 1.549 ± 0.581
1.807GlnPro: 1.807 ± 0.201
1.807GlnGln: 1.807 ± 0.552
3.357GlnArg: 3.357 ± 0.712
1.807GlnSer: 1.807 ± 0.646
3.357GlnThr: 3.357 ± 1.425
2.582GlnVal: 2.582 ± 0.646
0.775GlnTrp: 0.775 ± 0.611
0.0GlnTyr: 0.0 ± 0.0
0.258GlnXaa: 0.258 ± 0.141
Arg
3.357ArgAla: 3.357 ± 1.708
1.291ArgCys: 1.291 ± 0.89
4.648ArgAsp: 4.648 ± 1.316
4.389ArgGlu: 4.389 ± 1.303
2.84ArgPhe: 2.84 ± 0.643
3.357ArgGly: 3.357 ± 0.902
2.324ArgHis: 2.324 ± 0.691
2.324ArgIle: 2.324 ± 0.414
2.582ArgLys: 2.582 ± 0.954
6.197ArgLeu: 6.197 ± 1.63
2.582ArgMet: 2.582 ± 0.743
2.582ArgAsn: 2.582 ± 0.826
4.389ArgPro: 4.389 ± 1.544
3.098ArgGln: 3.098 ± 0.698
4.389ArgArg: 4.389 ± 0.869
4.648ArgSer: 4.648 ± 1.466
4.906ArgThr: 4.906 ± 0.968
3.615ArgVal: 3.615 ± 0.868
0.775ArgTrp: 0.775 ± 0.332
1.033ArgTyr: 1.033 ± 0.331
0.0ArgXaa: 0.0 ± 0.0
Ser
5.939SerAla: 5.939 ± 1.644
2.324SerCys: 2.324 ± 1.204
5.939SerAsp: 5.939 ± 1.137
5.68SerGlu: 5.68 ± 1.547
1.291SerPhe: 1.291 ± 0.387
5.422SerGly: 5.422 ± 0.859
2.324SerHis: 2.324 ± 0.691
6.197SerIle: 6.197 ± 1.162
2.066SerLys: 2.066 ± 0.81
9.295SerLeu: 9.295 ± 2.13
1.033SerMet: 1.033 ± 0.603
2.84SerAsn: 2.84 ± 0.635
3.873SerPro: 3.873 ± 0.663
2.582SerGln: 2.582 ± 1.371
4.389SerArg: 4.389 ± 1.0
8.779SerSer: 8.779 ± 3.349
6.197SerThr: 6.197 ± 1.491
6.197SerVal: 6.197 ± 0.741
2.324SerTrp: 2.324 ± 0.691
3.873SerTyr: 3.873 ± 1.392
0.0SerXaa: 0.0 ± 0.0
Thr
3.873ThrAla: 3.873 ± 0.659
2.84ThrCys: 2.84 ± 1.022
2.582ThrAsp: 2.582 ± 0.464
3.357ThrGlu: 3.357 ± 0.458
1.291ThrPhe: 1.291 ± 0.632
5.422ThrGly: 5.422 ± 0.635
3.357ThrHis: 3.357 ± 1.821
3.357ThrIle: 3.357 ± 0.532
2.324ThrLys: 2.324 ± 0.595
4.389ThrLeu: 4.389 ± 0.742
1.033ThrMet: 1.033 ± 0.38
1.291ThrAsn: 1.291 ± 0.585
4.648ThrPro: 4.648 ± 1.263
2.324ThrGln: 2.324 ± 0.354
3.873ThrArg: 3.873 ± 0.409
8.521ThrSer: 8.521 ± 1.277
4.389ThrThr: 4.389 ± 1.163
4.131ThrVal: 4.131 ± 0.666
0.775ThrTrp: 0.775 ± 0.311
2.324ThrTyr: 2.324 ± 0.704
0.0ThrXaa: 0.0 ± 0.0
Val
3.873ValAla: 3.873 ± 0.846
0.775ValCys: 0.775 ± 0.311
1.291ValAsp: 1.291 ± 0.582
2.84ValGlu: 2.84 ± 0.968
0.775ValPhe: 0.775 ± 0.332
4.906ValGly: 4.906 ± 1.063
1.549ValHis: 1.549 ± 0.669
4.906ValIle: 4.906 ± 0.714
3.873ValLys: 3.873 ± 0.768
6.971ValLeu: 6.971 ± 0.845
1.033ValMet: 1.033 ± 0.303
2.84ValAsn: 2.84 ± 0.475
5.68ValPro: 5.68 ± 1.173
2.582ValGln: 2.582 ± 0.854
5.68ValArg: 5.68 ± 0.658
4.648ValSer: 4.648 ± 1.015
4.389ValThr: 4.389 ± 0.955
5.164ValVal: 5.164 ± 1.047
1.033ValTrp: 1.033 ± 0.331
0.516ValTyr: 0.516 ± 0.318
0.0ValXaa: 0.0 ± 0.0
Trp
1.291TrpAla: 1.291 ± 0.387
0.0TrpCys: 0.0 ± 0.0
0.775TrpAsp: 0.775 ± 0.332
0.258TrpGlu: 0.258 ± 0.141
2.066TrpPhe: 2.066 ± 0.604
1.291TrpGly: 1.291 ± 0.362
0.775TrpHis: 0.775 ± 0.29
0.775TrpIle: 0.775 ± 0.445
1.033TrpLys: 1.033 ± 0.388
0.775TrpLeu: 0.775 ± 0.311
0.516TrpMet: 0.516 ± 0.288
1.291TrpAsn: 1.291 ± 0.461
0.258TrpPro: 0.258 ± 0.141
0.258TrpGln: 0.258 ± 0.141
1.807TrpArg: 1.807 ± 0.73
2.582TrpSer: 2.582 ± 1.413
2.066TrpThr: 2.066 ± 0.603
0.775TrpVal: 0.775 ± 0.638
0.0TrpTrp: 0.0 ± 0.0
0.516TrpTyr: 0.516 ± 0.467
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.291TyrAla: 1.291 ± 0.582
0.775TyrCys: 0.775 ± 0.349
1.807TyrAsp: 1.807 ± 0.457
1.807TyrGlu: 1.807 ± 1.042
1.807TyrPhe: 1.807 ± 0.201
1.291TyrGly: 1.291 ± 0.582
0.516TyrHis: 0.516 ± 0.284
1.033TyrIle: 1.033 ± 0.358
0.775TyrLys: 0.775 ± 0.311
3.098TyrLeu: 3.098 ± 0.526
0.775TyrMet: 0.775 ± 0.602
0.775TyrAsn: 0.775 ± 0.29
2.066TyrPro: 2.066 ± 1.5
1.033TyrGln: 1.033 ± 0.358
2.582TyrArg: 2.582 ± 0.504
2.066TyrSer: 2.066 ± 0.49
1.549TyrThr: 1.549 ± 0.816
1.807TyrVal: 1.807 ± 0.73
0.0TyrTrp: 0.0 ± 0.0
1.549TyrTyr: 1.549 ± 0.746
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.258XaaCys: 0.258 ± 0.141
0.0XaaAsp: 0.0 ± 0.0
0.258XaaGlu: 0.258 ± 0.141
0.258XaaPhe: 0.258 ± 0.396
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.258XaaMet: 0.258 ± 0.141
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3874 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski