Amino acid dipepetide frequency for Beihai hepe-like virus 9

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.362AlaAla: 6.362 ± 2.18
1.212AlaCys: 1.212 ± 0.64
4.544AlaAsp: 4.544 ± 2.071
3.938AlaGlu: 3.938 ± 1.833
3.332AlaPhe: 3.332 ± 0.848
5.15AlaGly: 5.15 ± 0.99
3.029AlaHis: 3.029 ± 0.921
6.059AlaIle: 6.059 ± 1.637
2.121AlaLys: 2.121 ± 0.762
4.847AlaLeu: 4.847 ± 0.87
3.332AlaMet: 3.332 ± 0.504
0.909AlaAsn: 0.909 ± 0.317
5.756AlaPro: 5.756 ± 0.453
3.029AlaGln: 3.029 ± 0.595
3.938AlaArg: 3.938 ± 1.063
3.332AlaSer: 3.332 ± 1.043
4.241AlaThr: 4.241 ± 0.276
4.847AlaVal: 4.847 ± 1.062
0.909AlaTrp: 0.909 ± 0.542
3.029AlaTyr: 3.029 ± 0.41
0.0AlaXaa: 0.0 ± 0.0
Cys
0.606CysAla: 0.606 ± 0.32
0.303CysCys: 0.303 ± 0.16
0.303CysAsp: 0.303 ± 0.16
1.212CysGlu: 1.212 ± 0.64
0.0CysPhe: 0.0 ± 0.0
0.303CysGly: 0.303 ± 0.16
1.515CysHis: 1.515 ± 0.8
1.212CysIle: 1.212 ± 0.419
1.515CysLys: 1.515 ± 0.62
1.212CysLeu: 1.212 ± 0.587
0.0CysMet: 0.0 ± 0.0
1.212CysAsn: 1.212 ± 0.698
0.606CysPro: 0.606 ± 0.542
0.303CysGln: 0.303 ± 0.16
0.606CysArg: 0.606 ± 0.32
2.424CysSer: 2.424 ± 0.597
1.212CysThr: 1.212 ± 0.419
0.909CysVal: 0.909 ± 0.317
0.303CysTrp: 0.303 ± 0.16
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.332AspAla: 3.332 ± 1.134
1.212AspCys: 1.212 ± 0.362
3.029AspAsp: 3.029 ± 0.448
3.332AspGlu: 3.332 ± 0.879
3.029AspPhe: 3.029 ± 0.595
2.121AspGly: 2.121 ± 0.762
3.029AspHis: 3.029 ± 1.018
2.726AspIle: 2.726 ± 0.78
2.726AspLys: 2.726 ± 1.025
5.15AspLeu: 5.15 ± 1.368
0.909AspMet: 0.909 ± 0.779
3.332AspAsn: 3.332 ± 0.808
2.726AspPro: 2.726 ± 1.015
3.029AspGln: 3.029 ± 0.43
1.515AspArg: 1.515 ± 0.669
3.029AspSer: 3.029 ± 0.595
3.029AspThr: 3.029 ± 0.499
3.029AspVal: 3.029 ± 0.614
0.606AspTrp: 0.606 ± 0.32
3.635AspTyr: 3.635 ± 1.032
0.0AspXaa: 0.0 ± 0.0
Glu
2.726GluAla: 2.726 ± 0.671
0.909GluCys: 0.909 ± 0.48
2.424GluAsp: 2.424 ± 1.281
4.544GluGlu: 4.544 ± 0.94
2.424GluPhe: 2.424 ± 0.907
2.726GluGly: 2.726 ± 0.785
2.121GluHis: 2.121 ± 0.661
4.241GluIle: 4.241 ± 1.322
2.726GluLys: 2.726 ± 0.78
5.756GluLeu: 5.756 ± 0.942
2.121GluMet: 2.121 ± 1.038
3.635GluAsn: 3.635 ± 0.936
2.726GluPro: 2.726 ± 1.064
3.029GluGln: 3.029 ± 1.203
1.515GluArg: 1.515 ± 0.297
4.241GluSer: 4.241 ± 2.572
5.15GluThr: 5.15 ± 1.928
4.847GluVal: 4.847 ± 1.047
0.0GluTrp: 0.0 ± 0.0
1.818GluTyr: 1.818 ± 0.66
0.0GluXaa: 0.0 ± 0.0
Phe
2.121PheAla: 2.121 ± 0.762
0.606PheCys: 0.606 ± 0.32
3.332PheAsp: 3.332 ± 1.284
2.424PheGlu: 2.424 ± 0.521
3.635PhePhe: 3.635 ± 0.822
2.424PheGly: 2.424 ± 1.134
1.818PheHis: 1.818 ± 1.047
2.121PheIle: 2.121 ± 0.903
1.818PheLys: 1.818 ± 0.634
2.424PheLeu: 2.424 ± 1.016
0.909PheMet: 0.909 ± 0.317
2.424PheAsn: 2.424 ± 0.69
1.818PhePro: 1.818 ± 0.781
1.212PheGln: 1.212 ± 0.999
2.121PheArg: 2.121 ± 1.649
2.424PheSer: 2.424 ± 0.756
1.818PheThr: 1.818 ± 0.775
3.332PheVal: 3.332 ± 1.519
0.909PheTrp: 0.909 ± 0.52
2.424PheTyr: 2.424 ± 1.197
0.0PheXaa: 0.0 ± 0.0
Gly
4.847GlyAla: 4.847 ± 0.932
0.909GlyCys: 0.909 ± 0.48
2.726GlyAsp: 2.726 ± 1.052
3.635GlyGlu: 3.635 ± 0.901
2.726GlyPhe: 2.726 ± 1.118
4.241GlyGly: 4.241 ± 1.012
1.515GlyHis: 1.515 ± 1.657
3.029GlyIle: 3.029 ± 0.734
4.241GlyLys: 4.241 ± 0.978
3.635GlyLeu: 3.635 ± 1.085
1.818GlyMet: 1.818 ± 0.961
3.938GlyAsn: 3.938 ± 2.252
1.515GlyPro: 1.515 ± 0.869
1.818GlyGln: 1.818 ± 0.775
3.029GlyArg: 3.029 ± 0.734
3.332GlySer: 3.332 ± 1.246
5.453GlyThr: 5.453 ± 0.579
4.241GlyVal: 4.241 ± 1.033
0.909GlyTrp: 0.909 ± 0.317
2.424GlyTyr: 2.424 ± 0.723
0.0GlyXaa: 0.0 ± 0.0
His
2.121HisAla: 2.121 ± 0.898
0.909HisCys: 0.909 ± 0.48
2.726HisAsp: 2.726 ± 1.17
2.121HisGlu: 2.121 ± 0.762
1.818HisPhe: 1.818 ± 0.587
2.424HisGly: 2.424 ± 1.031
0.606HisHis: 0.606 ± 0.32
1.818HisIle: 1.818 ± 1.318
1.818HisLys: 1.818 ± 0.66
2.121HisLeu: 2.121 ± 0.898
0.0HisMet: 0.0 ± 0.0
2.424HisAsn: 2.424 ± 0.837
2.726HisPro: 2.726 ± 0.785
0.0HisGln: 0.0 ± 0.0
1.515HisArg: 1.515 ± 0.46
1.818HisSer: 1.818 ± 0.625
1.818HisThr: 1.818 ± 0.545
2.424HisVal: 2.424 ± 1.771
0.606HisTrp: 0.606 ± 0.401
1.515HisTyr: 1.515 ± 0.648
0.0HisXaa: 0.0 ± 0.0
Ile
4.544IleAla: 4.544 ± 1.961
1.212IleCys: 1.212 ± 0.579
3.635IleAsp: 3.635 ± 0.851
2.726IleGlu: 2.726 ± 0.812
1.818IlePhe: 1.818 ± 0.545
4.544IleGly: 4.544 ± 1.89
1.818IleHis: 1.818 ± 0.273
4.544IleIle: 4.544 ± 1.969
3.029IleLys: 3.029 ± 0.991
3.938IleLeu: 3.938 ± 0.83
0.909IleMet: 0.909 ± 0.317
2.726IleAsn: 2.726 ± 1.025
4.544IlePro: 4.544 ± 1.203
3.029IleGln: 3.029 ± 1.366
1.515IleArg: 1.515 ± 0.539
4.241IleSer: 4.241 ± 1.804
5.453IleThr: 5.453 ± 1.874
4.544IleVal: 4.544 ± 1.711
0.303IleTrp: 0.303 ± 0.16
3.938IleTyr: 3.938 ± 0.621
0.0IleXaa: 0.0 ± 0.0
Lys
3.938LysAla: 3.938 ± 1.17
0.909LysCys: 0.909 ± 0.48
2.726LysAsp: 2.726 ± 1.441
3.029LysGlu: 3.029 ± 1.178
2.424LysPhe: 2.424 ± 1.104
3.938LysGly: 3.938 ± 0.844
1.212LysHis: 1.212 ± 0.802
4.241LysIle: 4.241 ± 1.193
2.726LysLys: 2.726 ± 1.052
2.726LysLeu: 2.726 ± 0.791
1.515LysMet: 1.515 ± 0.539
2.726LysAsn: 2.726 ± 0.843
2.424LysPro: 2.424 ± 0.466
4.544LysGln: 4.544 ± 1.172
1.515LysArg: 1.515 ± 0.744
1.818LysSer: 1.818 ± 0.273
4.847LysThr: 4.847 ± 1.456
4.544LysVal: 4.544 ± 1.218
1.212LysTrp: 1.212 ± 0.419
1.515LysTyr: 1.515 ± 0.509
0.0LysXaa: 0.0 ± 0.0
Leu
4.847LeuAla: 4.847 ± 1.88
1.212LeuCys: 1.212 ± 0.64
4.847LeuAsp: 4.847 ± 1.255
4.241LeuGlu: 4.241 ± 1.303
2.424LeuPhe: 2.424 ± 1.016
4.544LeuGly: 4.544 ± 1.718
1.818LeuHis: 1.818 ± 0.977
5.15LeuIle: 5.15 ± 1.953
3.938LeuLys: 3.938 ± 0.588
4.241LeuLeu: 4.241 ± 1.183
2.121LeuMet: 2.121 ± 0.727
4.847LeuAsn: 4.847 ± 0.879
4.847LeuPro: 4.847 ± 1.446
3.635LeuGln: 3.635 ± 2.245
2.121LeuArg: 2.121 ± 1.026
4.847LeuSer: 4.847 ± 1.4
4.544LeuThr: 4.544 ± 1.599
4.241LeuVal: 4.241 ± 1.038
0.606LeuTrp: 0.606 ± 0.32
2.121LeuTyr: 2.121 ± 1.349
0.0LeuXaa: 0.0 ± 0.0
Met
3.029MetAla: 3.029 ± 0.857
0.303MetCys: 0.303 ± 0.441
1.212MetAsp: 1.212 ± 0.362
1.212MetGlu: 1.212 ± 0.64
1.212MetPhe: 1.212 ± 0.587
0.909MetGly: 0.909 ± 0.317
1.212MetHis: 1.212 ± 0.64
0.303MetIle: 0.303 ± 0.16
1.818MetLys: 1.818 ± 0.634
2.121MetLeu: 2.121 ± 0.527
0.909MetMet: 0.909 ± 0.48
1.212MetAsn: 1.212 ± 0.877
0.606MetPro: 0.606 ± 0.32
1.212MetGln: 1.212 ± 0.829
1.212MetArg: 1.212 ± 0.362
1.515MetSer: 1.515 ± 0.297
1.212MetThr: 1.212 ± 1.081
0.909MetVal: 0.909 ± 0.377
0.303MetTrp: 0.303 ± 0.587
0.303MetTyr: 0.303 ± 0.441
0.0MetXaa: 0.0 ± 0.0
Asn
3.029AsnAla: 3.029 ± 0.963
0.606AsnCys: 0.606 ± 0.525
1.818AsnAsp: 1.818 ± 0.628
2.424AsnGlu: 2.424 ± 0.874
3.332AsnPhe: 3.332 ± 2.175
3.029AsnGly: 3.029 ± 0.921
1.212AsnHis: 1.212 ± 0.64
4.241AsnIle: 4.241 ± 1.843
1.515AsnLys: 1.515 ± 0.509
3.938AsnLeu: 3.938 ± 1.205
1.818AsnMet: 1.818 ± 0.634
1.818AsnAsn: 1.818 ± 1.047
2.424AsnPro: 2.424 ± 0.447
0.909AsnGln: 0.909 ± 0.542
2.726AsnArg: 2.726 ± 0.796
2.726AsnSer: 2.726 ± 1.015
2.726AsnThr: 2.726 ± 0.455
2.726AsnVal: 2.726 ± 0.895
0.909AsnTrp: 0.909 ± 0.488
4.241AsnTyr: 4.241 ± 2.194
0.0AsnXaa: 0.0 ± 0.0
Pro
1.818ProAla: 1.818 ± 1.204
0.303ProCys: 0.303 ± 0.587
3.938ProAsp: 3.938 ± 1.972
3.635ProGlu: 3.635 ± 1.218
2.121ProPhe: 2.121 ± 1.787
3.332ProGly: 3.332 ± 1.178
0.303ProHis: 0.303 ± 0.16
4.241ProIle: 4.241 ± 1.029
3.938ProLys: 3.938 ± 0.881
4.544ProLeu: 4.544 ± 1.614
0.606ProMet: 0.606 ± 0.349
3.029ProAsn: 3.029 ± 0.593
3.635ProPro: 3.635 ± 1.179
1.818ProGln: 1.818 ± 0.628
1.818ProArg: 1.818 ± 0.628
4.241ProSer: 4.241 ± 1.843
3.635ProThr: 3.635 ± 0.901
2.121ProVal: 2.121 ± 0.585
0.303ProTrp: 0.303 ± 0.16
3.332ProTyr: 3.332 ± 1.333
0.0ProXaa: 0.0 ± 0.0
Gln
2.726GlnAla: 2.726 ± 0.583
0.606GlnCys: 0.606 ± 0.349
4.241GlnAsp: 4.241 ± 1.23
2.424GlnGlu: 2.424 ± 0.837
2.424GlnPhe: 2.424 ± 1.558
2.726GlnGly: 2.726 ± 1.03
0.303GlnHis: 0.303 ± 0.441
1.515GlnIle: 1.515 ± 0.539
2.121GlnLys: 2.121 ± 0.781
4.847GlnLeu: 4.847 ± 1.511
0.909GlnMet: 0.909 ± 1.399
1.515GlnAsn: 1.515 ± 1.287
1.212GlnPro: 1.212 ± 0.714
3.635GlnGln: 3.635 ± 1.139
0.909GlnArg: 0.909 ± 0.48
4.544GlnSer: 4.544 ± 1.218
4.241GlnThr: 4.241 ± 1.038
2.726GlnVal: 2.726 ± 0.785
1.212GlnTrp: 1.212 ± 0.802
0.909GlnTyr: 0.909 ± 0.377
0.0GlnXaa: 0.0 ± 0.0
Arg
3.635ArgAla: 3.635 ± 1.202
0.606ArgCys: 0.606 ± 0.32
1.515ArgAsp: 1.515 ± 0.509
2.121ArgGlu: 2.121 ± 1.121
1.818ArgPhe: 1.818 ± 1.035
2.121ArgGly: 2.121 ± 0.777
1.818ArgHis: 1.818 ± 0.775
1.818ArgIle: 1.818 ± 1.204
2.726ArgLys: 2.726 ± 0.785
3.029ArgLeu: 3.029 ± 0.448
0.909ArgMet: 0.909 ± 0.317
1.515ArgAsn: 1.515 ± 0.764
0.909ArgPro: 0.909 ± 0.779
0.909ArgGln: 0.909 ± 0.317
0.0ArgArg: 0.0 ± 0.0
2.121ArgSer: 2.121 ± 0.898
3.938ArgThr: 3.938 ± 1.324
3.029ArgVal: 3.029 ± 1.069
0.303ArgTrp: 0.303 ± 0.16
3.029ArgTyr: 3.029 ± 0.614
0.0ArgXaa: 0.0 ± 0.0
Ser
6.665SerAla: 6.665 ± 1.388
1.212SerCys: 1.212 ± 0.877
2.726SerAsp: 2.726 ± 0.455
3.938SerGlu: 3.938 ± 1.646
2.424SerPhe: 2.424 ± 0.737
3.635SerGly: 3.635 ± 1.033
1.818SerHis: 1.818 ± 0.753
4.544SerIle: 4.544 ± 0.982
4.544SerLys: 4.544 ± 0.509
4.544SerLeu: 4.544 ± 1.098
0.909SerMet: 0.909 ± 1.07
2.121SerAsn: 2.121 ± 1.602
3.332SerPro: 3.332 ± 1.365
4.241SerGln: 4.241 ± 0.884
1.212SerArg: 1.212 ± 0.714
5.453SerSer: 5.453 ± 3.091
4.241SerThr: 4.241 ± 2.467
6.059SerVal: 6.059 ± 0.661
0.606SerTrp: 0.606 ± 0.882
1.818SerTyr: 1.818 ± 1.342
0.0SerXaa: 0.0 ± 0.0
Thr
5.453ThrAla: 5.453 ± 0.699
0.909ThrCys: 0.909 ± 0.48
3.029ThrAsp: 3.029 ± 1.178
3.938ThrGlu: 3.938 ± 0.324
3.029ThrPhe: 3.029 ± 0.728
5.15ThrGly: 5.15 ± 2.068
3.938ThrHis: 3.938 ± 1.269
5.453ThrIle: 5.453 ± 1.26
3.938ThrLys: 3.938 ± 0.825
4.544ThrLeu: 4.544 ± 1.011
0.303ThrMet: 0.303 ± 0.16
2.726ThrAsn: 2.726 ± 0.671
4.544ThrPro: 4.544 ± 0.998
3.332ThrGln: 3.332 ± 0.687
2.726ThrArg: 2.726 ± 1.441
5.15ThrSer: 5.15 ± 1.346
4.847ThrThr: 4.847 ± 1.659
5.453ThrVal: 5.453 ± 1.744
1.212ThrTrp: 1.212 ± 0.419
3.029ThrTyr: 3.029 ± 1.737
0.0ThrXaa: 0.0 ± 0.0
Val
7.876ValAla: 7.876 ± 1.38
0.606ValCys: 0.606 ± 0.32
2.121ValAsp: 2.121 ± 0.907
5.453ValGlu: 5.453 ± 1.953
1.212ValPhe: 1.212 ± 0.547
4.241ValGly: 4.241 ± 0.668
2.424ValHis: 2.424 ± 0.671
3.635ValIle: 3.635 ± 1.136
3.635ValLys: 3.635 ± 0.967
2.726ValLeu: 2.726 ± 0.583
0.909ValMet: 0.909 ± 0.716
2.726ValAsn: 2.726 ± 1.03
5.756ValPro: 5.756 ± 2.085
2.726ValGln: 2.726 ± 0.677
3.635ValArg: 3.635 ± 1.139
4.847ValSer: 4.847 ± 1.309
5.15ValThr: 5.15 ± 0.773
5.15ValVal: 5.15 ± 0.903
0.0ValTrp: 0.0 ± 0.0
3.029ValTyr: 3.029 ± 0.904
0.0ValXaa: 0.0 ± 0.0
Trp
1.818TrpAla: 1.818 ± 0.961
0.303TrpCys: 0.303 ± 0.48
0.0TrpAsp: 0.0 ± 0.0
0.303TrpGlu: 0.303 ± 0.441
0.606TrpPhe: 0.606 ± 0.32
0.303TrpGly: 0.303 ± 0.16
0.303TrpHis: 0.303 ± 0.16
0.606TrpIle: 0.606 ± 0.76
0.606TrpLys: 0.606 ± 0.401
1.515TrpLeu: 1.515 ± 0.648
0.303TrpMet: 0.303 ± 0.16
0.606TrpAsn: 0.606 ± 0.32
0.0TrpPro: 0.0 ± 0.0
1.212TrpGln: 1.212 ± 0.902
0.909TrpArg: 0.909 ± 0.48
0.606TrpSer: 0.606 ± 0.349
0.909TrpThr: 0.909 ± 0.377
0.909TrpVal: 0.909 ± 0.656
0.0TrpTrp: 0.0 ± 0.0
0.303TrpTyr: 0.303 ± 0.16
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.424TyrAla: 2.424 ± 0.333
0.909TyrCys: 0.909 ± 0.654
3.635TyrAsp: 3.635 ± 0.73
3.332TyrGlu: 3.332 ± 0.879
0.303TyrPhe: 0.303 ± 0.16
2.121TyrGly: 2.121 ± 0.727
1.515TyrHis: 1.515 ± 0.564
1.515TyrIle: 1.515 ± 0.79
3.029TyrLys: 3.029 ± 1.24
3.332TyrLeu: 3.332 ± 0.561
1.212TyrMet: 1.212 ± 0.583
2.726TyrAsn: 2.726 ± 0.577
0.606TyrPro: 0.606 ± 0.349
2.121TyrGln: 2.121 ± 0.334
3.332TyrArg: 3.332 ± 1.333
3.332TyrSer: 3.332 ± 1.188
4.241TyrThr: 4.241 ± 2.194
1.818TyrVal: 1.818 ± 0.625
0.909TyrTrp: 0.909 ± 0.317
3.635TyrTyr: 3.635 ± 1.735
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3302 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski