Amino acid dipepetide frequency for Beihai hepe-like virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.91AlaAla: 5.91 ± 1.571
1.724AlaCys: 1.724 ± 0.718
3.694AlaAsp: 3.694 ± 0.941
5.171AlaGlu: 5.171 ± 0.701
2.709AlaPhe: 2.709 ± 0.418
3.94AlaGly: 3.94 ± 1.185
1.97AlaHis: 1.97 ± 0.547
4.925AlaIle: 4.925 ± 0.903
4.186AlaLys: 4.186 ± 0.446
6.402AlaLeu: 6.402 ± 2.279
0.739AlaMet: 0.739 ± 0.415
3.694AlaAsn: 3.694 ± 0.445
4.186AlaPro: 4.186 ± 0.437
1.97AlaGln: 1.97 ± 0.604
4.432AlaArg: 4.432 ± 0.581
6.156AlaSer: 6.156 ± 1.266
4.432AlaThr: 4.432 ± 1.801
7.141AlaVal: 7.141 ± 2.209
0.246AlaTrp: 0.246 ± 0.224
1.477AlaTyr: 1.477 ± 0.746
0.0AlaXaa: 0.0 ± 0.0
Cys
0.985CysAla: 0.985 ± 0.553
0.0CysCys: 0.0 ± 0.0
0.492CysAsp: 0.492 ± 0.277
0.492CysGlu: 0.492 ± 0.393
0.985CysPhe: 0.985 ± 0.403
0.0CysGly: 0.0 ± 0.0
0.492CysHis: 0.492 ± 0.559
1.231CysIle: 1.231 ± 0.691
0.739CysLys: 0.739 ± 0.819
0.985CysLeu: 0.985 ± 0.553
0.246CysMet: 0.246 ± 0.138
0.492CysAsn: 0.492 ± 0.559
0.0CysPro: 0.0 ± 0.0
0.739CysGln: 0.739 ± 0.415
1.477CysArg: 1.477 ± 1.076
0.739CysSer: 0.739 ± 0.415
0.985CysThr: 0.985 ± 0.318
1.477CysVal: 1.477 ± 0.83
0.246CysTrp: 0.246 ± 0.138
0.246CysTyr: 0.246 ± 0.138
0.0CysXaa: 0.0 ± 0.0
Asp
4.679AspAla: 4.679 ± 0.976
0.246AspCys: 0.246 ± 0.138
4.925AspAsp: 4.925 ± 0.185
2.709AspGlu: 2.709 ± 0.822
3.447AspPhe: 3.447 ± 0.791
3.447AspGly: 3.447 ± 1.402
0.492AspHis: 0.492 ± 0.277
3.694AspIle: 3.694 ± 1.903
1.477AspLys: 1.477 ± 1.076
5.91AspLeu: 5.91 ± 0.736
0.985AspMet: 0.985 ± 0.274
3.201AspAsn: 3.201 ± 0.946
2.955AspPro: 2.955 ± 0.999
1.231AspGln: 1.231 ± 0.832
1.97AspArg: 1.97 ± 0.604
3.694AspSer: 3.694 ± 1.192
4.925AspThr: 4.925 ± 0.58
5.171AspVal: 5.171 ± 0.289
2.955AspTrp: 2.955 ± 1.786
1.231AspTyr: 1.231 ± 0.721
0.0AspXaa: 0.0 ± 0.0
Glu
3.201GluAla: 3.201 ± 0.868
0.739GluCys: 0.739 ± 0.214
2.955GluAsp: 2.955 ± 0.667
2.709GluGlu: 2.709 ± 0.581
2.216GluPhe: 2.216 ± 0.868
2.955GluGly: 2.955 ± 0.645
0.739GluHis: 0.739 ± 0.214
2.709GluIle: 2.709 ± 0.581
2.216GluLys: 2.216 ± 0.755
5.171GluLeu: 5.171 ± 0.289
1.231GluMet: 1.231 ± 0.45
1.724GluAsn: 1.724 ± 0.184
2.955GluPro: 2.955 ± 0.581
2.709GluGln: 2.709 ± 0.779
2.216GluArg: 2.216 ± 1.47
3.447GluSer: 3.447 ± 1.032
4.679GluThr: 4.679 ± 2.28
6.895GluVal: 6.895 ± 1.199
0.985GluTrp: 0.985 ± 0.318
1.97GluTyr: 1.97 ± 1.106
0.0GluXaa: 0.0 ± 0.0
Phe
2.462PheAla: 2.462 ± 0.673
0.492PheCys: 0.492 ± 0.559
2.955PheAsp: 2.955 ± 0.833
2.216PheGlu: 2.216 ± 0.971
1.231PhePhe: 1.231 ± 0.24
0.246PheGly: 0.246 ± 0.138
0.985PheHis: 0.985 ± 0.478
2.216PheIle: 2.216 ± 0.33
2.216PheLys: 2.216 ± 0.497
2.709PheLeu: 2.709 ± 1.037
1.231PheMet: 1.231 ± 0.691
2.709PheAsn: 2.709 ± 0.987
0.739PhePro: 0.739 ± 0.374
2.216PheGln: 2.216 ± 0.33
1.231PheArg: 1.231 ± 0.691
3.94PheSer: 3.94 ± 1.394
2.955PheThr: 2.955 ± 0.779
2.709PheVal: 2.709 ± 1.046
0.985PheTrp: 0.985 ± 0.592
1.724PheTyr: 1.724 ± 0.371
0.0PheXaa: 0.0 ± 0.0
Gly
2.216GlyAla: 2.216 ± 1.067
0.492GlyCys: 0.492 ± 0.277
3.201GlyAsp: 3.201 ± 0.29
2.462GlyGlu: 2.462 ± 0.737
1.724GlyPhe: 1.724 ± 0.184
2.709GlyGly: 2.709 ± 0.71
1.724GlyHis: 1.724 ± 0.627
3.694GlyIle: 3.694 ± 1.732
1.477GlyLys: 1.477 ± 0.569
2.216GlyLeu: 2.216 ± 0.991
2.462GlyMet: 2.462 ± 1.051
1.231GlyAsn: 1.231 ± 0.473
4.186GlyPro: 4.186 ± 1.083
1.97GlyGln: 1.97 ± 0.229
1.97GlyArg: 1.97 ± 0.568
4.925GlySer: 4.925 ± 1.844
5.171GlyThr: 5.171 ± 0.886
4.432GlyVal: 4.432 ± 1.025
0.739GlyTrp: 0.739 ± 0.374
1.231GlyTyr: 1.231 ± 0.473
0.0GlyXaa: 0.0 ± 0.0
His
0.739HisAla: 0.739 ± 0.538
0.246HisCys: 0.246 ± 0.612
1.724HisAsp: 1.724 ± 0.764
0.985HisGlu: 0.985 ± 1.142
0.739HisPhe: 0.739 ± 0.415
1.724HisGly: 1.724 ± 0.968
0.0HisHis: 0.0 ± 0.0
0.739HisIle: 0.739 ± 0.541
1.477HisLys: 1.477 ± 1.179
1.97HisLeu: 1.97 ± 0.839
0.246HisMet: 0.246 ± 0.138
1.477HisAsn: 1.477 ± 0.705
0.739HisPro: 0.739 ± 0.373
1.477HisGln: 1.477 ± 0.28
0.492HisArg: 0.492 ± 0.17
1.97HisSer: 1.97 ± 0.635
1.477HisThr: 1.477 ± 0.747
3.447HisVal: 3.447 ± 1.251
0.246HisTrp: 0.246 ± 0.138
0.739HisTyr: 0.739 ± 0.415
0.0HisXaa: 0.0 ± 0.0
Ile
4.925IleAla: 4.925 ± 0.881
0.492IleCys: 0.492 ± 0.277
4.925IleAsp: 4.925 ± 1.2
5.417IleGlu: 5.417 ± 0.927
1.477IlePhe: 1.477 ± 1.366
2.216IleGly: 2.216 ± 0.261
0.985IleHis: 0.985 ± 0.553
3.201IleIle: 3.201 ± 0.928
3.447IleLys: 3.447 ± 0.923
2.709IleLeu: 2.709 ± 0.559
1.231IleMet: 1.231 ± 0.691
1.97IleAsn: 1.97 ± 0.907
4.186IlePro: 4.186 ± 1.219
1.477IleGln: 1.477 ± 0.571
2.462IleArg: 2.462 ± 0.722
4.925IleSer: 4.925 ± 0.825
5.417IleThr: 5.417 ± 0.987
4.186IleVal: 4.186 ± 1.043
0.985IleTrp: 0.985 ± 0.592
1.231IleTyr: 1.231 ± 0.598
0.0IleXaa: 0.0 ± 0.0
Lys
4.432LysAla: 4.432 ± 0.224
0.492LysCys: 0.492 ± 0.277
1.231LysAsp: 1.231 ± 0.598
2.462LysGlu: 2.462 ± 1.071
1.97LysPhe: 1.97 ± 0.839
2.955LysGly: 2.955 ± 1.003
0.739LysHis: 0.739 ± 0.819
3.694LysIle: 3.694 ± 1.163
1.97LysLys: 1.97 ± 0.864
3.94LysLeu: 3.94 ± 0.468
0.985LysMet: 0.985 ± 0.403
1.724LysAsn: 1.724 ± 0.371
7.141LysPro: 7.141 ± 2.563
2.955LysGln: 2.955 ± 0.779
2.462LysArg: 2.462 ± 0.895
2.955LysSer: 2.955 ± 0.584
3.694LysThr: 3.694 ± 0.993
3.447LysVal: 3.447 ± 0.983
1.477LysTrp: 1.477 ± 0.428
3.694LysTyr: 3.694 ± 0.851
0.0LysXaa: 0.0 ± 0.0
Leu
4.925LeuAla: 4.925 ± 0.59
1.477LeuCys: 1.477 ± 0.83
5.171LeuAsp: 5.171 ± 0.713
3.201LeuGlu: 3.201 ± 0.847
2.216LeuPhe: 2.216 ± 0.976
4.186LeuGly: 4.186 ± 0.721
2.216LeuHis: 2.216 ± 0.817
2.462LeuIle: 2.462 ± 0.737
5.664LeuLys: 5.664 ± 2.218
8.372LeuLeu: 8.372 ± 2.973
1.477LeuMet: 1.477 ± 0.444
3.94LeuAsn: 3.94 ± 0.969
2.955LeuPro: 2.955 ± 1.003
2.955LeuGln: 2.955 ± 1.395
3.694LeuArg: 3.694 ± 1.565
6.156LeuSer: 6.156 ± 2.283
6.156LeuThr: 6.156 ± 0.344
5.171LeuVal: 5.171 ± 1.551
1.231LeuTrp: 1.231 ± 0.361
2.955LeuTyr: 2.955 ± 0.833
0.0LeuXaa: 0.0 ± 0.0
Met
1.97MetAla: 1.97 ± 0.757
0.985MetCys: 0.985 ± 1.119
0.739MetAsp: 0.739 ± 0.362
1.231MetGlu: 1.231 ± 0.719
0.739MetPhe: 0.739 ± 0.538
1.231MetGly: 1.231 ± 0.518
0.985MetHis: 0.985 ± 0.553
0.492MetIle: 0.492 ± 0.277
1.231MetLys: 1.231 ± 0.693
2.462MetLeu: 2.462 ± 1.111
0.246MetMet: 0.246 ± 0.612
0.492MetAsn: 0.492 ± 0.277
1.724MetPro: 1.724 ± 0.615
0.492MetGln: 0.492 ± 0.277
0.0MetArg: 0.0 ± 0.0
2.216MetSer: 2.216 ± 0.975
1.724MetThr: 1.724 ± 0.567
0.985MetVal: 0.985 ± 0.456
0.492MetTrp: 0.492 ± 0.449
0.246MetTyr: 0.246 ± 0.224
0.0MetXaa: 0.0 ± 0.0
Asn
2.462AsnAla: 2.462 ± 1.345
0.739AsnCys: 0.739 ± 0.415
4.679AsnAsp: 4.679 ± 0.725
2.462AsnGlu: 2.462 ± 0.452
2.462AsnPhe: 2.462 ± 0.29
2.216AsnGly: 2.216 ± 0.858
0.739AsnHis: 0.739 ± 0.415
2.955AsnIle: 2.955 ± 0.833
1.477AsnLys: 1.477 ± 1.081
4.186AsnLeu: 4.186 ± 0.827
1.477AsnMet: 1.477 ± 0.232
2.216AsnAsn: 2.216 ± 1.067
1.724AsnPro: 1.724 ± 0.184
1.231AsnGln: 1.231 ± 0.518
2.216AsnArg: 2.216 ± 0.975
3.694AsnSer: 3.694 ± 0.908
4.679AsnThr: 4.679 ± 2.741
3.201AsnVal: 3.201 ± 0.546
0.492AsnTrp: 0.492 ± 0.277
2.216AsnTyr: 2.216 ± 0.923
0.0AsnXaa: 0.0 ± 0.0
Pro
7.387ProAla: 7.387 ± 1.835
0.492ProCys: 0.492 ± 0.277
2.709ProAsp: 2.709 ± 1.28
3.94ProGlu: 3.94 ± 0.336
1.97ProPhe: 1.97 ± 0.548
4.186ProGly: 4.186 ± 0.311
0.985ProHis: 0.985 ± 0.552
3.94ProIle: 3.94 ± 1.137
3.447ProLys: 3.447 ± 0.741
2.462ProLeu: 2.462 ± 0.479
0.246ProMet: 0.246 ± 0.612
2.216ProAsn: 2.216 ± 0.497
2.216ProPro: 2.216 ± 1.245
0.985ProGln: 0.985 ± 0.456
1.231ProArg: 1.231 ± 0.24
6.156ProSer: 6.156 ± 0.673
5.171ProThr: 5.171 ± 0.629
8.865ProVal: 8.865 ± 4.715
0.246ProTrp: 0.246 ± 0.224
0.985ProTyr: 0.985 ± 0.553
0.0ProXaa: 0.0 ± 0.0
Gln
3.447GlnAla: 3.447 ± 0.369
0.492GlnCys: 0.492 ± 0.277
0.739GlnAsp: 0.739 ± 0.362
0.739GlnGlu: 0.739 ± 0.214
0.985GlnPhe: 0.985 ± 0.592
2.955GlnGly: 2.955 ± 0.849
0.492GlnHis: 0.492 ± 0.277
0.739GlnIle: 0.739 ± 0.84
1.724GlnLys: 1.724 ± 0.704
1.97GlnLeu: 1.97 ± 0.429
1.724GlnMet: 1.724 ± 0.528
2.216GlnAsn: 2.216 ± 1.004
2.216GlnPro: 2.216 ± 0.975
1.477GlnGln: 1.477 ± 0.747
0.492GlnArg: 0.492 ± 0.277
3.201GlnSer: 3.201 ± 1.023
3.201GlnThr: 3.201 ± 0.602
3.694GlnVal: 3.694 ± 1.081
0.246GlnTrp: 0.246 ± 0.224
1.231GlnTyr: 1.231 ± 0.693
0.0GlnXaa: 0.0 ± 0.0
Arg
3.447ArgAla: 3.447 ± 0.992
0.0ArgCys: 0.0 ± 0.0
1.97ArgAsp: 1.97 ± 0.568
2.709ArgGlu: 2.709 ± 1.248
3.447ArgPhe: 3.447 ± 1.374
2.462ArgGly: 2.462 ± 1.792
1.477ArgHis: 1.477 ± 0.83
2.216ArgIle: 2.216 ± 0.522
1.97ArgLys: 1.97 ± 0.864
3.447ArgLeu: 3.447 ± 0.923
0.739ArgMet: 0.739 ± 0.552
2.709ArgAsn: 2.709 ± 0.372
2.955ArgPro: 2.955 ± 0.417
1.231ArgGln: 1.231 ± 0.24
2.216ArgArg: 2.216 ± 0.976
2.216ArgSer: 2.216 ± 0.624
2.216ArgThr: 2.216 ± 0.562
2.216ArgVal: 2.216 ± 0.33
0.246ArgTrp: 0.246 ± 0.138
0.492ArgTyr: 0.492 ± 0.393
0.0ArgXaa: 0.0 ± 0.0
Ser
4.186SerAla: 4.186 ± 1.204
1.477SerCys: 1.477 ± 0.569
4.432SerAsp: 4.432 ± 0.522
1.97SerGlu: 1.97 ± 0.568
3.447SerPhe: 3.447 ± 1.273
3.201SerGly: 3.201 ± 0.884
1.231SerHis: 1.231 ± 0.721
6.649SerIle: 6.649 ± 1.911
4.679SerLys: 4.679 ± 0.828
3.694SerLeu: 3.694 ± 1.542
1.724SerMet: 1.724 ± 1.232
6.895SerAsn: 6.895 ± 0.617
4.925SerPro: 4.925 ± 2.091
1.97SerGln: 1.97 ± 0.484
2.955SerArg: 2.955 ± 1.064
7.387SerSer: 7.387 ± 0.646
6.649SerThr: 6.649 ± 1.956
7.634SerVal: 7.634 ± 2.744
0.492SerTrp: 0.492 ± 0.17
1.231SerTyr: 1.231 ± 0.473
0.0SerXaa: 0.0 ± 0.0
Thr
6.156ThrAla: 6.156 ± 1.34
0.985ThrCys: 0.985 ± 0.553
5.417ThrAsp: 5.417 ± 0.992
3.201ThrGlu: 3.201 ± 0.563
2.462ThrPhe: 2.462 ± 0.609
1.724ThrGly: 1.724 ± 1.163
1.97ThrHis: 1.97 ± 0.798
6.649ThrIle: 6.649 ± 2.172
7.141ThrLys: 7.141 ± 2.064
6.895ThrLeu: 6.895 ± 0.678
1.724ThrMet: 1.724 ± 1.029
3.201ThrAsn: 3.201 ± 1.178
4.432ThrPro: 4.432 ± 1.531
3.201ThrGln: 3.201 ± 0.418
3.201ThrArg: 3.201 ± 0.475
5.171ThrSer: 5.171 ± 0.653
5.91ThrThr: 5.91 ± 0.692
9.111ThrVal: 9.111 ± 5.664
1.231ThrTrp: 1.231 ± 0.535
0.739ThrTyr: 0.739 ± 0.538
0.0ThrXaa: 0.0 ± 0.0
Val
8.126ValAla: 8.126 ± 1.619
0.492ValCys: 0.492 ± 0.277
4.432ValAsp: 4.432 ± 1.088
8.126ValGlu: 8.126 ± 3.099
2.709ValPhe: 2.709 ± 0.898
5.417ValGly: 5.417 ± 0.599
1.97ValHis: 1.97 ± 0.547
3.694ValIle: 3.694 ± 1.864
5.91ValLys: 5.91 ± 0.898
6.402ValLeu: 6.402 ± 1.86
1.231ValMet: 1.231 ± 0.599
4.432ValAsn: 4.432 ± 1.288
6.895ValPro: 6.895 ± 1.156
2.709ValGln: 2.709 ± 1.243
3.447ValArg: 3.447 ± 0.235
4.925ValSer: 4.925 ± 0.86
7.387ValThr: 7.387 ± 2.213
7.387ValVal: 7.387 ± 2.439
3.447ValTrp: 3.447 ± 2.83
2.955ValTyr: 2.955 ± 0.645
0.0ValXaa: 0.0 ± 0.0
Trp
0.739TrpAla: 0.739 ± 0.214
0.0TrpCys: 0.0 ± 0.0
0.985TrpAsp: 0.985 ± 0.898
0.985TrpGlu: 0.985 ± 0.34
0.492TrpPhe: 0.492 ± 0.17
0.0TrpGly: 0.0 ± 0.0
1.231TrpHis: 1.231 ± 1.122
0.739TrpIle: 0.739 ± 0.415
0.492TrpLys: 0.492 ± 0.277
2.216TrpLeu: 2.216 ± 1.451
0.246TrpMet: 0.246 ± 0.224
0.0TrpAsn: 0.0 ± 0.0
0.739TrpPro: 0.739 ± 0.374
0.739TrpGln: 0.739 ± 0.374
0.739TrpArg: 0.739 ± 0.415
1.477TrpSer: 1.477 ± 1.037
1.97TrpThr: 1.97 ± 0.907
1.97TrpVal: 1.97 ± 0.681
0.246TrpTrp: 0.246 ± 0.224
1.477TrpTyr: 1.477 ± 1.037
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.709TyrAla: 2.709 ± 1.494
0.985TyrCys: 0.985 ± 0.403
1.97TyrAsp: 1.97 ± 0.229
1.231TyrGlu: 1.231 ± 0.691
0.739TyrPhe: 0.739 ± 0.214
2.462TyrGly: 2.462 ± 0.609
1.231TyrHis: 1.231 ± 0.719
1.231TyrIle: 1.231 ± 0.24
1.477TyrLys: 1.477 ± 0.569
2.462TyrLeu: 2.462 ± 1.383
0.246TyrMet: 0.246 ± 0.138
0.985TyrAsn: 0.985 ± 1.119
1.97TyrPro: 1.97 ± 0.229
0.246TyrGln: 0.246 ± 0.138
1.477TyrArg: 1.477 ± 0.83
1.477TyrSer: 1.477 ± 0.83
1.724TyrThr: 1.724 ± 0.965
3.201TyrVal: 3.201 ± 1.791
0.246TyrTrp: 0.246 ± 0.138
0.739TyrTyr: 0.739 ± 0.538
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (4062 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski