Amino acid dipepetide frequency for Shahe hepe-like virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.524AlaAla: 8.524 ± 1.772
1.176AlaCys: 1.176 ± 0.473
3.821AlaAsp: 3.821 ± 0.962
6.761AlaGlu: 6.761 ± 2.652
2.352AlaPhe: 2.352 ± 0.928
2.646AlaGly: 2.646 ± 0.914
1.47AlaHis: 1.47 ± 0.6
4.115AlaIle: 4.115 ± 0.887
5.585AlaLys: 5.585 ± 1.516
6.761AlaLeu: 6.761 ± 0.477
1.47AlaMet: 1.47 ± 0.357
2.646AlaAsn: 2.646 ± 1.377
2.646AlaPro: 2.646 ± 0.886
3.527AlaGln: 3.527 ± 1.144
3.527AlaArg: 3.527 ± 0.895
6.761AlaSer: 6.761 ± 0.95
5.585AlaThr: 5.585 ± 0.833
4.409AlaVal: 4.409 ± 0.885
0.294AlaTrp: 0.294 ± 0.426
0.588AlaTyr: 0.588 ± 0.891
0.0AlaXaa: 0.0 ± 0.0
Cys
0.588CysAla: 0.588 ± 0.857
0.294CysCys: 0.294 ± 0.159
1.176CysAsp: 1.176 ± 0.473
0.294CysGlu: 0.294 ± 0.426
1.176CysPhe: 1.176 ± 0.634
2.058CysGly: 2.058 ± 0.741
0.588CysHis: 0.588 ± 0.37
0.882CysIle: 0.882 ± 0.414
2.939CysLys: 2.939 ± 0.582
1.176CysLeu: 1.176 ± 0.595
0.588CysMet: 0.588 ± 0.317
1.47CysAsn: 1.47 ± 0.422
0.882CysPro: 0.882 ± 0.476
0.588CysGln: 0.588 ± 0.37
0.0CysArg: 0.0 ± 0.0
0.882CysSer: 0.882 ± 0.476
1.176CysThr: 1.176 ± 0.609
0.294CysVal: 0.294 ± 0.159
0.294CysTrp: 0.294 ± 0.392
0.882CysTyr: 0.882 ± 0.81
0.0CysXaa: 0.0 ± 0.0
Asp
2.646AspAla: 2.646 ± 0.323
0.882AspCys: 0.882 ± 0.476
4.703AspAsp: 4.703 ± 1.614
3.821AspGlu: 3.821 ± 0.783
3.821AspPhe: 3.821 ± 1.0
3.233AspGly: 3.233 ± 0.856
1.47AspHis: 1.47 ± 0.515
6.173AspIle: 6.173 ± 2.342
4.703AspLys: 4.703 ± 2.155
5.291AspLeu: 5.291 ± 1.005
1.176AspMet: 1.176 ± 0.435
3.527AspAsn: 3.527 ± 0.788
2.058AspPro: 2.058 ± 1.345
1.764AspGln: 1.764 ± 0.922
1.176AspArg: 1.176 ± 0.634
2.646AspSer: 2.646 ± 2.527
3.233AspThr: 3.233 ± 0.957
3.233AspVal: 3.233 ± 1.381
0.588AspTrp: 0.588 ± 0.317
2.646AspTyr: 2.646 ± 1.052
0.0AspXaa: 0.0 ± 0.0
Glu
4.115GluAla: 4.115 ± 1.424
0.882GluCys: 0.882 ± 0.481
2.939GluAsp: 2.939 ± 1.031
4.997GluGlu: 4.997 ± 1.941
5.879GluPhe: 5.879 ± 2.088
2.058GluGly: 2.058 ± 0.729
1.47GluHis: 1.47 ± 0.6
2.939GluIle: 2.939 ± 0.864
4.997GluLys: 4.997 ± 0.593
5.291GluLeu: 5.291 ± 1.791
2.939GluMet: 2.939 ± 0.824
1.47GluAsn: 1.47 ± 0.793
3.527GluPro: 3.527 ± 0.895
1.47GluGln: 1.47 ± 0.786
2.646GluArg: 2.646 ± 0.507
2.646GluSer: 2.646 ± 0.779
3.527GluThr: 3.527 ± 1.004
4.703GluVal: 4.703 ± 1.109
0.0GluTrp: 0.0 ± 0.0
2.939GluTyr: 2.939 ± 1.231
0.0GluXaa: 0.0 ± 0.0
Phe
6.173PheAla: 6.173 ± 1.382
0.882PheCys: 0.882 ± 0.615
5.585PheAsp: 5.585 ± 1.803
3.233PheGlu: 3.233 ± 1.49
2.352PhePhe: 2.352 ± 0.626
4.997PheGly: 4.997 ± 1.017
1.176PheHis: 1.176 ± 0.435
3.821PheIle: 3.821 ± 0.99
2.058PheLys: 2.058 ± 2.355
3.233PheLeu: 3.233 ± 0.776
2.646PheMet: 2.646 ± 0.748
2.939PheAsn: 2.939 ± 1.068
0.588PhePro: 0.588 ± 0.356
2.646PheGln: 2.646 ± 0.784
1.176PheArg: 1.176 ± 0.774
2.058PheSer: 2.058 ± 2.077
5.291PheThr: 5.291 ± 1.675
2.939PheVal: 2.939 ± 0.634
0.588PheTrp: 0.588 ± 0.317
2.939PheTyr: 2.939 ± 1.099
0.0PheXaa: 0.0 ± 0.0
Gly
5.291GlyAla: 5.291 ± 1.619
0.882GlyCys: 0.882 ± 0.414
3.821GlyAsp: 3.821 ± 1.189
2.352GlyGlu: 2.352 ± 1.091
2.352GlyPhe: 2.352 ± 1.006
1.764GlyGly: 1.764 ± 0.365
1.47GlyHis: 1.47 ± 0.793
3.233GlyIle: 3.233 ± 0.69
5.585GlyLys: 5.585 ± 1.254
2.352GlyLeu: 2.352 ± 1.02
1.47GlyMet: 1.47 ± 0.622
2.646GlyAsn: 2.646 ± 0.618
3.233GlyPro: 3.233 ± 0.875
0.882GlyGln: 0.882 ± 0.393
1.47GlyArg: 1.47 ± 0.434
2.058GlySer: 2.058 ± 0.897
3.821GlyThr: 3.821 ± 2.644
5.291GlyVal: 5.291 ± 1.302
0.882GlyTrp: 0.882 ± 0.351
2.646GlyTyr: 2.646 ± 0.576
0.0GlyXaa: 0.0 ± 0.0
His
3.233HisAla: 3.233 ± 1.015
0.588HisCys: 0.588 ± 0.317
1.47HisAsp: 1.47 ± 0.524
0.588HisGlu: 0.588 ± 0.317
1.176HisPhe: 1.176 ± 0.411
1.764HisGly: 1.764 ± 0.755
0.588HisHis: 0.588 ± 0.317
2.646HisIle: 2.646 ± 0.939
2.058HisLys: 2.058 ± 0.741
1.176HisLeu: 1.176 ± 0.521
0.588HisMet: 0.588 ± 0.919
1.47HisAsn: 1.47 ± 0.57
0.0HisPro: 0.0 ± 0.0
2.352HisGln: 2.352 ± 0.568
0.294HisArg: 0.294 ± 0.159
2.058HisSer: 2.058 ± 0.658
1.47HisThr: 1.47 ± 0.793
0.882HisVal: 0.882 ± 0.393
0.294HisTrp: 0.294 ± 0.159
0.882HisTyr: 0.882 ± 0.476
0.0HisXaa: 0.0 ± 0.0
Ile
5.585IleAla: 5.585 ± 0.852
2.058IleCys: 2.058 ± 0.897
2.939IleAsp: 2.939 ± 0.703
4.997IleGlu: 4.997 ± 1.04
2.939IlePhe: 2.939 ± 1.668
3.527IleGly: 3.527 ± 1.233
1.47IleHis: 1.47 ± 0.616
4.115IleIle: 4.115 ± 0.899
3.527IleLys: 3.527 ± 1.083
4.997IleLeu: 4.997 ± 1.603
1.176IleMet: 1.176 ± 0.647
4.703IleAsn: 4.703 ± 1.533
3.821IlePro: 3.821 ± 0.988
2.939IleGln: 2.939 ± 1.048
1.764IleArg: 1.764 ± 0.642
6.467IleSer: 6.467 ± 1.433
5.585IleThr: 5.585 ± 0.997
5.585IleVal: 5.585 ± 2.089
0.294IleTrp: 0.294 ± 0.486
2.939IleTyr: 2.939 ± 0.891
0.0IleXaa: 0.0 ± 0.0
Lys
5.291LysAla: 5.291 ± 0.75
2.352LysCys: 2.352 ± 1.473
4.115LysAsp: 4.115 ± 1.067
3.821LysGlu: 3.821 ± 1.113
5.879LysPhe: 5.879 ± 1.139
4.409LysGly: 4.409 ± 1.045
0.882LysHis: 0.882 ± 0.351
6.467LysIle: 6.467 ± 1.742
5.879LysLys: 5.879 ± 1.002
6.761LysLeu: 6.761 ± 1.154
2.352LysMet: 2.352 ± 0.49
2.352LysAsn: 2.352 ± 0.861
2.352LysPro: 2.352 ± 0.543
2.939LysGln: 2.939 ± 1.206
2.939LysArg: 2.939 ± 0.755
1.47LysSer: 1.47 ± 0.656
5.291LysThr: 5.291 ± 1.521
2.939LysVal: 2.939 ± 0.703
0.882LysTrp: 0.882 ± 0.517
2.939LysTyr: 2.939 ± 0.659
0.0LysXaa: 0.0 ± 0.0
Leu
5.291LeuAla: 5.291 ± 1.32
0.882LeuCys: 0.882 ± 0.393
6.467LeuAsp: 6.467 ± 1.536
3.527LeuGlu: 3.527 ± 1.398
3.527LeuPhe: 3.527 ± 1.721
3.821LeuGly: 3.821 ± 2.101
3.233LeuHis: 3.233 ± 1.087
5.585LeuIle: 5.585 ± 1.487
5.879LeuLys: 5.879 ± 1.667
6.467LeuLeu: 6.467 ± 1.23
2.352LeuMet: 2.352 ± 0.976
4.409LeuAsn: 4.409 ± 1.999
3.527LeuPro: 3.527 ± 0.789
1.764LeuGln: 1.764 ± 1.084
2.939LeuArg: 2.939 ± 0.906
4.997LeuSer: 4.997 ± 2.284
5.291LeuThr: 5.291 ± 0.81
3.821LeuVal: 3.821 ± 0.996
1.176LeuTrp: 1.176 ± 0.683
2.646LeuTyr: 2.646 ± 1.709
0.0LeuXaa: 0.0 ± 0.0
Met
2.058MetAla: 2.058 ± 0.729
0.294MetCys: 0.294 ± 0.159
2.058MetAsp: 2.058 ± 0.658
1.47MetGlu: 1.47 ± 0.56
0.294MetPhe: 0.294 ± 0.159
1.176MetGly: 1.176 ± 0.521
0.588MetHis: 0.588 ± 0.412
3.527MetIle: 3.527 ± 0.949
2.939MetLys: 2.939 ± 0.709
0.882MetLeu: 0.882 ± 0.529
0.588MetMet: 0.588 ± 0.356
2.352MetAsn: 2.352 ± 1.8
2.058MetPro: 2.058 ± 1.11
0.588MetGln: 0.588 ± 0.412
1.764MetArg: 1.764 ± 0.432
2.646MetSer: 2.646 ± 0.635
2.646MetThr: 2.646 ± 0.886
2.058MetVal: 2.058 ± 0.538
0.0MetTrp: 0.0 ± 0.0
0.588MetTyr: 0.588 ± 0.356
0.0MetXaa: 0.0 ± 0.0
Asn
3.527AsnAla: 3.527 ± 1.116
0.882AsnCys: 0.882 ± 0.476
2.352AsnAsp: 2.352 ± 0.575
1.764AsnGlu: 1.764 ± 0.64
4.703AsnPhe: 4.703 ± 0.982
4.409AsnGly: 4.409 ± 1.024
1.47AsnHis: 1.47 ± 0.793
2.939AsnIle: 2.939 ± 1.053
4.409AsnLys: 4.409 ± 0.445
3.233AsnLeu: 3.233 ± 0.845
0.588AsnMet: 0.588 ± 0.467
1.764AsnAsn: 1.764 ± 1.069
2.058AsnPro: 2.058 ± 0.435
1.47AsnGln: 1.47 ± 0.793
2.058AsnArg: 2.058 ± 0.658
1.176AsnSer: 1.176 ± 0.473
2.939AsnThr: 2.939 ± 3.182
3.233AsnVal: 3.233 ± 1.062
0.294AsnTrp: 0.294 ± 0.669
2.058AsnTyr: 2.058 ± 1.466
0.0AsnXaa: 0.0 ± 0.0
Pro
1.47ProAla: 1.47 ± 0.524
0.0ProCys: 0.0 ± 0.0
2.939ProAsp: 2.939 ± 1.136
3.233ProGlu: 3.233 ± 0.834
2.939ProPhe: 2.939 ± 1.04
2.352ProGly: 2.352 ± 1.87
0.882ProHis: 0.882 ± 0.412
3.821ProIle: 3.821 ± 1.327
4.115ProLys: 4.115 ± 1.138
2.939ProLeu: 2.939 ± 0.564
2.352ProMet: 2.352 ± 0.543
1.764ProAsn: 1.764 ± 0.365
3.527ProPro: 3.527 ± 1.306
0.588ProGln: 0.588 ± 0.412
1.764ProArg: 1.764 ± 0.64
2.939ProSer: 2.939 ± 0.811
5.585ProThr: 5.585 ± 1.156
2.058ProVal: 2.058 ± 0.924
0.588ProTrp: 0.588 ± 0.317
1.47ProTyr: 1.47 ± 0.689
0.0ProXaa: 0.0 ± 0.0
Gln
2.646GlnAla: 2.646 ± 1.427
0.588GlnCys: 0.588 ± 0.317
1.176GlnAsp: 1.176 ± 0.411
2.058GlnGlu: 2.058 ± 0.587
2.352GlnPhe: 2.352 ± 0.626
1.176GlnGly: 1.176 ± 0.634
2.646GlnHis: 2.646 ± 0.862
2.646GlnIle: 2.646 ± 1.056
3.233GlnLys: 3.233 ± 0.286
3.821GlnLeu: 3.821 ± 2.37
0.882GlnMet: 0.882 ± 0.778
0.294GlnAsn: 0.294 ± 0.486
2.058GlnPro: 2.058 ± 0.584
1.764GlnGln: 1.764 ± 0.494
0.588GlnArg: 0.588 ± 0.317
2.058GlnSer: 2.058 ± 0.906
1.764GlnThr: 1.764 ± 1.132
1.47GlnVal: 1.47 ± 0.357
0.294GlnTrp: 0.294 ± 0.486
0.882GlnTyr: 0.882 ± 0.696
0.0GlnXaa: 0.0 ± 0.0
Arg
2.646ArgAla: 2.646 ± 0.804
0.0ArgCys: 0.0 ± 0.0
3.233ArgAsp: 3.233 ± 0.957
2.939ArgGlu: 2.939 ± 0.959
1.47ArgPhe: 1.47 ± 1.509
1.47ArgGly: 1.47 ± 0.56
1.764ArgHis: 1.764 ± 0.642
4.409ArgIle: 4.409 ± 1.221
1.47ArgLys: 1.47 ± 0.524
2.939ArgLeu: 2.939 ± 0.583
2.646ArgMet: 2.646 ± 0.323
0.882ArgAsn: 0.882 ± 0.393
2.939ArgPro: 2.939 ± 1.3
1.764ArgGln: 1.764 ± 0.432
2.352ArgArg: 2.352 ± 0.738
1.176ArgSer: 1.176 ± 0.634
0.294ArgThr: 0.294 ± 0.159
1.176ArgVal: 1.176 ± 0.632
0.882ArgTrp: 0.882 ± 0.412
2.352ArgTyr: 2.352 ± 0.926
0.0ArgXaa: 0.0 ± 0.0
Ser
3.821SerAla: 3.821 ± 0.816
1.47SerCys: 1.47 ± 0.566
1.764SerAsp: 1.764 ± 1.058
3.527SerGlu: 3.527 ± 1.346
5.585SerPhe: 5.585 ± 1.505
2.646SerGly: 2.646 ± 0.598
0.588SerHis: 0.588 ± 0.317
2.939SerIle: 2.939 ± 1.418
5.291SerLys: 5.291 ± 0.716
4.997SerLeu: 4.997 ± 1.249
1.764SerMet: 1.764 ± 0.691
2.939SerAsn: 2.939 ± 0.576
2.058SerPro: 2.058 ± 1.198
0.294SerGln: 0.294 ± 0.669
3.233SerArg: 3.233 ± 1.118
4.409SerSer: 4.409 ± 1.829
3.821SerThr: 3.821 ± 1.445
3.527SerVal: 3.527 ± 1.967
1.176SerTrp: 1.176 ± 0.709
2.058SerTyr: 2.058 ± 0.789
0.0SerXaa: 0.0 ± 0.0
Thr
5.291ThrAla: 5.291 ± 0.917
1.47ThrCys: 1.47 ± 0.434
2.352ThrAsp: 2.352 ± 0.474
3.821ThrGlu: 3.821 ± 1.424
3.821ThrPhe: 3.821 ± 0.764
4.115ThrGly: 4.115 ± 0.847
0.882ThrHis: 0.882 ± 0.393
4.115ThrIle: 4.115 ± 0.949
2.939ThrLys: 2.939 ± 1.393
8.23ThrLeu: 8.23 ± 1.628
0.882ThrMet: 0.882 ± 0.412
4.703ThrAsn: 4.703 ± 0.54
3.821ThrPro: 3.821 ± 1.626
2.352ThrGln: 2.352 ± 1.154
2.058ThrArg: 2.058 ± 0.781
5.585ThrSer: 5.585 ± 1.437
4.115ThrThr: 4.115 ± 0.744
3.233ThrVal: 3.233 ± 0.979
0.588ThrTrp: 0.588 ± 0.317
3.233ThrTyr: 3.233 ± 1.515
0.0ThrXaa: 0.0 ± 0.0
Val
2.939ValAla: 2.939 ± 0.971
1.176ValCys: 1.176 ± 0.521
3.821ValAsp: 3.821 ± 0.926
2.939ValGlu: 2.939 ± 0.811
2.939ValPhe: 2.939 ± 0.534
3.527ValGly: 3.527 ± 0.874
2.352ValHis: 2.352 ± 1.269
4.703ValIle: 4.703 ± 0.822
2.058ValLys: 2.058 ± 0.658
3.821ValLeu: 3.821 ± 0.794
1.764ValMet: 1.764 ± 0.62
2.352ValAsn: 2.352 ± 0.47
4.703ValPro: 4.703 ± 1.05
2.058ValGln: 2.058 ± 0.538
2.352ValArg: 2.352 ± 1.898
4.703ValSer: 4.703 ± 1.233
2.058ValThr: 2.058 ± 0.663
4.115ValVal: 4.115 ± 1.149
0.588ValTrp: 0.588 ± 0.891
1.764ValTyr: 1.764 ± 0.642
0.0ValXaa: 0.0 ± 0.0
Trp
0.882TrpAla: 0.882 ± 0.393
0.294TrpCys: 0.294 ± 0.159
0.588TrpAsp: 0.588 ± 0.356
1.176TrpGlu: 1.176 ± 0.634
0.0TrpPhe: 0.0 ± 0.0
0.294TrpGly: 0.294 ± 0.426
0.0TrpHis: 0.0 ± 0.0
0.882TrpIle: 0.882 ± 0.481
0.588TrpLys: 0.588 ± 0.317
0.882TrpLeu: 0.882 ± 1.045
0.294TrpMet: 0.294 ± 0.392
0.588TrpAsn: 0.588 ± 0.317
0.0TrpPro: 0.0 ± 0.0
0.882TrpGln: 0.882 ± 1.243
0.882TrpArg: 0.882 ± 0.517
0.0TrpSer: 0.0 ± 0.0
1.47TrpThr: 1.47 ± 0.96
0.588TrpVal: 0.588 ± 0.317
0.0TrpTrp: 0.0 ± 0.0
0.294TrpTyr: 0.294 ± 0.426
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.646TyrAla: 2.646 ± 1.709
1.176TyrCys: 1.176 ± 0.873
1.47TyrAsp: 1.47 ± 0.689
4.115TyrGlu: 4.115 ± 0.438
1.764TyrPhe: 1.764 ± 1.218
2.058TyrGly: 2.058 ± 0.747
0.588TyrHis: 0.588 ± 0.317
1.47TyrIle: 1.47 ± 0.656
2.058TyrLys: 2.058 ± 0.978
2.646TyrLeu: 2.646 ± 0.508
1.764TyrMet: 1.764 ± 1.027
2.058TyrAsn: 2.058 ± 0.997
1.47TyrPro: 1.47 ± 0.985
1.764TyrGln: 1.764 ± 0.637
3.821TyrArg: 3.821 ± 1.257
1.176TyrSer: 1.176 ± 0.83
2.646TyrThr: 2.646 ± 0.618
1.176TyrVal: 1.176 ± 0.566
0.882TyrTrp: 0.882 ± 0.769
1.176TyrTyr: 1.176 ± 0.566
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3403 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski