Amino acid dipeptide frequency for Hubei virga-like virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
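As a rough illustration of how such frequencies can be computed, the following Python sketch counts overlapping residue pairs in a list of one-letter protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the toy sequences, and the choice to count pairs within each protein only are assumptions made for this example; the page does not state how the database treats protein boundaries.

```python
from collections import Counter
from itertools import product

# One-letter to three-letter codes, in the column order used by the table.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}
# Xaa collects every non-standard or unknown residue.
RESIDUES = list(ONE_TO_THREE.values()) + ["Xaa"]


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences.

    Assumption (hypothetical, not taken from the database): overlapping
    pairs are counted within each protein, never across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(RESIDUES, repeat=2)
    }


# Toy sequences for illustration only, not the real proteome.
freqs = dipeptide_per_mille(["MALLK", "ALX"])
print(len(freqs))                  # 441 keys (21 x 21)
print(round(freqs["AlaLeu"], 1))   # AlaLeu occurs in 2 of the 6 pairs
```

Returning all 441 keys, including the zero entries, mirrors the layout of the table, where absent dipeptides are listed with a frequency of 0.0.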

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
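The unit conversion and the comparison with the uniform expectation can be sketched in a few lines of Python. The helper names are hypothetical, and the sketch assumes that "expected" means the uniform 0.25% (2.5‰) baseline described above; the three example values are taken from the table.

```python
# Uniform expectation: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value):
    """Convert a per mille value from the table into a plain fraction."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a per mille value relative to the assumed uniform baseline."""
    if value > expected:
        return "over"    # would be marked red
    if value < expected:
        return "under"   # would be marked blue
    return "as expected"


for name, value in [("LeuLeu", 9.251), ("AlaAla", 4.626), ("CysIle", 0.289)]:
    print(f"{name}: fraction {per_mille_to_fraction(value):.6f}, {classify(value)}")
```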

For more information, see the sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.626 ± 1.063
AlaCys: 1.156 ± 0.641
AlaAsp: 4.915 ± 0.903
AlaGlu: 2.891 ± 1.384
AlaPhe: 1.156 ± 0.554
AlaGly: 2.024 ± 1.195
AlaHis: 2.602 ± 1.213
AlaIle: 4.915 ± 1.216
AlaLys: 4.626 ± 2.215
AlaLeu: 5.493 ± 0.991
AlaMet: 2.891 ± 1.384
AlaAsn: 3.18 ± 1.115
AlaPro: 1.735 ± 0.831
AlaGln: 1.446 ± 0.692
AlaArg: 3.18 ± 1.552
AlaSer: 3.758 ± 1.051
AlaThr: 2.891 ± 1.039
AlaVal: 5.493 ± 1.815
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.446 ± 0.692
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.024 ± 0.969
CysCys: 0.867 ± 1.028
CysAsp: 2.024 ± 0.871
CysGlu: 1.156 ± 1.274
CysPhe: 1.446 ± 0.692
CysGly: 2.313 ± 1.923
CysHis: 0.867 ± 0.415
CysIle: 0.289 ± 0.138
CysLys: 1.156 ± 0.554
CysLeu: 1.156 ± 0.554
CysMet: 0.578 ± 1.731
CysAsn: 0.289 ± 0.138
CysPro: 1.446 ± 2.133
CysGln: 0.289 ± 0.138
CysArg: 0.867 ± 0.415
CysSer: 2.313 ± 0.883
CysThr: 1.735 ± 0.545
CysVal: 0.867 ± 0.415
CysTrp: 0.0 ± 0.0
CysTyr: 1.156 ± 0.962
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.36 ± 2.214
AspCys: 0.578 ± 0.277
AspAsp: 6.649 ± 2.071
AspGlu: 3.469 ± 1.118
AspPhe: 2.313 ± 1.107
AspGly: 4.626 ± 0.982
AspHis: 1.156 ± 0.962
AspIle: 3.469 ± 0.935
AspLys: 3.758 ± 1.44
AspLeu: 6.36 ± 1.841
AspMet: 2.602 ± 0.819
AspAsn: 2.313 ± 1.199
AspPro: 3.18 ± 1.552
AspGln: 1.735 ± 0.545
AspArg: 3.18 ± 0.826
AspSer: 6.071 ± 1.622
AspThr: 5.493 ± 1.575
AspVal: 6.938 ± 3.322
AspTrp: 0.289 ± 0.138
AspTyr: 2.024 ± 1.195
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.758 ± 1.44
GluCys: 0.867 ± 0.415
GluAsp: 1.446 ± 0.578
GluGlu: 2.602 ± 2.628
GluPhe: 2.602 ± 1.22
GluGly: 3.469 ± 1.441
GluHis: 0.289 ± 0.138
GluIle: 2.024 ± 1.195
GluLys: 3.469 ± 1.315
GluLeu: 3.18 ± 0.826
GluMet: 1.156 ± 0.641
GluAsn: 3.469 ± 1.315
GluPro: 2.024 ± 1.361
GluGln: 0.289 ± 0.138
GluArg: 5.493 ± 0.907
GluSer: 4.626 ± 3.598
GluThr: 4.047 ± 1.181
GluVal: 4.915 ± 1.642
GluTrp: 0.867 ± 0.415
GluTyr: 3.469 ± 0.781
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.024 ± 0.969
PheCys: 2.024 ± 0.969
PheAsp: 2.891 ± 0.727
PheGlu: 2.891 ± 1.667
PhePhe: 1.735 ± 0.831
PheGly: 1.735 ± 0.831
PheHis: 0.578 ± 0.277
PheIle: 1.446 ± 1.233
PheLys: 1.446 ± 0.912
PheLeu: 3.469 ± 0.935
PheMet: 0.867 ± 0.725
PheAsn: 0.867 ± 0.415
PhePro: 2.024 ± 0.545
PheGln: 1.156 ± 0.554
PheArg: 2.024 ± 0.871
PheSer: 3.469 ± 1.661
PheThr: 2.313 ± 0.58
PheVal: 3.18 ± 1.523
PheTrp: 0.0 ± 0.0
PheTyr: 1.446 ± 1.233
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.313 ± 0.883
GlyCys: 0.867 ± 1.028
GlyAsp: 4.047 ± 1.09
GlyGlu: 2.024 ± 2.599
GlyPhe: 0.578 ± 0.277
GlyGly: 4.047 ± 0.906
GlyHis: 0.867 ± 0.725
GlyIle: 2.024 ± 2.599
GlyLys: 5.204 ± 1.761
GlyLeu: 3.469 ± 1.118
GlyMet: 2.313 ± 1.107
GlyAsn: 2.602 ± 0.643
GlyPro: 2.024 ± 1.361
GlyGln: 1.446 ± 1.801
GlyArg: 1.446 ± 1.505
GlySer: 2.602 ± 1.785
GlyThr: 3.18 ± 0.827
GlyVal: 4.047 ± 1.172
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.891 ± 1.384
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.867 ± 0.415
HisCys: 0.867 ± 0.415
HisAsp: 3.758 ± 1.747
HisGlu: 1.735 ± 0.831
HisPhe: 1.446 ± 0.692
HisGly: 0.867 ± 0.725
HisHis: 0.578 ± 0.277
HisIle: 1.446 ± 0.912
HisLys: 2.313 ± 1.538
HisLeu: 1.446 ± 0.692
HisMet: 0.0 ± 0.0
HisAsn: 0.289 ± 0.932
HisPro: 1.446 ± 0.912
HisGln: 0.289 ± 0.138
HisArg: 1.156 ± 0.554
HisSer: 1.735 ± 0.831
HisThr: 2.891 ± 1.688
HisVal: 1.735 ± 1.449
HisTrp: 0.578 ± 0.277
HisTyr: 1.446 ± 1.233
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.469 ± 1.661
IleCys: 0.867 ± 0.415
IleAsp: 2.891 ± 0.969
IleGlu: 4.337 ± 2.205
IlePhe: 2.024 ± 0.969
IleGly: 2.313 ± 1.2
IleHis: 1.446 ± 1.233
IleIle: 4.337 ± 1.691
IleLys: 1.446 ± 0.692
IleLeu: 4.047 ± 1.789
IleMet: 1.446 ± 0.692
IleAsn: 2.024 ± 0.969
IlePro: 1.156 ± 0.641
IleGln: 0.578 ± 0.277
IleArg: 4.915 ± 0.903
IleSer: 2.891 ± 0.727
IleThr: 3.758 ± 2.757
IleVal: 5.204 ± 3.361
IleTrp: 0.867 ± 1.328
IleTyr: 1.446 ± 0.578
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.735 ± 0.545
LysCys: 1.735 ± 0.881
LysAsp: 2.891 ± 1.384
LysGlu: 2.024 ± 0.969
LysPhe: 5.493 ± 1.881
LysGly: 2.024 ± 0.969
LysHis: 1.156 ± 0.554
LysIle: 2.313 ± 1.538
LysLys: 2.602 ± 3.084
LysLeu: 4.626 ± 0.961
LysMet: 2.024 ± 1.986
LysAsn: 1.735 ± 0.831
LysPro: 2.891 ± 0.969
LysGln: 1.735 ± 0.881
LysArg: 4.915 ± 2.895
LysSer: 5.782 ± 1.517
LysThr: 2.024 ± 0.545
LysVal: 4.915 ± 2.353
LysTrp: 0.867 ± 1.028
LysTyr: 5.204 ± 0.985
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.891 ± 1.157
LeuCys: 0.578 ± 1.108
LeuAsp: 4.626 ± 0.961
LeuGlu: 4.047 ± 2.082
LeuPhe: 4.047 ± 1.09
LeuGly: 4.337 ± 2.076
LeuHis: 2.602 ± 0.917
LeuIle: 3.18 ± 1.523
LeuLys: 6.938 ± 2.517
LeuLeu: 9.251 ± 4.43
LeuMet: 2.891 ± 1.384
LeuAsn: 2.024 ± 0.871
LeuPro: 4.337 ± 2.076
LeuGln: 2.313 ± 2.548
LeuArg: 6.36 ± 2.23
LeuSer: 6.938 ± 2.484
LeuThr: 6.071 ± 1.781
LeuVal: 6.071 ± 1.359
LeuTrp: 0.867 ± 0.415
LeuTyr: 3.758 ± 1.209
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.18 ± 1.523
MetCys: 0.867 ± 2.005
MetAsp: 1.156 ± 0.641
MetGlu: 1.735 ± 1.276
MetPhe: 0.578 ± 0.277
MetGly: 0.289 ± 0.138
MetHis: 1.446 ± 0.692
MetIle: 2.891 ± 0.969
MetLys: 2.024 ± 0.969
MetLeu: 4.915 ± 2.353
MetMet: 2.602 ± 0.917
MetAsn: 1.446 ± 0.578
MetPro: 1.446 ± 0.912
MetGln: 0.578 ± 0.277
MetArg: 3.18 ± 1.523
MetSer: 2.024 ± 1.361
MetThr: 2.891 ± 0.892
MetVal: 1.156 ± 0.554
MetTrp: 0.578 ± 0.277
MetTyr: 1.446 ± 0.692
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.758 ± 1.081
AsnCys: 0.867 ± 0.415
AsnAsp: 2.602 ± 0.643
AsnGlu: 2.891 ± 1.256
AsnPhe: 2.313 ± 0.58
AsnGly: 2.313 ± 1.065
AsnHis: 0.289 ± 0.138
AsnIle: 2.313 ± 1.107
AsnLys: 2.024 ± 1.361
AsnLeu: 3.469 ± 0.781
AsnMet: 2.024 ± 0.969
AsnAsn: 2.602 ± 1.246
AsnPro: 1.156 ± 0.554
AsnGln: 1.156 ± 0.554
AsnArg: 2.602 ± 2.783
AsnSer: 1.156 ± 1.274
AsnThr: 2.891 ± 1.667
AsnVal: 2.891 ± 1.384
AsnTrp: 0.867 ± 0.415
AsnTyr: 1.735 ± 1.276
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.024 ± 2.04
ProCys: 0.289 ± 0.138
ProAsp: 3.469 ± 0.781
ProGlu: 3.18 ± 2.548
ProPhe: 0.867 ± 0.415
ProGly: 2.024 ± 0.871
ProHis: 1.156 ± 0.962
ProIle: 3.18 ± 0.979
ProLys: 1.735 ± 2.162
ProLeu: 1.446 ± 0.578
ProMet: 1.735 ± 0.831
ProAsn: 2.313 ± 0.883
ProPro: 2.024 ± 3.686
ProGln: 1.156 ± 0.641
ProArg: 2.602 ± 1.246
ProSer: 3.18 ± 3.562
ProThr: 4.626 ± 1.16
ProVal: 2.313 ± 0.883
ProTrp: 0.578 ± 0.277
ProTyr: 1.156 ± 0.554
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.024 ± 1.294
GlnCys: 1.156 ± 0.962
GlnAsp: 1.446 ± 1.233
GlnGlu: 0.289 ± 0.138
GlnPhe: 0.289 ± 0.932
GlnGly: 1.446 ± 0.692
GlnHis: 1.446 ± 0.692
GlnIle: 1.446 ± 0.578
GlnLys: 1.156 ± 1.274
GlnLeu: 2.313 ± 1.107
GlnMet: 0.578 ± 0.733
GlnAsn: 0.867 ± 0.415
GlnPro: 1.156 ± 0.554
GlnGln: 0.578 ± 2.941
GlnArg: 2.602 ± 1.22
GlnSer: 0.867 ± 0.415
GlnThr: 1.156 ± 2.739
GlnVal: 1.156 ± 0.554
GlnTrp: 0.289 ± 0.138
GlnTyr: 0.867 ± 0.415
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.156 ± 0.641
ArgCys: 1.735 ± 0.831
ArgAsp: 4.047 ± 1.282
ArgGlu: 3.469 ± 1.923
ArgPhe: 3.469 ± 1.09
ArgGly: 1.446 ± 1.505
ArgHis: 2.313 ± 1.282
ArgIle: 3.758 ± 1.852
ArgLys: 4.337 ± 1.61
ArgLeu: 6.36 ± 2.321
ArgMet: 2.891 ± 0.727
ArgAsn: 3.18 ± 0.826
ArgPro: 1.735 ± 1.276
ArgGln: 3.18 ± 2.0
ArgArg: 5.204 ± 1.634
ArgSer: 4.626 ± 2.852
ArgThr: 3.469 ± 0.935
ArgVal: 6.36 ± 1.407
ArgTrp: 1.156 ± 1.618
ArgTyr: 4.337 ± 1.691
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.204 ± 3.361
SerCys: 2.024 ± 2.678
SerAsp: 6.36 ± 2.056
SerGlu: 3.469 ± 0.935
SerPhe: 1.735 ± 1.206
SerGly: 4.337 ± 0.924
SerHis: 2.602 ± 0.973
SerIle: 2.024 ± 0.545
SerLys: 4.337 ± 1.708
SerLeu: 5.493 ± 1.815
SerMet: 2.602 ± 0.643
SerAsn: 2.024 ± 2.367
SerPro: 1.735 ± 2.47
SerGln: 0.867 ± 0.415
SerArg: 6.938 ± 3.436
SerSer: 2.891 ± 1.688
SerThr: 5.204 ± 2.785
SerVal: 6.649 ± 1.759
SerTrp: 0.578 ± 1.394
SerTyr: 3.758 ± 2.322
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.493 ± 2.024
ThrCys: 1.446 ± 0.912
ThrAsp: 6.938 ± 2.615
ThrGlu: 4.626 ± 0.982
ThrPhe: 1.735 ± 0.545
ThrGly: 2.891 ± 2.665
ThrHis: 1.735 ± 1.706
ThrIle: 3.18 ± 2.0
ThrLys: 2.024 ± 1.167
ThrLeu: 4.626 ± 2.463
ThrMet: 3.469 ± 1.118
ThrAsn: 3.758 ± 1.966
ThrPro: 4.047 ± 4.242
ThrGln: 0.578 ± 0.277
ThrArg: 4.626 ± 1.122
ThrSer: 4.337 ± 4.635
ThrThr: 5.204 ± 3.633
ThrVal: 4.337 ± 1.297
ThrTrp: 0.578 ± 0.277
ThrTyr: 3.469 ± 1.315
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.626 ± 2.215
ValCys: 3.18 ± 1.523
ValAsp: 4.915 ± 1.552
ValGlu: 4.337 ± 1.297
ValPhe: 2.024 ± 0.969
ValGly: 3.18 ± 0.826
ValHis: 2.024 ± 0.969
ValIle: 5.493 ± 1.167
ValLys: 4.047 ± 1.938
ValLeu: 7.517 ± 2.778
ValMet: 1.735 ± 0.831
ValAsn: 4.047 ± 1.938
ValPro: 3.469 ± 1.762
ValGln: 1.446 ± 0.692
ValArg: 3.758 ± 1.852
ValSer: 6.938 ± 2.179
ValThr: 5.204 ± 2.492
ValVal: 5.204 ± 2.492
ValTrp: 0.578 ± 0.823
ValTyr: 2.602 ± 1.213
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.578 ± 0.277
TrpCys: 0.0 ± 0.0
TrpAsp: 1.156 ± 0.962
TrpGlu: 0.867 ± 0.415
TrpPhe: 0.578 ± 1.394
TrpGly: 0.0 ± 0.0
TrpHis: 0.289 ± 0.138
TrpIle: 0.578 ± 0.277
TrpLys: 0.0 ± 0.0
TrpLeu: 1.735 ± 1.206
TrpMet: 0.289 ± 0.138
TrpAsn: 0.867 ± 0.725
TrpPro: 0.0 ± 0.0
TrpGln: 0.289 ± 0.138
TrpArg: 0.867 ± 1.328
TrpSer: 1.156 ± 0.554
TrpThr: 0.578 ± 0.823
TrpVal: 0.289 ± 0.138
TrpTrp: 0.289 ± 0.138
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.313 ± 1.107
TyrCys: 1.156 ± 0.554
TyrAsp: 4.337 ± 2.492
TyrGlu: 2.024 ± 1.294
TyrPhe: 1.156 ± 0.554
TyrGly: 2.313 ± 1.2
TyrHis: 1.446 ± 0.912
TyrIle: 1.156 ± 0.641
TyrLys: 4.047 ± 1.773
TyrLeu: 3.758 ± 1.799
TyrMet: 1.446 ± 0.692
TyrAsn: 2.024 ± 0.545
TyrPro: 2.024 ± 0.969
TyrGln: 2.024 ± 1.618
TyrArg: 2.313 ± 1.107
TyrSer: 3.469 ± 3.395
TyrThr: 3.469 ± 1.926
TyrVal: 2.602 ± 1.246
TyrTrp: 0.578 ± 0.277
TyrTyr: 2.313 ± 1.065
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3460 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
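A protein-level bootstrap of this kind might look like the Python sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. Using the standard deviation of the replicates as the error, the function names, the fixed seed, and the toy sequences are all assumptions for illustration; the page does not describe the exact procedure used by the database.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency (per mille) of one two-letter pair over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the pair frequency when whole proteins are resampled.

    Assumption: the error is the standard deviation over n_rounds replicates.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_rounds):
        # Resample at the protein level: draw proteins with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(replicates)


# Toy one-letter sequences for illustration only, not the real proteome.
toy = ["MALLKAL", "GGSALG", "LLLLAK", "MKTAYI"]
print(f"AL: {pair_per_mille(toy, 'AL'):.3f} ± {bootstrap_error(toy, 'AL'):.3f}")
```

A pair that occurs in none of the proteins has a frequency of zero in every replicate, so its estimated error is zero as well, which is consistent with table entries such as AlaTrp: 0.0 ± 0.0.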

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski