Amino acid dipeptide frequency for Hubei orthoptera virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
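As a rough illustration of what such a calculation involves, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille. It is not the database's own code: the function name `dipeptide_per_mille`, the use of one-letter codes, and the choice to normalize by the total number of pairs within proteins are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are overlapping and never span two proteins, and the
    denominator is the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        # "MAAL" yields the overlapping pairs MA, AA, AL.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome described here.
freqs = dipeptide_per_mille(["MAAL", "ALK"])
```

With this toy input five pairs are counted in total (MA, AA, AL, AL, LK), so `freqs["AL"]` is 400 per mille.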

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
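To relate the 0.25% expectation to the units used in the table, the sketch below converts it to per mille (2.5‰) and labels a value as over- or underrepresented. The exact rule the page uses for coloring is not stated, so the plain comparison against 2.5‰ and the names `EXPECTED_PER_MILLE` and `classify` are assumptions for illustration only.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25 %


def classify(value_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value_per_mille > expected:
        return "red"      # more frequent than expected
    if value_per_mille < expected:
        return "blue"     # underrepresented
    return "neutral"


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


# Two values taken from the table: AlaAla and AlaCys.
ala_ala = classify(7.018)
ala_cys = classify(0.702)
```

Under this assumed rule, AlaAla (7.018‰) would be marked red and AlaCys (0.702‰) blue.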

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.018 ± 1.879
AlaCys: 0.702 ± 0.721
AlaAsp: 2.807 ± 0.752
AlaGlu: 2.807 ± 0.315
AlaPhe: 4.912 ± 0.285
AlaGly: 4.912 ± 0.285
AlaHis: 0.702 ± 0.346
AlaIle: 2.105 ± 0.03
AlaLys: 5.614 ± 0.63
AlaLeu: 9.123 ± 2.976
AlaMet: 3.509 ± 0.406
AlaAsn: 5.614 ± 0.63
AlaPro: 4.912 ± 0.782
AlaGln: 0.702 ± 0.721
AlaArg: 4.211 ± 1.128
AlaSer: 2.807 ± 0.752
AlaThr: 4.912 ± 2.419
AlaVal: 6.316 ± 1.158
AlaTrp: 2.105 ± 0.03
AlaTyr: 2.105 ± 1.037
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.702 ± 0.346
CysCys: 0.702 ± 0.346
CysAsp: 0.702 ± 0.346
CysGlu: 0.0 ± 0.0
CysPhe: 0.702 ± 0.346
CysGly: 0.702 ± 0.346
CysHis: 0.0 ± 0.0
CysIle: 0.702 ± 0.346
CysLys: 0.702 ± 0.346
CysLeu: 1.404 ± 0.376
CysMet: 2.105 ± 2.164
CysAsn: 0.702 ± 0.346
CysPro: 0.0 ± 0.0
CysGln: 0.702 ± 0.346
CysArg: 2.807 ± 1.819
CysSer: 0.702 ± 0.346
CysThr: 0.702 ± 0.721
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.404 ± 0.376
AspCys: 0.702 ± 0.346
AspAsp: 3.509 ± 0.661
AspGlu: 4.912 ± 0.285
AspPhe: 0.702 ± 0.721
AspGly: 7.719 ± 0.6
AspHis: 2.105 ± 0.03
AspIle: 4.211 ± 2.194
AspLys: 2.807 ± 0.752
AspLeu: 6.316 ± 0.091
AspMet: 0.702 ± 0.346
AspAsn: 2.105 ± 2.164
AspPro: 4.912 ± 0.285
AspGln: 2.105 ± 0.03
AspArg: 0.702 ± 0.721
AspSer: 4.912 ± 1.352
AspThr: 2.807 ± 0.752
AspVal: 5.614 ± 0.436
AspTrp: 0.702 ± 0.721
AspTyr: 3.509 ± 1.728
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.316 ± 2.043
GluCys: 0.0 ± 0.0
GluAsp: 2.105 ± 1.097
GluGlu: 1.404 ± 0.691
GluPhe: 4.211 ± 1.128
GluGly: 0.0 ± 0.0
GluHis: 0.702 ± 0.346
GluIle: 0.702 ± 0.346
GluLys: 2.105 ± 0.03
GluLeu: 5.614 ± 0.63
GluMet: 0.0 ± 0.0
GluAsn: 0.702 ± 0.346
GluPro: 4.211 ± 2.073
GluGln: 2.105 ± 0.03
GluArg: 2.105 ± 0.03
GluSer: 1.404 ± 0.376
GluThr: 4.211 ± 1.128
GluVal: 2.807 ± 0.315
GluTrp: 1.404 ± 0.691
GluTyr: 1.404 ± 1.443
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.702 ± 0.721
PheCys: 0.702 ± 0.721
PheAsp: 1.404 ± 0.376
PheGlu: 0.702 ± 0.346
PhePhe: 0.702 ± 0.346
PheGly: 2.807 ± 2.885
PheHis: 3.509 ± 0.661
PheIle: 2.105 ± 1.037
PheLys: 1.404 ± 0.691
PheLeu: 4.211 ± 1.006
PheMet: 0.702 ± 0.263
PheAsn: 4.211 ± 0.061
PhePro: 3.509 ± 0.661
PheGln: 1.404 ± 0.691
PheArg: 2.105 ± 1.037
PheSer: 1.404 ± 1.443
PheThr: 2.105 ± 0.03
PheVal: 1.404 ± 0.691
PheTrp: 0.0 ± 0.0
PheTyr: 0.702 ± 0.346
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.614 ± 0.436
GlyCys: 0.0 ± 0.0
GlyAsp: 5.614 ± 0.63
GlyGlu: 0.702 ± 0.346
GlyPhe: 1.404 ± 0.691
GlyGly: 2.105 ± 1.037
GlyHis: 0.702 ± 0.346
GlyIle: 2.807 ± 2.885
GlyLys: 4.912 ± 0.285
GlyLeu: 4.211 ± 1.006
GlyMet: 1.404 ± 0.376
GlyAsn: 7.719 ± 0.467
GlyPro: 2.807 ± 0.752
GlyGln: 1.404 ± 0.376
GlyArg: 5.614 ± 0.63
GlySer: 4.912 ± 0.285
GlyThr: 5.614 ± 1.697
GlyVal: 3.509 ± 0.406
GlyTrp: 0.702 ± 0.721
GlyTyr: 2.807 ± 0.315
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.702 ± 0.721
HisCys: 0.0 ± 0.0
HisAsp: 0.702 ± 0.346
HisGlu: 0.702 ± 0.346
HisPhe: 0.0 ± 0.0
HisGly: 1.404 ± 0.691
HisHis: 0.0 ± 0.0
HisIle: 0.702 ± 0.346
HisLys: 2.105 ± 1.037
HisLeu: 3.509 ± 0.661
HisMet: 0.702 ± 0.346
HisAsn: 0.702 ± 0.721
HisPro: 1.404 ± 0.376
HisGln: 0.0 ± 0.0
HisArg: 0.702 ± 0.346
HisSer: 3.509 ± 0.661
HisThr: 0.0 ± 0.0
HisVal: 2.807 ± 0.315
HisTrp: 0.702 ± 0.346
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.807 ± 1.819
IleCys: 0.702 ± 0.346
IleAsp: 1.404 ± 0.376
IleGlu: 2.807 ± 0.315
IlePhe: 2.105 ± 1.037
IleGly: 4.211 ± 1.128
IleHis: 0.0 ± 0.0
IleIle: 2.807 ± 1.382
IleLys: 3.509 ± 1.728
IleLeu: 2.807 ± 0.752
IleMet: 0.702 ± 0.346
IleAsn: 2.807 ± 0.752
IlePro: 2.807 ± 0.752
IleGln: 1.404 ± 0.376
IleArg: 3.509 ± 1.473
IleSer: 2.105 ± 1.097
IleThr: 5.614 ± 0.63
IleVal: 3.509 ± 0.661
IleTrp: 0.702 ± 0.346
IleTyr: 1.404 ± 1.443
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.912 ± 1.352
LysCys: 0.702 ± 0.346
LysAsp: 7.018 ± 0.255
LysGlu: 2.105 ± 0.03
LysPhe: 2.105 ± 0.03
LysGly: 3.509 ± 1.728
LysHis: 1.404 ± 0.691
LysIle: 4.912 ± 0.782
LysLys: 0.0 ± 0.0
LysLeu: 6.316 ± 0.091
LysMet: 1.404 ± 1.443
LysAsn: 2.105 ± 1.037
LysPro: 2.807 ± 1.382
LysGln: 4.211 ± 2.073
LysArg: 2.105 ± 1.097
LysSer: 5.614 ± 1.697
LysThr: 4.912 ± 1.352
LysVal: 3.509 ± 1.728
LysTrp: 2.105 ± 1.037
LysTyr: 1.404 ± 0.376
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.421 ± 0.121
LeuCys: 0.0 ± 0.0
LeuAsp: 7.018 ± 0.812
LeuGlu: 4.211 ± 0.061
LeuPhe: 2.105 ± 1.097
LeuGly: 6.316 ± 0.091
LeuHis: 2.105 ± 0.03
LeuIle: 3.509 ± 0.661
LeuLys: 5.614 ± 2.764
LeuLeu: 6.316 ± 1.158
LeuMet: 1.404 ± 0.691
LeuAsn: 4.211 ± 0.061
LeuPro: 6.316 ± 1.158
LeuGln: 2.807 ± 1.382
LeuArg: 6.316 ± 1.158
LeuSer: 7.719 ± 0.6
LeuThr: 7.018 ± 1.321
LeuVal: 4.211 ± 0.061
LeuTrp: 1.404 ± 0.376
LeuTyr: 0.702 ± 0.346
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.404 ± 0.691
MetCys: 1.404 ± 0.376
MetAsp: 0.702 ± 0.721
MetGlu: 1.404 ± 0.376
MetPhe: 0.702 ± 0.346
MetGly: 0.702 ± 0.721
MetHis: 0.702 ± 0.346
MetIle: 1.404 ± 0.376
MetLys: 1.404 ± 1.443
MetLeu: 0.702 ± 0.346
MetMet: 0.0 ± 0.0
MetAsn: 0.702 ± 0.346
MetPro: 0.0 ± 0.0
MetGln: 0.0 ± 0.0
MetArg: 1.404 ± 1.443
MetSer: 3.509 ± 0.406
MetThr: 1.404 ± 0.691
MetVal: 2.105 ± 1.097
MetTrp: 0.0 ± 0.0
MetTyr: 1.404 ± 0.691
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.211 ± 0.061
AsnCys: 1.404 ± 0.691
AsnAsp: 2.807 ± 0.315
AsnGlu: 1.404 ± 0.376
AsnPhe: 2.105 ± 0.03
AsnGly: 4.912 ± 0.285
AsnHis: 0.702 ± 0.721
AsnIle: 2.105 ± 0.03
AsnLys: 4.912 ± 0.285
AsnLeu: 4.912 ± 1.849
AsnMet: 0.0 ± 0.0
AsnAsn: 2.105 ± 0.03
AsnPro: 2.105 ± 1.037
AsnGln: 2.807 ± 0.315
AsnArg: 1.404 ± 1.443
AsnSer: 2.105 ± 1.097
AsnThr: 4.211 ± 0.061
AsnVal: 4.211 ± 1.006
AsnTrp: 0.702 ± 0.721
AsnTyr: 1.404 ± 0.691
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.912 ± 1.849
ProCys: 1.404 ± 0.376
ProAsp: 3.509 ± 1.473
ProGlu: 2.105 ± 0.03
ProPhe: 3.509 ± 1.473
ProGly: 3.509 ± 0.661
ProHis: 0.702 ± 0.721
ProIle: 7.018 ± 0.812
ProLys: 6.316 ± 1.158
ProLeu: 4.912 ± 2.419
ProMet: 0.0 ± 0.0
ProAsn: 2.105 ± 0.03
ProPro: 3.509 ± 0.406
ProGln: 2.105 ± 1.037
ProArg: 2.807 ± 0.315
ProSer: 4.211 ± 1.006
ProThr: 6.316 ± 0.976
ProVal: 3.509 ± 0.406
ProTrp: 0.702 ± 0.346
ProTyr: 2.105 ± 0.03
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.807 ± 0.752
GlnCys: 0.0 ± 0.0
GlnAsp: 3.509 ± 0.661
GlnGlu: 0.702 ± 0.721
GlnPhe: 0.0 ± 0.0
GlnGly: 0.702 ± 0.721
GlnHis: 1.404 ± 0.691
GlnIle: 0.0 ± 0.0
GlnLys: 2.105 ± 1.037
GlnLeu: 3.509 ± 0.661
GlnMet: 0.0 ± 0.0
GlnAsn: 1.404 ± 0.691
GlnPro: 2.105 ± 0.03
GlnGln: 0.702 ± 0.346
GlnArg: 2.807 ± 0.315
GlnSer: 0.702 ± 0.346
GlnThr: 2.105 ± 0.03
GlnVal: 1.404 ± 0.691
GlnTrp: 1.404 ± 1.443
GlnTyr: 0.702 ± 0.346
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.211 ± 1.128
ArgCys: 0.0 ± 0.0
ArgAsp: 2.105 ± 1.097
ArgGlu: 4.211 ± 1.006
ArgPhe: 2.105 ± 1.037
ArgGly: 3.509 ± 0.661
ArgHis: 2.105 ± 1.037
ArgIle: 0.702 ± 0.346
ArgLys: 4.211 ± 0.061
ArgLeu: 3.509 ± 0.661
ArgMet: 0.702 ± 0.721
ArgAsn: 2.807 ± 0.752
ArgPro: 2.105 ± 0.03
ArgGln: 2.105 ± 1.097
ArgArg: 4.211 ± 1.128
ArgSer: 4.912 ± 0.782
ArgThr: 6.316 ± 2.225
ArgVal: 3.509 ± 1.473
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.105 ± 0.03
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.316 ± 0.091
SerCys: 1.404 ± 0.376
SerAsp: 4.912 ± 0.782
SerGlu: 4.211 ± 1.006
SerPhe: 2.807 ± 0.315
SerGly: 6.316 ± 0.976
SerHis: 1.404 ± 0.691
SerIle: 4.211 ± 0.061
SerLys: 4.912 ± 2.419
SerLeu: 3.509 ± 0.661
SerMet: 0.702 ± 0.721
SerAsn: 4.211 ± 0.061
SerPro: 7.719 ± 0.467
SerGln: 1.404 ± 1.443
SerArg: 2.807 ± 0.752
SerSer: 2.807 ± 0.315
SerThr: 2.105 ± 1.097
SerVal: 6.316 ± 0.091
SerTrp: 0.702 ± 0.721
SerTyr: 2.807 ± 0.752
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.316 ± 0.091
ThrCys: 1.404 ± 0.376
ThrAsp: 4.912 ± 0.285
ThrGlu: 2.807 ± 0.315
ThrPhe: 1.404 ± 0.376
ThrGly: 3.509 ± 0.406
ThrHis: 0.0 ± 0.0
ThrIle: 2.105 ± 0.03
ThrLys: 4.912 ± 1.352
ThrLeu: 4.912 ± 0.782
ThrMet: 2.105 ± 0.03
ThrAsn: 3.509 ± 0.406
ThrPro: 8.421 ± 1.188
ThrGln: 2.105 ± 0.03
ThrArg: 3.509 ± 1.728
ThrSer: 6.316 ± 0.976
ThrThr: 5.614 ± 0.63
ThrVal: 7.018 ± 0.255
ThrTrp: 1.404 ± 0.376
ThrTyr: 1.404 ± 0.691
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.614 ± 1.503
ValCys: 1.404 ± 0.376
ValAsp: 6.316 ± 2.043
ValGlu: 5.614 ± 1.503
ValPhe: 2.105 ± 0.03
ValGly: 2.807 ± 0.752
ValHis: 0.0 ± 0.0
ValIle: 3.509 ± 0.406
ValLys: 3.509 ± 0.661
ValLeu: 6.316 ± 0.976
ValMet: 4.211 ± 0.876
ValAsn: 1.404 ± 0.691
ValPro: 3.509 ± 0.406
ValGln: 0.0 ± 0.0
ValArg: 4.211 ± 1.006
ValSer: 7.719 ± 0.467
ValThr: 4.211 ± 0.061
ValVal: 7.018 ± 0.812
ValTrp: 0.0 ± 0.0
ValTyr: 2.807 ± 1.382
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.404 ± 0.691
TrpCys: 0.702 ± 0.721
TrpAsp: 0.702 ± 0.721
TrpGlu: 0.0 ± 0.0
TrpPhe: 1.404 ± 0.691
TrpGly: 0.702 ± 0.721
TrpHis: 0.702 ± 0.346
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 2.807 ± 0.752
TrpMet: 0.0 ± 0.0
TrpAsn: 0.702 ± 0.346
TrpPro: 0.702 ± 0.346
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 1.404 ± 1.443
TrpThr: 1.404 ± 1.443
TrpVal: 1.404 ± 0.691
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.702 ± 0.346
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.105 ± 1.037
TyrCys: 0.702 ± 0.346
TyrAsp: 0.702 ± 0.721
TyrGlu: 0.702 ± 0.346
TyrPhe: 1.404 ± 0.691
TyrGly: 4.211 ± 1.006
TyrHis: 1.404 ± 0.376
TyrIle: 2.105 ± 0.03
TyrLys: 2.105 ± 1.037
TyrLeu: 2.807 ± 1.382
TyrMet: 0.0 ± 0.0
TyrAsn: 0.702 ± 0.721
TyrPro: 1.404 ± 1.443
TyrGln: 0.0 ± 0.0
TyrArg: 2.105 ± 1.037
TyrSer: 2.807 ± 0.752
TyrThr: 2.105 ± 0.03
TyrVal: 2.105 ± 1.037
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.702 ± 0.346
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1426 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
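A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The details are assumptions for illustration, not the database's documented procedure: the function names, the fixed random seed, and the use of the standard deviation as the error measure are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per mille frequency of each overlapping residue pair (assumed normalization)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    # Assumption: the reported error is the standard deviation of the replicates.
    return statistics.pstdev(estimates)


# Toy input, not taken from the proteome described here.
error_aa = bootstrap_error(["MAAL", "ALK"], "AA")
```

With only two proteins, each replicate can take just a few distinct compositions, so error estimates from such a small proteome are necessarily coarse.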

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski