Amino acid dipeptide frequency for Hubei noda-like virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
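As an illustration of how such frequencies might be computed, here is a minimal Python sketch. It assumes one-letter sequences, counts overlapping dipeptides within each protein, and maps every non-standard letter to X (Xaa). The function name `dipeptide_frequencies` and the toy sequences are hypothetical, and the exact counting procedure used for the table below is not described on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2 that never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real viral proteins); 'B' is treated as Xaa.
freqs = dipeptide_frequencies(["MKVLAAGA", "AAGB"])
```

Only dipeptides that actually occur appear in the result; absent ones correspond to the 0.0 entries of a full table.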

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all amino acids were equally common and dipeptides formed at random, each dipeptide would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
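The arithmetic behind the expected value can be sketched in a few lines of Python. The helper names are hypothetical, and the baseline of 400 dipeptides with a strict greater-than/less-than comparison is an assumption; the page does not state which baseline or tolerance the colouring actually uses.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"    # would be shown in red
    if freq_per_mille < expected:
        return "under"   # would be shown in blue
    return "expected"


baseline_20 = expected_per_mille()     # 20 standard amino acids
baseline_21 = expected_per_mille(21)   # including Xaa
```

With 20 amino acids the baseline is 2.5‰; counting Xaa as well lowers it to roughly 2.27‰.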

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order (first residue in rows, second residue in columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.161 ± 0.512
AlaCys: 0.645 ± 0.312
AlaAsp: 3.871 ± 1.104
AlaGlu: 3.871 ± 2.095
AlaPhe: 4.516 ± 0.2
AlaGly: 6.452 ± 1.136
AlaHis: 0.645 ± 0.312
AlaIle: 7.097 ± 1.447
AlaLys: 3.871 ± 1.104
AlaLeu: 5.806 ± 0.168
AlaMet: 1.29 ± 0.368
AlaAsn: 3.226 ± 2.407
AlaPro: 5.161 ± 1.471
AlaGln: 2.581 ± 0.736
AlaArg: 1.29 ± 0.368
AlaSer: 8.387 ± 1.08
AlaThr: 2.581 ± 1.248
AlaVal: 1.935 ± 0.056
AlaTrp: 1.29 ± 0.624
AlaTyr: 3.226 ± 0.568
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 2.581 ± 0.736
CysCys: 1.29 ± 0.624
CysAsp: 0.645 ± 0.312
CysGlu: 0.645 ± 0.312
CysPhe: 0.645 ± 0.312
CysGly: 0.645 ± 0.312
CysHis: 0.0 ± 0.0
CysIle: 0.645 ± 0.312
CysLys: 0.0 ± 0.0
CysLeu: 1.935 ± 0.056
CysMet: 0.645 ± 0.312
CysAsn: 0.645 ± 0.68
CysPro: 0.645 ± 0.312
CysGln: 1.29 ± 0.624
CysArg: 0.645 ± 0.312
CysSer: 0.645 ± 0.312
CysThr: 0.645 ± 0.68
CysVal: 1.935 ± 0.936
CysTrp: 0.645 ± 0.68
CysTyr: 1.935 ± 0.936
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.581 ± 1.727
AspCys: 2.581 ± 0.256
AspAsp: 2.581 ± 0.256
AspGlu: 3.226 ± 0.568
AspPhe: 2.581 ± 0.736
AspGly: 3.871 ± 0.112
AspHis: 0.645 ± 0.312
AspIle: 4.516 ± 0.2
AspLys: 3.226 ± 0.568
AspLeu: 7.742 ± 1.216
AspMet: 1.29 ± 0.75
AspAsn: 3.226 ± 1.559
AspPro: 4.516 ± 1.783
AspGln: 1.29 ± 0.624
AspArg: 3.226 ± 1.559
AspSer: 3.871 ± 0.112
AspThr: 2.581 ± 1.248
AspVal: 5.806 ± 0.168
AspTrp: 0.0 ± 0.0
AspTyr: 2.581 ± 1.248
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.871 ± 0.112
GluCys: 0.645 ± 0.312
GluAsp: 1.29 ± 0.624
GluGlu: 1.29 ± 0.624
GluPhe: 5.161 ± 0.48
GluGly: 1.29 ± 0.368
GluHis: 1.935 ± 0.936
GluIle: 2.581 ± 0.256
GluLys: 0.645 ± 0.312
GluLeu: 3.871 ± 1.104
GluMet: 0.645 ± 0.312
GluAsn: 3.226 ± 0.568
GluPro: 0.645 ± 0.312
GluGln: 1.29 ± 0.624
GluArg: 1.935 ± 0.056
GluSer: 1.29 ± 0.368
GluThr: 1.29 ± 0.624
GluVal: 3.226 ± 0.568
GluTrp: 0.645 ± 0.312
GluTyr: 3.226 ± 1.415
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.935 ± 0.056
PheCys: 0.645 ± 0.312
PheAsp: 4.516 ± 0.2
PheGlu: 2.581 ± 0.736
PhePhe: 0.0 ± 0.0
PheGly: 1.935 ± 0.056
PheHis: 0.0 ± 0.0
PheIle: 1.29 ± 0.624
PheLys: 3.871 ± 0.88
PheLeu: 1.935 ± 0.056
PheMet: 1.29 ± 0.368
PheAsn: 1.29 ± 0.368
PhePro: 1.935 ± 0.936
PheGln: 1.29 ± 0.624
PheArg: 2.581 ± 0.256
PheSer: 3.226 ± 0.568
PheThr: 3.226 ± 0.568
PheVal: 1.29 ± 0.624
PheTrp: 0.0 ± 0.0
PheTyr: 1.29 ± 0.624
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.581 ± 1.248
GlyCys: 0.645 ± 0.312
GlyAsp: 1.935 ± 0.936
GlyGlu: 0.0 ± 0.0
GlyPhe: 1.29 ± 0.624
GlyGly: 1.935 ± 0.936
GlyHis: 0.0 ± 0.0
GlyIle: 3.871 ± 1.104
GlyLys: 2.581 ± 1.248
GlyLeu: 3.871 ± 0.88
GlyMet: 0.645 ± 0.312
GlyAsn: 4.516 ± 1.783
GlyPro: 1.29 ± 0.624
GlyGln: 1.935 ± 0.056
GlyArg: 4.516 ± 0.792
GlySer: 5.806 ± 1.16
GlyThr: 6.452 ± 0.848
GlyVal: 4.516 ± 0.792
GlyTrp: 0.645 ± 0.68
GlyTyr: 4.516 ± 0.792
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.935 ± 1.048
HisCys: 0.0 ± 0.0
HisAsp: 0.645 ± 0.312
HisGlu: 1.29 ± 0.624
HisPhe: 0.645 ± 0.312
HisGly: 0.645 ± 0.312
HisHis: 0.645 ± 0.312
HisIle: 0.645 ± 0.312
HisLys: 0.0 ± 0.0
HisLeu: 2.581 ± 0.736
HisMet: 0.0 ± 0.0
HisAsn: 0.645 ± 0.312
HisPro: 1.29 ± 0.368
HisGln: 1.29 ± 0.624
HisArg: 1.29 ± 0.368
HisSer: 0.645 ± 0.312
HisThr: 0.645 ± 0.312
HisVal: 1.935 ± 0.936
HisTrp: 0.0 ± 0.0
HisTyr: 0.645 ± 0.312
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.516 ± 0.2
IleCys: 0.645 ± 0.312
IleAsp: 3.226 ± 0.424
IleGlu: 2.581 ± 0.736
IlePhe: 2.581 ± 1.248
IleGly: 1.29 ± 0.624
IleHis: 1.935 ± 0.056
IleIle: 3.871 ± 0.112
IleLys: 1.29 ± 0.624
IleLeu: 6.452 ± 0.144
IleMet: 2.581 ± 1.248
IleAsn: 3.226 ± 1.559
IlePro: 3.226 ± 0.424
IleGln: 1.29 ± 0.624
IleArg: 5.161 ± 2.463
IleSer: 5.161 ± 1.471
IleThr: 2.581 ± 1.248
IleVal: 3.871 ± 0.112
IleTrp: 0.645 ± 0.312
IleTyr: 3.871 ± 1.104
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.29 ± 0.624
LysCys: 0.0 ± 0.0
LysAsp: 2.581 ± 0.256
LysGlu: 3.871 ± 1.871
LysPhe: 0.0 ± 0.0
LysGly: 1.935 ± 0.936
LysHis: 0.645 ± 0.312
LysIle: 3.871 ± 0.112
LysLys: 0.0 ± 0.0
LysLeu: 4.516 ± 1.192
LysMet: 0.645 ± 0.286
LysAsn: 1.29 ± 0.624
LysPro: 5.161 ± 0.48
LysGln: 1.29 ± 0.368
LysArg: 5.806 ± 0.824
LysSer: 1.29 ± 0.624
LysThr: 1.935 ± 0.936
LysVal: 2.581 ± 1.727
LysTrp: 0.645 ± 0.68
LysTyr: 2.581 ± 0.256
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.161 ± 0.512
LeuCys: 1.935 ± 0.056
LeuAsp: 5.161 ± 1.471
LeuGlu: 1.935 ± 0.056
LeuPhe: 5.161 ± 0.512
LeuGly: 7.742 ± 1.216
LeuHis: 1.935 ± 1.048
LeuIle: 2.581 ± 0.736
LeuLys: 3.226 ± 0.568
LeuLeu: 3.871 ± 0.112
LeuMet: 1.29 ± 0.624
LeuAsn: 1.935 ± 0.056
LeuPro: 5.161 ± 1.471
LeuGln: 3.871 ± 1.871
LeuArg: 7.742 ± 0.224
LeuSer: 5.806 ± 0.168
LeuThr: 5.806 ± 0.824
LeuVal: 5.806 ± 0.168
LeuTrp: 0.645 ± 0.68
LeuTyr: 1.29 ± 0.368
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.935 ± 0.056
MetCys: 1.935 ± 0.936
MetAsp: 1.935 ± 0.936
MetGlu: 0.645 ± 0.312
MetPhe: 0.0 ± 0.0
MetGly: 1.935 ± 0.056
MetHis: 0.0 ± 0.0
MetIle: 0.645 ± 0.312
MetLys: 1.29 ± 0.368
MetLeu: 0.0 ± 0.0
MetMet: 0.645 ± 0.312
MetAsn: 1.29 ± 0.368
MetPro: 1.935 ± 1.048
MetGln: 0.0 ± 0.0
MetArg: 0.645 ± 0.312
MetSer: 0.0 ± 0.0
MetThr: 1.935 ± 1.048
MetVal: 1.29 ± 0.624
MetTrp: 0.0 ± 0.0
MetTyr: 1.29 ± 0.624
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.161 ± 0.512
AsnCys: 1.29 ± 0.624
AsnAsp: 1.935 ± 0.056
AsnGlu: 5.161 ± 0.512
AsnPhe: 1.29 ± 0.368
AsnGly: 2.581 ± 1.248
AsnHis: 0.645 ± 0.312
AsnIle: 3.226 ± 0.424
AsnLys: 3.226 ± 0.424
AsnLeu: 5.161 ± 4.446
AsnMet: 0.645 ± 0.312
AsnAsn: 9.677 ± 4.246
AsnPro: 3.226 ± 0.424
AsnGln: 3.226 ± 1.415
AsnArg: 4.516 ± 1.192
AsnSer: 2.581 ± 0.736
AsnThr: 5.806 ± 2.151
AsnVal: 2.581 ± 0.736
AsnTrp: 0.645 ± 0.312
AsnTyr: 0.645 ± 0.312
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.226 ± 1.415
ProCys: 0.645 ± 0.68
ProAsp: 5.161 ± 0.512
ProGlu: 0.645 ± 0.312
ProPhe: 1.29 ± 0.368
ProGly: 7.097 ± 0.536
ProHis: 3.226 ± 0.424
ProIle: 5.161 ± 0.48
ProLys: 2.581 ± 0.256
ProLeu: 2.581 ± 1.248
ProMet: 0.645 ± 0.68
ProAsn: 2.581 ± 0.736
ProPro: 2.581 ± 0.736
ProGln: 1.29 ± 0.368
ProArg: 3.226 ± 0.424
ProSer: 5.161 ± 2.463
ProThr: 6.452 ± 0.848
ProVal: 2.581 ± 1.248
ProTrp: 0.0 ± 0.0
ProTyr: 2.581 ± 0.256
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.226 ± 0.568
GlnCys: 0.0 ± 0.0
GlnAsp: 1.935 ± 1.048
GlnGlu: 1.29 ± 0.624
GlnPhe: 1.935 ± 0.056
GlnGly: 0.645 ± 0.68
GlnHis: 1.29 ± 0.624
GlnIle: 1.29 ± 0.368
GlnLys: 1.935 ± 0.936
GlnLeu: 4.516 ± 1.192
GlnMet: 0.645 ± 0.312
GlnAsn: 2.581 ± 0.736
GlnPro: 2.581 ± 0.736
GlnGln: 0.0 ± 0.0
GlnArg: 2.581 ± 0.256
GlnSer: 3.226 ± 1.559
GlnThr: 3.226 ± 0.424
GlnVal: 0.645 ± 0.312
GlnTrp: 0.645 ± 0.68
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.935 ± 1.048
ArgCys: 0.0 ± 0.0
ArgAsp: 5.806 ± 0.824
ArgGlu: 3.871 ± 0.88
ArgPhe: 3.871 ± 0.88
ArgGly: 1.29 ± 1.359
ArgHis: 0.0 ± 0.0
ArgIle: 3.871 ± 1.871
ArgLys: 2.581 ± 1.248
ArgLeu: 5.161 ± 0.48
ArgMet: 0.645 ± 0.68
ArgAsn: 4.516 ± 1.192
ArgPro: 4.516 ± 2.183
ArgGln: 2.581 ± 0.736
ArgArg: 9.677 ± 0.28
ArgSer: 9.032 ± 0.4
ArgThr: 3.226 ± 0.424
ArgVal: 3.226 ± 0.568
ArgTrp: 1.29 ± 0.368
ArgTyr: 1.935 ± 0.936
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 9.677 ± 0.712
SerCys: 1.29 ± 0.368
SerAsp: 5.161 ± 0.512
SerGlu: 1.935 ± 0.056
SerPhe: 2.581 ± 1.248
SerGly: 5.161 ± 0.48
SerHis: 1.29 ± 0.624
SerIle: 3.871 ± 0.112
SerLys: 3.871 ± 0.112
SerLeu: 7.097 ± 0.536
SerMet: 1.29 ± 0.624
SerAsn: 3.871 ± 3.087
SerPro: 3.871 ± 0.112
SerGln: 1.935 ± 0.056
SerArg: 4.516 ± 2.183
SerSer: 8.387 ± 5.862
SerThr: 5.161 ± 1.471
SerVal: 8.387 ± 2.887
SerTrp: 1.29 ± 0.368
SerTyr: 3.871 ± 0.88
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.452 ± 1.839
ThrCys: 1.29 ± 1.359
ThrAsp: 4.516 ± 0.2
ThrGlu: 1.29 ± 0.624
ThrPhe: 0.645 ± 0.312
ThrGly: 1.29 ± 0.368
ThrHis: 1.29 ± 0.624
ThrIle: 3.871 ± 0.112
ThrLys: 0.645 ± 0.312
ThrLeu: 5.806 ± 0.824
ThrMet: 0.645 ± 0.312
ThrAsn: 7.742 ± 0.768
ThrPro: 6.452 ± 1.839
ThrGln: 1.935 ± 1.048
ThrArg: 5.161 ± 1.503
ThrSer: 5.161 ± 1.503
ThrThr: 5.161 ± 0.512
ThrVal: 1.935 ± 0.936
ThrTrp: 0.645 ± 0.312
ThrTyr: 5.806 ± 2.151
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.871 ± 1.104
ValCys: 0.645 ± 0.312
ValAsp: 4.516 ± 0.2
ValGlu: 1.935 ± 1.048
ValPhe: 1.29 ± 0.624
ValGly: 2.581 ± 0.736
ValHis: 0.645 ± 0.68
ValIle: 5.806 ± 1.16
ValLys: 3.871 ± 0.112
ValLeu: 3.871 ± 0.112
ValMet: 0.645 ± 0.312
ValAsn: 2.581 ± 1.727
ValPro: 2.581 ± 1.248
ValGln: 3.226 ± 0.568
ValArg: 2.581 ± 1.248
ValSer: 9.677 ± 0.712
ValThr: 2.581 ± 1.248
ValVal: 8.387 ± 0.088
ValTrp: 0.645 ± 0.312
ValTyr: 1.935 ± 0.056
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.29 ± 0.624
TrpCys: 0.645 ± 0.312
TrpAsp: 2.581 ± 0.256
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 0.645 ± 0.68
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.645 ± 0.312
TrpMet: 0.645 ± 0.68
TrpAsn: 0.0 ± 0.0
TrpPro: 0.645 ± 0.312
TrpGln: 0.0 ± 0.0
TrpArg: 0.645 ± 0.312
TrpSer: 1.935 ± 1.048
TrpThr: 1.935 ± 1.048
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.516 ± 1.192
TyrCys: 1.935 ± 0.936
TyrAsp: 2.581 ± 0.736
TyrGlu: 1.935 ± 0.056
TyrPhe: 0.645 ± 0.312
TyrGly: 2.581 ± 0.256
TyrHis: 0.645 ± 0.312
TyrIle: 1.29 ± 0.624
TyrLys: 3.226 ± 0.568
TyrLeu: 0.645 ± 0.312
TyrMet: 1.935 ± 0.056
TyrAsn: 5.161 ± 0.48
TyrPro: 1.29 ± 1.359
TyrGln: 2.581 ± 0.256
TyrArg: 1.29 ± 0.624
TyrSer: 3.226 ± 2.407
TyrThr: 4.516 ± 0.2
TyrVal: 1.935 ± 0.056
TyrTrp: 1.29 ± 0.624
TyrTyr: 3.871 ± 1.871
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1551 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
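A protein-level bootstrap of this kind could look roughly like the following Python sketch. It assumes that whole proteins are resampled with replacement and that the error is the standard deviation of the resulting estimates; the page does not spell out either detail. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resampled protein sets."""
    rng = random.Random(seed)  # fixed seed only for reproducibility
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences, not the actual viral proteins.
err_same = bootstrap_error(["AAAA", "AAAA"], "AA")
err_diff = bootstrap_error(["AAGA", "GGGG"], "AA")
```

With only two proteins, as here, each resample is one of very few possible combinations, so the error estimates can take only a handful of distinct values.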

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski