Amino acid dipepetide frequency for Hubei tombus-like virus 43

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
6.407AlaAla: 6.407 ± 1.294
1.068AlaCys: 1.068 ± 0.694
4.805AlaAsp: 4.805 ± 1.186
1.068AlaGlu: 1.068 ± 0.519
3.203AlaPhe: 3.203 ± 1.474
2.67AlaGly: 2.67 ± 0.367
2.67AlaHis: 2.67 ± 1.121
6.407AlaIle: 6.407 ± 1.002
5.339AlaLys: 5.339 ± 0.781
4.271AlaLeu: 4.271 ± 0.784
3.203AlaMet: 3.203 ± 0.727
2.67AlaAsn: 2.67 ± 0.386
3.737AlaPro: 3.737 ± 0.708
1.602AlaGln: 1.602 ± 0.364
3.737AlaArg: 3.737 ± 0.475
6.941AlaSer: 6.941 ± 0.983
3.203AlaThr: 3.203 ± 0.727
4.271AlaVal: 4.271 ± 1.066
0.534AlaTrp: 0.534 ± 0.483
1.602AlaTyr: 1.602 ± 0.364
0.0AlaXaa: 0.0 ± 0.0
Cys
1.602CysAla: 1.602 ± 1.026
0.534CysCys: 0.534 ± 0.342
2.136CysAsp: 2.136 ± 0.783
1.068CysGlu: 1.068 ± 0.684
0.0CysPhe: 0.0 ± 0.0
1.068CysGly: 1.068 ± 0.392
0.534CysHis: 0.534 ± 0.342
1.602CysIle: 1.602 ± 0.554
0.0CysLys: 0.0 ± 0.0
1.068CysLeu: 1.068 ± 0.694
2.136CysMet: 2.136 ± 0.679
1.602CysAsn: 1.602 ± 0.592
2.136CysPro: 2.136 ± 0.978
0.0CysGln: 0.0 ± 0.0
1.068CysArg: 1.068 ± 0.392
1.602CysSer: 1.602 ± 0.592
1.068CysThr: 1.068 ± 0.519
0.534CysVal: 0.534 ± 0.65
0.0CysTrp: 0.0 ± 0.0
1.068CysTyr: 1.068 ± 1.3
0.0CysXaa: 0.0 ± 0.0
Asp
3.737AspAla: 3.737 ± 0.687
0.534AspCys: 0.534 ± 0.342
3.737AspAsp: 3.737 ± 0.977
4.271AspGlu: 4.271 ± 0.904
1.068AspPhe: 1.068 ± 0.684
3.737AspGly: 3.737 ± 1.807
1.602AspHis: 1.602 ± 0.592
1.602AspIle: 1.602 ± 0.364
4.805AspLys: 4.805 ± 1.75
6.407AspLeu: 6.407 ± 1.313
2.136AspMet: 2.136 ± 1.422
1.602AspAsn: 1.602 ± 0.592
2.67AspPro: 2.67 ± 0.386
2.67AspGln: 2.67 ± 0.386
3.203AspArg: 3.203 ± 1.474
2.136AspSer: 2.136 ± 0.129
4.271AspThr: 4.271 ± 1.416
4.271AspVal: 4.271 ± 0.64
1.068AspTrp: 1.068 ± 0.392
3.203AspTyr: 3.203 ± 0.647
0.0AspXaa: 0.0 ± 0.0
Glu
3.203GluAla: 3.203 ± 3.15
1.602GluCys: 1.602 ± 1.026
0.534GluAsp: 0.534 ± 0.483
1.602GluGlu: 1.602 ± 1.004
1.602GluPhe: 1.602 ± 0.81
2.67GluGly: 2.67 ± 0.755
1.602GluHis: 1.602 ± 1.026
3.737GluIle: 3.737 ± 1.807
1.068GluLys: 1.068 ± 0.392
3.737GluLeu: 3.737 ± 1.68
2.136GluMet: 2.136 ± 1.875
2.136GluAsn: 2.136 ± 1.275
1.602GluPro: 1.602 ± 0.364
4.271GluGln: 4.271 ± 0.784
2.67GluArg: 2.67 ± 0.897
2.67GluSer: 2.67 ± 1.148
1.602GluThr: 1.602 ± 1.004
2.67GluVal: 2.67 ± 0.367
0.534GluTrp: 0.534 ± 0.483
1.068GluTyr: 1.068 ± 0.684
0.0GluXaa: 0.0 ± 0.0
Phe
1.068PheAla: 1.068 ± 0.392
1.602PheCys: 1.602 ± 0.364
4.271PheAsp: 4.271 ± 0.257
0.534PheGlu: 0.534 ± 0.342
1.068PhePhe: 1.068 ± 0.684
0.534PheGly: 0.534 ± 0.483
1.068PheHis: 1.068 ± 0.392
0.0PheIle: 0.0 ± 0.0
1.602PheLys: 1.602 ± 1.026
4.271PheLeu: 4.271 ± 0.257
0.534PheMet: 0.534 ± 0.342
1.068PheAsn: 1.068 ± 0.966
1.602PhePro: 1.602 ± 1.004
1.068PheGln: 1.068 ± 0.684
2.136PheArg: 2.136 ± 0.783
1.068PheSer: 1.068 ± 0.966
3.203PheThr: 3.203 ± 1.474
4.271PheVal: 4.271 ± 0.656
0.0PheTrp: 0.0 ± 0.0
2.136PheTyr: 2.136 ± 0.834
0.0PheXaa: 0.0 ± 0.0
Gly
0.534GlyAla: 0.534 ± 0.65
0.0GlyCys: 0.0 ± 0.0
3.203GlyAsp: 3.203 ± 1.614
3.203GlyGlu: 3.203 ± 0.727
2.136GlyPhe: 2.136 ± 0.834
4.271GlyGly: 4.271 ± 0.904
2.67GlyHis: 2.67 ± 1.628
0.534GlyIle: 0.534 ± 0.483
3.737GlyLys: 3.737 ± 1.374
5.873GlyLeu: 5.873 ± 1.988
3.203GlyMet: 3.203 ± 0.647
0.534GlyAsn: 0.534 ± 0.342
3.737GlyPro: 3.737 ± 1.374
1.068GlyGln: 1.068 ± 0.519
3.737GlyArg: 3.737 ± 0.425
4.271GlySer: 4.271 ± 1.637
2.67GlyThr: 2.67 ± 0.386
2.136GlyVal: 2.136 ± 0.816
1.068GlyTrp: 1.068 ± 0.519
0.534GlyTyr: 0.534 ± 0.342
0.0GlyXaa: 0.0 ± 0.0
His
3.203HisAla: 3.203 ± 2.251
0.0HisCys: 0.0 ± 0.0
2.136HisAsp: 2.136 ± 0.129
3.203HisGlu: 3.203 ± 0.865
0.534HisPhe: 0.534 ± 0.342
0.0HisGly: 0.0 ± 0.0
1.068HisHis: 1.068 ± 0.694
2.67HisIle: 2.67 ± 1.148
1.068HisLys: 1.068 ± 0.519
3.203HisLeu: 3.203 ± 1.109
1.602HisMet: 1.602 ± 1.126
1.068HisAsn: 1.068 ± 0.519
1.068HisPro: 1.068 ± 0.392
2.136HisGln: 2.136 ± 0.783
1.602HisArg: 1.602 ± 0.364
3.203HisSer: 3.203 ± 0.647
0.534HisThr: 0.534 ± 0.483
2.67HisVal: 2.67 ± 0.897
0.534HisTrp: 0.534 ± 0.65
1.602HisTyr: 1.602 ± 0.554
0.0HisXaa: 0.0 ± 0.0
Ile
3.203IleAla: 3.203 ± 1.109
1.068IleCys: 1.068 ± 0.392
4.271IleAsp: 4.271 ± 0.257
4.805IleGlu: 4.805 ± 1.79
1.602IlePhe: 1.602 ± 0.81
4.805IleGly: 4.805 ± 1.186
3.203IleHis: 3.203 ± 0.865
3.737IleIle: 3.737 ± 0.425
3.737IleLys: 3.737 ± 1.036
3.737IleLeu: 3.737 ± 1.374
2.67IleMet: 2.67 ± 0.386
4.271IleAsn: 4.271 ± 0.64
5.339IlePro: 5.339 ± 1.923
1.068IleGln: 1.068 ± 0.519
2.136IleArg: 2.136 ± 0.834
2.136IleSer: 2.136 ± 1.275
3.737IleThr: 3.737 ± 0.687
3.203IleVal: 3.203 ± 1.184
0.534IleTrp: 0.534 ± 0.483
1.068IleTyr: 1.068 ± 0.684
0.0IleXaa: 0.0 ± 0.0
Lys
4.271LysAla: 4.271 ± 2.092
0.534LysCys: 0.534 ± 0.342
3.203LysAsp: 3.203 ± 0.698
2.136LysGlu: 2.136 ± 1.368
0.534LysPhe: 0.534 ± 0.65
2.67LysGly: 2.67 ± 0.755
2.136LysHis: 2.136 ± 1.038
3.737LysIle: 3.737 ± 0.425
1.602LysLys: 1.602 ± 0.554
1.602LysLeu: 1.602 ± 1.449
0.0LysMet: 0.0 ± 0.0
3.737LysAsn: 3.737 ± 0.425
2.67LysPro: 2.67 ± 1.06
5.339LysGln: 5.339 ± 2.12
3.203LysArg: 3.203 ± 0.265
5.873LysSer: 5.873 ± 1.781
2.136LysThr: 2.136 ± 1.414
5.873LysVal: 5.873 ± 0.512
1.068LysTrp: 1.068 ± 0.392
1.602LysTyr: 1.602 ± 0.554
0.0LysXaa: 0.0 ± 0.0
Leu
5.873LeuAla: 5.873 ± 1.689
1.602LeuCys: 1.602 ± 0.81
3.737LeuAsp: 3.737 ± 0.687
2.136LeuGlu: 2.136 ± 1.275
3.203LeuPhe: 3.203 ± 1.175
6.407LeuGly: 6.407 ± 1.043
2.67LeuHis: 2.67 ± 1.054
6.941LeuIle: 6.941 ± 1.524
5.339LeuLys: 5.339 ± 0.781
8.009LeuLeu: 8.009 ± 1.251
0.534LeuMet: 0.534 ± 0.342
2.67LeuAsn: 2.67 ± 0.367
6.941LeuPro: 6.941 ± 0.689
2.67LeuGln: 2.67 ± 0.386
4.271LeuArg: 4.271 ± 2.142
7.475LeuSer: 7.475 ± 2.768
5.339LeuThr: 5.339 ± 1.025
6.941LeuVal: 6.941 ± 0.282
1.602LeuTrp: 1.602 ± 0.554
2.136LeuTyr: 2.136 ± 0.783
0.0LeuXaa: 0.0 ± 0.0
Met
2.67MetAla: 2.67 ± 1.06
1.602MetCys: 1.602 ± 1.95
2.136MetAsp: 2.136 ± 0.978
2.136MetGlu: 2.136 ± 1.038
1.602MetPhe: 1.602 ± 0.364
1.068MetGly: 1.068 ± 0.684
1.068MetHis: 1.068 ± 0.684
1.602MetIle: 1.602 ± 0.592
0.0MetLys: 0.0 ± 0.0
2.136MetLeu: 2.136 ± 0.679
0.534MetMet: 0.534 ± 0.342
2.136MetAsn: 2.136 ± 0.783
1.068MetPro: 1.068 ± 0.694
1.602MetGln: 1.602 ± 0.554
1.602MetArg: 1.602 ± 0.364
2.67MetSer: 2.67 ± 1.102
0.534MetThr: 0.534 ± 0.483
0.0MetVal: 0.0 ± 0.0
1.068MetTrp: 1.068 ± 0.519
0.534MetTyr: 0.534 ± 0.65
0.0MetXaa: 0.0 ± 0.0
Asn
3.203AsnAla: 3.203 ± 0.698
1.602AsnCys: 1.602 ± 0.364
2.67AsnAsp: 2.67 ± 1.102
1.602AsnGlu: 1.602 ± 0.554
1.602AsnPhe: 1.602 ± 1.004
1.068AsnGly: 1.068 ± 0.694
0.534AsnHis: 0.534 ± 0.342
4.271AsnIle: 4.271 ± 1.668
2.67AsnLys: 2.67 ± 0.755
2.136AsnLeu: 2.136 ± 1.389
1.602AsnMet: 1.602 ± 1.026
2.136AsnAsn: 2.136 ± 0.679
2.136AsnPro: 2.136 ± 0.783
2.136AsnGln: 2.136 ± 1.389
1.602AsnArg: 1.602 ± 0.81
4.271AsnSer: 4.271 ± 0.656
3.203AsnThr: 3.203 ± 0.727
2.67AsnVal: 2.67 ± 1.06
1.068AsnTrp: 1.068 ± 0.966
2.67AsnTyr: 2.67 ± 0.367
0.0AsnXaa: 0.0 ± 0.0
Pro
3.203ProAla: 3.203 ± 1.474
1.602ProCys: 1.602 ± 1.126
2.67ProAsp: 2.67 ± 1.71
4.805ProGlu: 4.805 ± 1.05
3.203ProPhe: 3.203 ± 1.109
2.136ProGly: 2.136 ± 0.816
0.534ProHis: 0.534 ± 0.483
3.203ProIle: 3.203 ± 0.265
4.271ProLys: 4.271 ± 1.066
3.737ProLeu: 3.737 ± 2.081
1.068ProMet: 1.068 ± 0.694
4.271ProAsn: 4.271 ± 1.376
2.136ProPro: 2.136 ± 0.834
2.67ProGln: 2.67 ± 1.121
5.339ProArg: 5.339 ± 2.354
2.67ProSer: 2.67 ± 0.897
6.941ProThr: 6.941 ± 0.923
3.203ProVal: 3.203 ± 0.647
1.068ProTrp: 1.068 ± 0.519
0.534ProTyr: 0.534 ± 0.483
0.0ProXaa: 0.0 ± 0.0
Gln
1.602GlnAla: 1.602 ± 0.81
1.602GlnCys: 1.602 ± 0.592
1.068GlnAsp: 1.068 ± 0.684
0.534GlnGlu: 0.534 ± 0.342
1.068GlnPhe: 1.068 ± 0.519
1.068GlnGly: 1.068 ± 0.694
1.602GlnHis: 1.602 ± 0.364
2.67GlnIle: 2.67 ± 1.177
1.068GlnLys: 1.068 ± 0.694
3.737GlnLeu: 3.737 ± 1.268
2.136GlnMet: 2.136 ± 1.287
1.068GlnAsn: 1.068 ± 0.392
3.737GlnPro: 3.737 ± 0.977
3.737GlnGln: 3.737 ± 0.425
4.271GlnArg: 4.271 ± 1.632
2.67GlnSer: 2.67 ± 0.367
3.737GlnThr: 3.737 ± 2.224
1.602GlnVal: 1.602 ± 0.81
1.068GlnTrp: 1.068 ± 0.392
0.534GlnTyr: 0.534 ± 0.342
0.0GlnXaa: 0.0 ± 0.0
Arg
3.737ArgAla: 3.737 ± 1.374
0.534ArgCys: 0.534 ± 0.342
2.136ArgAsp: 2.136 ± 0.783
1.602ArgGlu: 1.602 ± 0.592
3.737ArgPhe: 3.737 ± 1.374
3.737ArgGly: 3.737 ± 0.708
2.136ArgHis: 2.136 ± 0.783
2.136ArgIle: 2.136 ± 0.129
5.339ArgLys: 5.339 ± 1.048
5.873ArgLeu: 5.873 ± 2.205
0.0ArgMet: 0.0 ± 0.0
3.737ArgAsn: 3.737 ± 1.036
3.203ArgPro: 3.203 ± 1.109
1.068ArgGln: 1.068 ± 0.684
6.407ArgArg: 6.407 ± 2.124
3.203ArgSer: 3.203 ± 0.265
3.737ArgThr: 3.737 ± 0.425
4.805ArgVal: 4.805 ± 1.051
1.602ArgTrp: 1.602 ± 0.592
1.602ArgTyr: 1.602 ± 0.554
0.0ArgXaa: 0.0 ± 0.0
Ser
5.339SerAla: 5.339 ± 0.143
1.068SerCys: 1.068 ± 0.392
2.136SerAsp: 2.136 ± 0.129
3.203SerGlu: 3.203 ± 0.647
1.068SerPhe: 1.068 ± 0.392
4.271SerGly: 4.271 ± 1.166
2.136SerHis: 2.136 ± 0.834
5.873SerIle: 5.873 ± 2.597
2.136SerLys: 2.136 ± 1.763
9.61SerLeu: 9.61 ± 2.178
0.534SerMet: 0.534 ± 0.65
3.203SerAsn: 3.203 ± 1.317
3.203SerPro: 3.203 ± 0.265
0.534SerGln: 0.534 ± 0.483
4.805SerArg: 4.805 ± 0.431
7.475SerSer: 7.475 ± 2.461
7.475SerThr: 7.475 ± 1.53
8.009SerVal: 8.009 ± 1.157
1.602SerTrp: 1.602 ± 0.592
3.737SerTyr: 3.737 ± 0.687
0.0SerXaa: 0.0 ± 0.0
Thr
6.941ThrAla: 6.941 ± 2.63
2.136ThrCys: 2.136 ± 0.129
4.271ThrAsp: 4.271 ± 1.992
1.602ThrGlu: 1.602 ± 1.004
2.67ThrPhe: 2.67 ± 0.755
2.67ThrGly: 2.67 ± 1.148
2.67ThrHis: 2.67 ± 1.06
4.271ThrIle: 4.271 ± 1.359
3.203ThrLys: 3.203 ± 1.403
6.941ThrLeu: 6.941 ± 1.653
1.602ThrMet: 1.602 ± 1.026
0.534ThrAsn: 0.534 ± 0.65
5.339ThrPro: 5.339 ± 1.923
3.737ThrGln: 3.737 ± 0.475
4.271ThrArg: 4.271 ± 0.656
6.941ThrSer: 6.941 ± 2.933
3.737ThrThr: 3.737 ± 0.977
1.602ThrVal: 1.602 ± 0.592
0.534ThrTrp: 0.534 ± 0.483
1.602ThrTyr: 1.602 ± 0.592
0.0ThrXaa: 0.0 ± 0.0
Val
5.873ValAla: 5.873 ± 0.567
0.534ValCys: 0.534 ± 0.342
6.407ValAsp: 6.407 ± 1.294
1.602ValGlu: 1.602 ± 1.449
1.068ValPhe: 1.068 ± 0.966
2.67ValGly: 2.67 ± 1.06
2.136ValHis: 2.136 ± 0.679
3.203ValIle: 3.203 ± 1.414
5.339ValLys: 5.339 ± 1.794
5.873ValLeu: 5.873 ± 1.152
1.068ValMet: 1.068 ± 0.519
3.737ValAsn: 3.737 ± 0.475
4.271ValPro: 4.271 ± 1.158
2.136ValGln: 2.136 ± 0.783
3.203ValArg: 3.203 ± 1.414
4.805ValSer: 4.805 ± 3.352
6.407ValThr: 6.407 ± 3.765
5.339ValVal: 5.339 ± 0.776
1.602ValTrp: 1.602 ± 1.255
1.068ValTyr: 1.068 ± 0.392
0.0ValXaa: 0.0 ± 0.0
Trp
1.068TrpAla: 1.068 ± 0.966
1.068TrpCys: 1.068 ± 0.392
1.602TrpAsp: 1.602 ± 0.364
0.0TrpGlu: 0.0 ± 0.0
0.534TrpPhe: 0.534 ± 0.342
0.534TrpGly: 0.534 ± 0.483
0.0TrpHis: 0.0 ± 0.0
0.534TrpIle: 0.534 ± 0.342
0.534TrpLys: 0.534 ± 0.342
2.67TrpLeu: 2.67 ± 0.367
0.534TrpMet: 0.534 ± 0.384
1.068TrpAsn: 1.068 ± 1.3
1.068TrpPro: 1.068 ± 0.392
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
1.602TrpSer: 1.602 ± 0.81
2.136TrpThr: 2.136 ± 1.038
2.136TrpVal: 2.136 ± 0.978
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.67TyrAla: 2.67 ± 0.367
0.534TyrCys: 0.534 ± 0.342
2.136TyrAsp: 2.136 ± 0.816
1.602TyrGlu: 1.602 ± 0.592
1.602TyrPhe: 1.602 ± 0.364
1.068TyrGly: 1.068 ± 0.392
1.068TyrHis: 1.068 ± 0.684
1.602TyrIle: 1.602 ± 0.364
1.068TyrLys: 1.068 ± 0.392
1.602TyrLeu: 1.602 ± 0.81
0.0TyrMet: 0.0 ± 0.0
1.602TyrAsn: 1.602 ± 0.81
1.602TyrPro: 1.602 ± 0.81
1.068TyrGln: 1.068 ± 0.392
1.068TyrArg: 1.068 ± 0.684
3.737TyrSer: 3.737 ± 0.475
1.602TyrThr: 1.602 ± 1.126
2.136TyrVal: 2.136 ± 0.816
0.534TyrTrp: 0.534 ± 0.342
1.068TyrTyr: 1.068 ± 0.519
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1874 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski