Amino acid dipeptide frequency for Hubei diptera virus 22

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins randomly with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
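The counting described above can be sketched in Python. This is a minimal illustration, not the code used to build the table: the function names, the toy sequences, and the choice to compare against a flat 2.5‰ expectation are all assumptions made for the example, and the page does not state the exact threshold used for the red/blue marking.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs (with X = Xaa)."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every letter outside the 20 standard ones is treated as Xaa ('X').
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Expected value if the 400 standard dipeptides were equally likely: 2.5 per mille.
EXPECTED = 1000.0 / 400


def classify(freq):
    """Label a per-mille frequency relative to the flat expectation (assumed rule)."""
    if freq > EXPECTED:
        return "over"
    if freq < EXPECTED:
        return "under"
    return "expected"


toy = ["MAAS", "AAL"]  # made-up sequences, not the virus proteome
freqs = dipeptide_per_mille(toy)
```

With the toy input, the five adjacent pairs are MA, AA, AS, AA and AL, so `freqs["AA"]` is two out of five pairs and is classified as overrepresented, while an absent pair such as `"WW"` is classified as underrepresented.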

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
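As a small worked example of the unit conversion, the helpers below (hypothetical names, written only for this illustration) convert between per mille, fractions and percent. For instance, the AlaAla value of 8.461‰ corresponds to a fraction of about 0.008461, or roughly 0.85%, and the flat expectation of 0.25% corresponds to 2.5‰.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


def percent_to_per_mille(value):
    """Convert percent to per mille (1 percent = 10 per mille)."""
    return value * 10.0


ala_ala = 8.461  # AlaAla value taken from the table, in per mille
ala_ala_fraction = per_mille_to_fraction(ala_ala)
ala_ala_percent = per_mille_to_percent(ala_ala)
expected_per_mille = percent_to_per_mille(0.25)
```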

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.461AlaAla: 8.461 ± 0.05
2.015AlaCys: 2.015 ± 0.389
6.044AlaAsp: 6.044 ± 0.637
5.238AlaGlu: 5.238 ± 0.17
2.015AlaPhe: 2.015 ± 0.389
4.432AlaGly: 4.432 ± 1.578
0.806AlaHis: 0.806 ± 0.205
4.029AlaIle: 4.029 ± 1.628
4.835AlaLys: 4.835 ± 1.175
5.641AlaLeu: 5.641 ± 0.835
1.612AlaMet: 1.612 ± 0.191
1.612AlaAsn: 1.612 ± 0.191
6.446AlaPro: 6.446 ± 2.569
2.417AlaGln: 2.417 ± 1.189
4.029AlaArg: 4.029 ± 0.425
8.864AlaSer: 8.864 ± 0.75
4.835AlaThr: 4.835 ± 1.175
7.252AlaVal: 7.252 ± 0.559
2.417AlaTrp: 2.417 ± 0.616
1.612AlaTyr: 1.612 ± 1.012
0.0AlaXaa: 0.0 ± 0.0
Cys
1.209CysAla: 1.209 ± 0.594
0.403CysCys: 0.403 ± 0.403
0.403CysAsp: 0.403 ± 0.198
1.209CysGlu: 1.209 ± 0.007
1.209CysPhe: 1.209 ± 0.594
0.806CysGly: 0.806 ± 0.205
0.0CysHis: 0.0 ± 0.0
0.806CysIle: 0.806 ± 0.396
0.0CysLys: 0.0 ± 0.0
2.015CysLeu: 2.015 ± 0.212
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.403CysPro: 0.403 ± 0.198
0.0CysGln: 0.0 ± 0.0
0.806CysArg: 0.806 ± 0.205
1.209CysSer: 1.209 ± 0.007
0.403CysThr: 0.403 ± 0.198
1.209CysVal: 1.209 ± 0.594
0.403CysTrp: 0.403 ± 0.403
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.029AspAla: 4.029 ± 1.026
1.209AspCys: 1.209 ± 0.007
4.835AspAsp: 4.835 ± 0.573
3.626AspGlu: 3.626 ± 0.021
1.209AspPhe: 1.209 ± 0.609
2.82AspGly: 2.82 ± 0.418
1.612AspHis: 1.612 ± 0.191
2.015AspIle: 2.015 ± 0.389
2.015AspLys: 2.015 ± 0.389
6.849AspLeu: 6.849 ± 0.361
1.612AspMet: 1.612 ± 1.614
2.015AspAsn: 2.015 ± 0.212
6.446AspPro: 6.446 ± 0.764
1.209AspGln: 1.209 ± 0.007
1.209AspArg: 1.209 ± 0.594
3.626AspSer: 3.626 ± 1.783
1.209AspThr: 1.209 ± 0.007
4.835AspVal: 4.835 ± 1.175
1.209AspTrp: 1.209 ± 0.609
2.015AspTyr: 2.015 ± 0.389
0.0AspXaa: 0.0 ± 0.0
Glu
2.417GluAla: 2.417 ± 0.587
0.403GluCys: 0.403 ± 0.198
1.612GluAsp: 1.612 ± 0.191
2.417GluGlu: 2.417 ± 0.014
2.015GluPhe: 2.015 ± 0.814
2.015GluGly: 2.015 ± 0.991
1.209GluHis: 1.209 ± 0.007
2.82GluIle: 2.82 ± 0.418
1.209GluLys: 1.209 ± 0.609
2.417GluLeu: 2.417 ± 1.217
0.806GluMet: 0.806 ± 0.205
0.806GluAsn: 0.806 ± 0.396
3.223GluPro: 3.223 ± 0.984
2.015GluGln: 2.015 ± 0.814
3.626GluArg: 3.626 ± 1.225
3.626GluSer: 3.626 ± 0.021
3.626GluThr: 3.626 ± 0.58
2.015GluVal: 2.015 ± 0.212
0.806GluTrp: 0.806 ± 0.396
0.403GluTyr: 0.403 ± 0.198
0.0GluXaa: 0.0 ± 0.0
Phe
2.82PheAla: 2.82 ± 0.418
0.0PheCys: 0.0 ± 0.0
1.612PheAsp: 1.612 ± 0.411
3.223PheGlu: 3.223 ± 0.821
0.403PhePhe: 0.403 ± 0.403
2.417PheGly: 2.417 ± 0.014
0.806PheHis: 0.806 ± 0.205
2.82PheIle: 2.82 ± 0.785
2.015PheLys: 2.015 ± 2.017
2.82PheLeu: 2.82 ± 1.019
0.403PheMet: 0.403 ± 0.403
1.209PheAsn: 1.209 ± 0.594
2.417PhePro: 2.417 ± 1.189
2.015PheGln: 2.015 ± 1.416
0.806PheArg: 0.806 ± 0.396
4.029PheSer: 4.029 ± 0.425
3.223PheThr: 3.223 ± 0.821
2.015PheVal: 2.015 ± 0.991
0.403PheTrp: 0.403 ± 0.198
1.612PheTyr: 1.612 ± 0.191
0.0PheXaa: 0.0 ± 0.0
Gly
4.029GlyAla: 4.029 ± 1.38
0.403GlyCys: 0.403 ± 0.198
2.82GlyAsp: 2.82 ± 0.785
2.417GlyGlu: 2.417 ± 0.587
4.029GlyPhe: 4.029 ± 2.23
5.641GlyGly: 5.641 ± 0.969
1.209GlyHis: 1.209 ± 1.21
1.209GlyIle: 1.209 ± 1.21
3.223GlyLys: 3.223 ± 2.024
6.849GlyLeu: 6.849 ± 0.241
1.612GlyMet: 1.612 ± 0.191
1.612GlyAsn: 1.612 ± 0.411
4.029GlyPro: 4.029 ± 1.981
1.612GlyGln: 1.612 ± 0.411
4.432GlyArg: 4.432 ± 0.976
4.432GlySer: 4.432 ± 0.375
4.029GlyThr: 4.029 ± 0.177
5.641GlyVal: 5.641 ± 1.571
1.209GlyTrp: 1.209 ± 1.21
2.82GlyTyr: 2.82 ± 0.184
0.0GlyXaa: 0.0 ± 0.0
His
1.612HisAla: 1.612 ± 0.411
0.403HisCys: 0.403 ± 0.403
0.403HisAsp: 0.403 ± 0.403
0.403HisGlu: 0.403 ± 0.403
0.806HisPhe: 0.806 ± 0.807
3.223HisGly: 3.223 ± 0.22
0.806HisHis: 0.806 ± 0.396
1.209HisIle: 1.209 ± 0.594
0.403HisLys: 0.403 ± 0.198
2.82HisLeu: 2.82 ± 0.184
0.806HisMet: 0.806 ± 0.396
0.403HisAsn: 0.403 ± 0.198
0.403HisPro: 0.403 ± 0.198
2.015HisGln: 2.015 ± 1.416
1.612HisArg: 1.612 ± 0.191
2.82HisSer: 2.82 ± 0.785
0.403HisThr: 0.403 ± 0.198
0.403HisVal: 0.403 ± 0.403
0.0HisTrp: 0.0 ± 0.0
1.209HisTyr: 1.209 ± 0.609
0.0HisXaa: 0.0 ± 0.0
Ile
4.835IleAla: 4.835 ± 0.573
0.403IleCys: 0.403 ± 0.198
1.612IleAsp: 1.612 ± 0.793
1.209IleGlu: 1.209 ± 0.609
1.612IlePhe: 1.612 ± 0.191
4.029IleGly: 4.029 ± 1.026
1.209IleHis: 1.209 ± 0.594
1.209IleIle: 1.209 ± 0.594
1.209IleLys: 1.209 ± 0.007
3.626IleLeu: 3.626 ± 1.225
2.417IleMet: 2.417 ± 0.014
1.612IleAsn: 1.612 ± 0.793
4.835IlePro: 4.835 ± 0.573
2.417IleGln: 2.417 ± 0.014
4.029IleArg: 4.029 ± 1.628
4.835IleSer: 4.835 ± 0.029
2.417IleThr: 2.417 ± 0.014
1.612IleVal: 1.612 ± 0.793
0.403IleTrp: 0.403 ± 0.403
2.417IleTyr: 2.417 ± 0.014
0.0IleXaa: 0.0 ± 0.0
Lys
1.209LysAla: 1.209 ± 0.007
0.0LysCys: 0.0 ± 0.0
0.806LysAsp: 0.806 ± 0.807
0.403LysGlu: 0.403 ± 0.198
1.612LysPhe: 1.612 ± 1.012
2.015LysGly: 2.015 ± 0.389
2.015LysHis: 2.015 ± 1.416
1.612LysIle: 1.612 ± 1.012
1.209LysLys: 1.209 ± 0.007
0.806LysLeu: 0.806 ± 0.205
1.612LysMet: 1.612 ± 0.411
1.612LysAsn: 1.612 ± 1.012
2.015LysPro: 2.015 ± 0.814
0.403LysGln: 0.403 ± 0.403
3.223LysArg: 3.223 ± 0.22
1.612LysSer: 1.612 ± 0.411
3.223LysThr: 3.223 ± 0.22
0.806LysVal: 0.806 ± 0.807
1.209LysTrp: 1.209 ± 0.594
1.209LysTyr: 1.209 ± 0.007
0.0LysXaa: 0.0 ± 0.0
Leu
7.252LeuAla: 7.252 ± 0.559
0.806LeuCys: 0.806 ± 0.396
4.432LeuAsp: 4.432 ± 1.43
4.029LeuGlu: 4.029 ± 0.778
2.82LeuPhe: 2.82 ± 0.785
5.641LeuGly: 5.641 ± 3.242
1.209LeuHis: 1.209 ± 0.594
2.417LeuIle: 2.417 ± 0.014
1.612LeuLys: 1.612 ± 0.411
4.835LeuLeu: 4.835 ± 0.573
2.015LeuMet: 2.015 ± 0.991
5.238LeuAsn: 5.238 ± 0.771
9.267LeuPro: 9.267 ± 0.255
1.209LeuGln: 1.209 ± 0.007
4.029LeuArg: 4.029 ± 0.425
6.849LeuSer: 6.849 ± 0.361
7.655LeuThr: 7.655 ± 2.251
6.044LeuVal: 6.044 ± 1.84
2.417LeuTrp: 2.417 ± 0.014
1.612LeuTyr: 1.612 ± 0.411
0.0LeuXaa: 0.0 ± 0.0
Met
5.641MetAla: 5.641 ± 0.234
0.0MetCys: 0.0 ± 0.0
1.209MetAsp: 1.209 ± 0.007
0.403MetGlu: 0.403 ± 0.198
1.209MetPhe: 1.209 ± 0.609
1.612MetGly: 1.612 ± 0.191
0.403MetHis: 0.403 ± 0.198
0.403MetIle: 0.403 ± 0.198
0.403MetLys: 0.403 ± 0.198
2.417MetLeu: 2.417 ± 0.014
0.403MetMet: 0.403 ± 0.198
1.612MetAsn: 1.612 ± 0.191
0.403MetPro: 0.403 ± 0.403
0.806MetGln: 0.806 ± 0.396
2.82MetArg: 2.82 ± 1.019
2.015MetSer: 2.015 ± 0.389
1.612MetThr: 1.612 ± 0.793
3.223MetVal: 3.223 ± 0.22
0.806MetTrp: 0.806 ± 0.396
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.417AsnAla: 2.417 ± 0.014
0.403AsnCys: 0.403 ± 0.198
1.209AsnAsp: 1.209 ± 0.594
1.209AsnGlu: 1.209 ± 0.609
2.015AsnPhe: 2.015 ± 0.212
2.417AsnGly: 2.417 ± 0.587
1.209AsnHis: 1.209 ± 0.007
2.417AsnIle: 2.417 ± 0.014
1.209AsnLys: 1.209 ± 0.609
2.82AsnLeu: 2.82 ± 0.785
1.612AsnMet: 1.612 ± 0.191
1.612AsnAsn: 1.612 ± 0.793
4.835AsnPro: 4.835 ± 0.573
2.417AsnGln: 2.417 ± 1.189
2.015AsnArg: 2.015 ± 0.212
2.417AsnSer: 2.417 ± 0.587
0.806AsnThr: 0.806 ± 0.205
2.82AsnVal: 2.82 ± 0.184
1.612AsnTrp: 1.612 ± 0.411
2.015AsnTyr: 2.015 ± 0.991
0.0AsnXaa: 0.0 ± 0.0
Pro
8.864ProAla: 8.864 ± 1.351
1.209ProCys: 1.209 ± 0.007
5.641ProAsp: 5.641 ± 0.835
2.82ProGlu: 2.82 ± 0.184
4.835ProPhe: 4.835 ± 1.175
5.641ProGly: 5.641 ± 0.368
1.612ProHis: 1.612 ± 0.191
2.417ProIle: 2.417 ± 0.587
0.806ProLys: 0.806 ± 0.205
6.044ProLeu: 6.044 ± 0.036
1.612ProMet: 1.612 ± 0.793
4.029ProAsn: 4.029 ± 1.981
5.641ProPro: 5.641 ± 0.234
3.626ProGln: 3.626 ± 0.58
4.835ProArg: 4.835 ± 0.63
8.058ProSer: 8.058 ± 0.955
7.655ProThr: 7.655 ± 1.96
5.238ProVal: 5.238 ± 1.373
1.209ProTrp: 1.209 ± 0.609
0.403ProTyr: 0.403 ± 0.403
0.0ProXaa: 0.0 ± 0.0
Gln
3.223GlnAla: 3.223 ± 0.984
0.0GlnCys: 0.0 ± 0.0
2.015GlnAsp: 2.015 ± 0.814
1.209GlnGlu: 1.209 ± 0.007
1.209GlnPhe: 1.209 ± 0.007
2.417GlnGly: 2.417 ± 1.189
0.806GlnHis: 0.806 ± 0.396
2.015GlnIle: 2.015 ± 0.212
0.806GlnLys: 0.806 ± 0.396
3.223GlnLeu: 3.223 ± 0.821
1.209GlnMet: 1.209 ± 0.514
1.612GlnAsn: 1.612 ± 0.793
3.223GlnPro: 3.223 ± 0.382
2.82GlnGln: 2.82 ± 0.184
2.015GlnArg: 2.015 ± 0.212
2.417GlnSer: 2.417 ± 0.616
2.417GlnThr: 2.417 ± 0.014
2.82GlnVal: 2.82 ± 0.785
0.806GlnTrp: 0.806 ± 0.807
2.015GlnTyr: 2.015 ± 1.416
0.0GlnXaa: 0.0 ± 0.0
Arg
6.446ArgAla: 6.446 ± 0.439
0.0ArgCys: 0.0 ± 0.0
2.82ArgAsp: 2.82 ± 0.785
2.417ArgGlu: 2.417 ± 0.587
3.223ArgPhe: 3.223 ± 0.382
2.417ArgGly: 2.417 ± 1.217
1.612ArgHis: 1.612 ± 1.012
3.626ArgIle: 3.626 ± 1.182
2.015ArgLys: 2.015 ± 1.416
5.238ArgLeu: 5.238 ± 1.635
2.417ArgMet: 2.417 ± 0.014
2.82ArgAsn: 2.82 ± 2.222
3.223ArgPro: 3.223 ± 0.22
2.015ArgGln: 2.015 ± 0.389
6.446ArgArg: 6.446 ± 0.439
4.432ArgSer: 4.432 ± 0.227
3.223ArgThr: 3.223 ± 0.821
4.029ArgVal: 4.029 ± 1.026
1.612ArgTrp: 1.612 ± 0.191
4.029ArgTyr: 4.029 ± 0.778
0.0ArgXaa: 0.0 ± 0.0
Ser
4.432SerAla: 4.432 ± 0.976
2.417SerCys: 2.417 ± 0.587
6.446SerAsp: 6.446 ± 0.764
2.417SerGlu: 2.417 ± 0.616
2.015SerPhe: 2.015 ± 0.814
4.835SerGly: 4.835 ± 0.029
0.403SerHis: 0.403 ± 0.403
4.835SerIle: 4.835 ± 0.63
2.82SerLys: 2.82 ± 1.019
10.475SerLeu: 10.475 ± 0.339
2.417SerMet: 2.417 ± 0.587
2.417SerAsn: 2.417 ± 0.014
5.641SerPro: 5.641 ± 1.571
4.432SerGln: 4.432 ± 0.227
5.641SerArg: 5.641 ± 0.969
5.238SerSer: 5.238 ± 2.237
6.446SerThr: 6.446 ± 1.366
2.417SerVal: 2.417 ± 0.587
2.82SerTrp: 2.82 ± 0.184
4.029SerTyr: 4.029 ± 0.425
0.0SerXaa: 0.0 ± 0.0
Thr
3.223ThrAla: 3.223 ± 0.984
1.209ThrCys: 1.209 ± 0.007
4.835ThrAsp: 4.835 ± 1.776
1.209ThrGlu: 1.209 ± 0.609
2.015ThrPhe: 2.015 ± 0.389
2.015ThrGly: 2.015 ± 0.389
1.612ThrHis: 1.612 ± 0.793
4.432ThrIle: 4.432 ± 0.227
0.403ThrLys: 0.403 ± 0.403
4.432ThrLeu: 4.432 ± 0.976
0.806ThrMet: 0.806 ± 0.396
2.82ThrAsn: 2.82 ± 0.418
9.67ThrPro: 9.67 ± 0.659
1.209ThrGln: 1.209 ± 0.594
5.641ThrArg: 5.641 ± 0.835
5.641ThrSer: 5.641 ± 0.234
6.446ThrThr: 6.446 ± 0.439
5.238ThrVal: 5.238 ± 0.771
1.612ThrTrp: 1.612 ± 0.411
4.029ThrTyr: 4.029 ± 0.425
0.0ThrXaa: 0.0 ± 0.0
Val
6.849ValAla: 6.849 ± 0.361
0.806ValCys: 0.806 ± 0.396
4.029ValAsp: 4.029 ± 1.38
1.209ValGlu: 1.209 ± 0.007
2.015ValPhe: 2.015 ± 0.212
4.029ValGly: 4.029 ± 0.425
1.209ValHis: 1.209 ± 0.609
3.626ValIle: 3.626 ± 0.021
1.612ValLys: 1.612 ± 1.012
4.432ValLeu: 4.432 ± 0.976
1.612ValMet: 1.612 ± 1.012
3.626ValAsn: 3.626 ± 1.783
4.432ValPro: 4.432 ± 1.578
3.223ValGln: 3.223 ± 0.984
3.223ValArg: 3.223 ± 0.821
5.641ValSer: 5.641 ± 0.969
6.044ValThr: 6.044 ± 0.036
4.432ValVal: 4.432 ± 0.976
2.015ValTrp: 2.015 ± 1.416
2.417ValTyr: 2.417 ± 0.587
0.0ValXaa: 0.0 ± 0.0
Trp
2.82TrpAla: 2.82 ± 0.418
0.0TrpCys: 0.0 ± 0.0
2.015TrpAsp: 2.015 ± 0.814
0.806TrpGlu: 0.806 ± 0.205
0.403TrpPhe: 0.403 ± 0.198
1.612TrpGly: 1.612 ± 0.411
1.209TrpHis: 1.209 ± 0.007
2.015TrpIle: 2.015 ± 0.389
0.0TrpLys: 0.0 ± 0.0
1.209TrpLeu: 1.209 ± 0.007
0.806TrpMet: 0.806 ± 0.205
2.015TrpAsn: 2.015 ± 0.389
1.612TrpPro: 1.612 ± 1.614
0.0TrpGln: 0.0 ± 0.0
1.209TrpArg: 1.209 ± 0.609
2.82TrpSer: 2.82 ± 1.019
0.806TrpThr: 0.806 ± 0.205
2.015TrpVal: 2.015 ± 0.389
0.403TrpTrp: 0.403 ± 0.198
0.403TrpTyr: 0.403 ± 0.403
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.82TyrAla: 2.82 ± 0.418
0.403TyrCys: 0.403 ± 0.403
0.806TyrAsp: 0.806 ± 0.396
1.612TyrGlu: 1.612 ± 0.191
0.403TyrPhe: 0.403 ± 0.403
3.223TyrGly: 3.223 ± 0.984
1.209TyrHis: 1.209 ± 0.007
2.015TyrIle: 2.015 ± 0.389
0.806TyrLys: 0.806 ± 0.205
2.417TyrLeu: 2.417 ± 1.217
1.209TyrMet: 1.209 ± 0.419
0.806TyrAsn: 0.806 ± 0.205
4.029TyrPro: 4.029 ± 0.177
2.417TyrGln: 2.417 ± 0.014
2.417TyrArg: 2.417 ± 0.616
2.015TyrSer: 2.015 ± 0.212
2.015TyrThr: 2.015 ± 0.991
2.417TyrVal: 2.417 ± 1.819
0.806TyrTrp: 0.806 ± 0.396
0.403TyrTyr: 0.403 ± 0.403
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2483 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
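A protein-level bootstrap can be sketched as follows. This is a generic illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the fixed seed, and the use of the population standard deviation are choices made for this example.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per-mille frequency of one dipeptide over all adjacent pairs in the proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.pstdev(estimates)


toy = ["AAAA", "CCCC"]  # made-up sequences, not the virus proteome
error_aa = bootstrap_error(toy, "AA")
```

With only two proteins, as in this proteome, each replicate can contain only a few distinct combinations of proteins, so error estimates obtained this way are necessarily coarse.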

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: UniProt
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski