Amino acid dipeptide frequency for Hubei permutotetra-like virus 5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
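
As an illustration, the sketch below shows one way such a dipeptide frequency could be computed. It is not the code used by the database: the function name dipeptide_permille, the one-letter input sequences, and the choice of overlapping two-residue windows that never span two proteins are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Hypothetical helper: the counting scheme is an assumption, not the
    documented procedure behind the table.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2; windows never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


# Two made-up toy sequences, not taken from the proteome.
freqs = dipeptide_permille(["MKVLA", "MKB"])
```

Here the toy input yields six dipeptides in total, two of which are MK, so MK accounts for one third of all dipeptides.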

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
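
The expected value and the over/under distinction can be sketched as follows. The page does not state the exact threshold used for colouring, so a strict comparison against the uniform expectation is assumed here, and the names expected_permille and classify are hypothetical.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation in per mille: 1000 / (number of possible dipeptides)."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"    # would be marked in red
    if observed_permille < expected:
        return "under"   # would be marked in blue
    return "expected"


# AlaAla (7.013) and AlaMet (0.701) are values from the table.
labels = {name: classify(value) for name, value in [("AlaAla", 7.013), ("AlaMet", 0.701)]}
```

With 20 amino acids the expectation is 2.5‰; counting Xaa as a 21st residue lowers it to roughly 2.27‰.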

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions. For example, 7.013‰ corresponds to a fraction of 0.007013 (0.7013%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.013 ± 0.874
AlaCys: 1.403 ± 0.61
AlaAsp: 2.104 ± 0.261
AlaGlu: 4.208 ± 1.829
AlaPhe: 1.403 ± 0.61
AlaGly: 5.61 ± 0.176
AlaHis: 2.104 ± 0.261
AlaIle: 4.909 ± 1.135
AlaLys: 2.104 ± 0.261
AlaLeu: 3.506 ± 2.178
AlaMet: 0.701 ± 0.959
AlaAsn: 0.701 ± 0.349
AlaPro: 2.805 ± 1.395
AlaGln: 3.506 ± 0.437
AlaArg: 2.805 ± 1.219
AlaSer: 4.208 ± 2.093
AlaThr: 4.909 ± 2.442
AlaVal: 5.61 ± 0.176
AlaTrp: 1.403 ± 0.61
AlaTyr: 2.805 ± 0.088
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.701 ± 0.959
CysCys: 0.0 ± 0.0
CysAsp: 1.403 ± 1.917
CysGlu: 0.0 ± 0.0
CysPhe: 1.403 ± 0.61
CysGly: 0.701 ± 0.349
CysHis: 0.0 ± 0.0
CysIle: 1.403 ± 0.698
CysLys: 0.0 ± 0.0
CysLeu: 2.805 ± 1.395
CysMet: 0.0 ± 0.0
CysAsn: 1.403 ± 0.61
CysPro: 1.403 ± 1.917
CysGln: 0.0 ± 0.0
CysArg: 1.403 ± 0.698
CysSer: 0.0 ± 0.0
CysThr: 0.701 ± 0.349
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.701 ± 0.349
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.403 ± 0.61
AspCys: 2.104 ± 1.568
AspAsp: 2.104 ± 1.047
AspGlu: 2.104 ± 1.047
AspPhe: 2.805 ± 1.395
AspGly: 6.311 ± 3.397
AspHis: 0.701 ± 0.349
AspIle: 1.403 ± 0.698
AspLys: 2.104 ± 1.568
AspLeu: 8.415 ± 2.879
AspMet: 2.104 ± 1.568
AspAsn: 2.104 ± 1.047
AspPro: 7.013 ± 0.874
AspGln: 2.104 ± 0.261
AspArg: 3.506 ± 0.437
AspSer: 0.701 ± 0.349
AspThr: 4.208 ± 0.786
AspVal: 2.805 ± 1.395
AspTrp: 0.0 ± 0.0
AspTyr: 3.506 ± 0.437
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.506 ± 0.871
GluCys: 1.403 ± 0.698
GluAsp: 3.506 ± 0.437
GluGlu: 6.311 ± 1.832
GluPhe: 1.403 ± 0.698
GluGly: 1.403 ± 0.61
GluHis: 2.104 ± 0.261
GluIle: 4.909 ± 0.173
GluLys: 1.403 ± 0.698
GluLeu: 7.714 ± 2.53
GluMet: 1.403 ± 1.917
GluAsn: 2.805 ± 1.219
GluPro: 3.506 ± 0.437
GluGln: 2.805 ± 1.395
GluArg: 2.104 ± 1.047
GluSer: 1.403 ± 0.698
GluThr: 4.208 ± 2.093
GluVal: 4.208 ± 0.522
GluTrp: 0.701 ± 0.959
GluTyr: 0.701 ± 0.349
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.805 ± 0.088
PheCys: 0.701 ± 0.349
PheAsp: 2.805 ± 0.088
PheGlu: 1.403 ± 0.61
PhePhe: 2.805 ± 1.219
PheGly: 2.805 ± 0.088
PheHis: 0.701 ± 0.349
PheIle: 0.701 ± 0.349
PheLys: 3.506 ± 0.871
PheLeu: 4.909 ± 0.173
PheMet: 0.701 ± 0.3
PheAsn: 1.403 ± 0.61
PhePro: 2.104 ± 1.047
PheGln: 2.805 ± 1.395
PheArg: 1.403 ± 0.61
PheSer: 4.208 ± 0.522
PheThr: 2.805 ± 1.219
PheVal: 0.701 ± 0.349
PheTrp: 0.701 ± 0.349
PheTyr: 2.104 ± 1.047
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.909 ± 1.48
GlyCys: 0.0 ± 0.0
GlyAsp: 2.104 ± 1.568
GlyGlu: 5.61 ± 1.483
GlyPhe: 6.311 ± 0.525
GlyGly: 4.208 ± 1.829
GlyHis: 0.701 ± 0.959
GlyIle: 4.208 ± 1.829
GlyLys: 5.61 ± 0.176
GlyLeu: 3.506 ± 0.437
GlyMet: 0.0 ± 0.0
GlyAsn: 1.403 ± 0.698
GlyPro: 1.403 ± 0.61
GlyGln: 1.403 ± 0.698
GlyArg: 4.208 ± 0.786
GlySer: 4.208 ± 1.829
GlyThr: 6.311 ± 0.525
GlyVal: 1.403 ± 0.698
GlyTrp: 0.701 ± 0.959
GlyTyr: 0.701 ± 0.959
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.701 ± 0.959
HisAsp: 0.701 ± 0.959
HisGlu: 0.0 ± 0.0
HisPhe: 1.403 ± 0.61
HisGly: 2.104 ± 0.261
HisHis: 0.701 ± 0.349
HisIle: 0.701 ± 0.349
HisLys: 3.506 ± 0.437
HisLeu: 2.805 ± 1.219
HisMet: 1.403 ± 0.61
HisAsn: 0.701 ± 0.349
HisPro: 2.805 ± 1.219
HisGln: 0.701 ± 0.349
HisArg: 0.701 ± 0.959
HisSer: 0.0 ± 0.0
HisThr: 0.701 ± 0.959
HisVal: 0.701 ± 0.959
HisTrp: 1.403 ± 0.698
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.506 ± 1.744
IleCys: 0.0 ± 0.0
IleAsp: 4.208 ± 0.522
IleGlu: 4.208 ± 1.829
IlePhe: 0.701 ± 0.349
IleGly: 2.104 ± 0.261
IleHis: 2.104 ± 1.047
IleIle: 4.909 ± 1.135
IleLys: 4.208 ± 0.786
IleLeu: 3.506 ± 0.871
IleMet: 3.506 ± 0.871
IleAsn: 1.403 ± 0.698
IlePro: 4.909 ± 0.173
IleGln: 3.506 ± 0.437
IleArg: 2.805 ± 1.395
IleSer: 2.104 ± 2.876
IleThr: 4.208 ± 2.093
IleVal: 7.714 ± 1.223
IleTrp: 0.701 ± 0.349
IleTyr: 3.506 ± 2.178
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.506 ± 0.437
LysCys: 1.403 ± 0.61
LysAsp: 5.61 ± 1.483
LysGlu: 1.403 ± 0.61
LysPhe: 3.506 ± 0.437
LysGly: 3.506 ± 0.437
LysHis: 1.403 ± 1.917
LysIle: 2.104 ± 1.568
LysLys: 4.909 ± 0.173
LysLeu: 4.208 ± 0.786
LysMet: 0.701 ± 0.349
LysAsn: 1.403 ± 0.698
LysPro: 5.61 ± 1.131
LysGln: 4.208 ± 0.786
LysArg: 4.909 ± 1.135
LysSer: 4.909 ± 1.135
LysThr: 4.909 ± 0.173
LysVal: 4.909 ± 0.173
LysTrp: 1.403 ± 0.61
LysTyr: 2.104 ± 1.047
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.208 ± 0.522
LeuCys: 0.701 ± 0.349
LeuAsp: 4.208 ± 0.786
LeuGlu: 7.714 ± 0.085
LeuPhe: 4.909 ± 1.135
LeuGly: 4.909 ± 0.173
LeuHis: 1.403 ± 0.698
LeuIle: 4.208 ± 0.522
LeuLys: 6.311 ± 0.525
LeuLeu: 1.403 ± 0.698
LeuMet: 2.805 ± 0.946
LeuAsn: 4.208 ± 1.829
LeuPro: 4.909 ± 2.442
LeuGln: 4.208 ± 0.786
LeuArg: 2.104 ± 0.261
LeuSer: 4.909 ± 1.48
LeuThr: 4.909 ± 1.48
LeuVal: 5.61 ± 1.483
LeuTrp: 0.701 ± 0.349
LeuTyr: 1.403 ± 0.698
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.701 ± 0.349
MetCys: 0.0 ± 0.0
MetAsp: 2.805 ± 1.395
MetGlu: 0.701 ± 0.349
MetPhe: 0.0 ± 0.0
MetGly: 0.701 ± 0.349
MetHis: 0.701 ± 0.349
MetIle: 1.403 ± 0.698
MetLys: 2.104 ± 1.568
MetLeu: 0.701 ± 0.349
MetMet: 0.0 ± 0.0
MetAsn: 1.403 ± 0.698
MetPro: 2.805 ± 1.219
MetGln: 0.0 ± 0.0
MetArg: 3.506 ± 0.437
MetSer: 2.104 ± 1.047
MetThr: 0.701 ± 0.349
MetVal: 3.506 ± 3.485
MetTrp: 0.701 ± 0.349
MetTyr: 2.104 ± 1.047
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.805 ± 1.395
AsnCys: 0.0 ± 0.0
AsnAsp: 0.701 ± 0.349
AsnGlu: 0.701 ± 0.349
AsnPhe: 3.506 ± 0.437
AsnGly: 1.403 ± 0.61
AsnHis: 0.701 ± 0.349
AsnIle: 1.403 ± 1.917
AsnLys: 2.805 ± 1.395
AsnLeu: 1.403 ± 0.698
AsnMet: 2.104 ± 1.047
AsnAsn: 2.104 ± 1.568
AsnPro: 2.805 ± 0.088
AsnGln: 2.104 ± 0.261
AsnArg: 4.909 ± 1.48
AsnSer: 6.311 ± 2.09
AsnThr: 2.805 ± 0.088
AsnVal: 3.506 ± 0.437
AsnTrp: 0.0 ± 0.0
AsnTyr: 1.403 ± 0.61
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.506 ± 2.178
ProCys: 0.0 ± 0.0
ProAsp: 6.311 ± 0.782
ProGlu: 6.311 ± 1.832
ProPhe: 1.403 ± 0.61
ProGly: 7.013 ± 1.741
ProHis: 2.104 ± 1.047
ProIle: 3.506 ± 0.437
ProLys: 3.506 ± 0.437
ProLeu: 3.506 ± 0.437
ProMet: 1.403 ± 0.698
ProAsn: 2.104 ± 1.047
ProPro: 2.805 ± 1.219
ProGln: 1.403 ± 0.61
ProArg: 1.403 ± 0.698
ProSer: 4.909 ± 0.173
ProThr: 5.61 ± 2.439
ProVal: 2.805 ± 0.088
ProTrp: 0.0 ± 0.0
ProTyr: 2.104 ± 1.047
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.403 ± 0.61
GlnCys: 0.701 ± 0.349
GlnAsp: 2.805 ± 0.088
GlnGlu: 4.208 ± 2.093
GlnPhe: 1.403 ± 0.61
GlnGly: 1.403 ± 0.698
GlnHis: 0.701 ± 0.959
GlnIle: 4.208 ± 0.522
GlnLys: 3.506 ± 1.744
GlnLeu: 5.61 ± 0.176
GlnMet: 2.104 ± 1.047
GlnAsn: 1.403 ± 0.61
GlnPro: 1.403 ± 0.698
GlnGln: 3.506 ± 0.871
GlnArg: 2.104 ± 1.047
GlnSer: 0.0 ± 0.0
GlnThr: 2.104 ± 1.568
GlnVal: 2.805 ± 0.088
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.701 ± 0.349
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.104 ± 0.261
ArgCys: 0.701 ± 0.349
ArgAsp: 2.104 ± 1.047
ArgGlu: 2.104 ± 1.047
ArgPhe: 1.403 ± 0.698
ArgGly: 2.104 ± 1.047
ArgHis: 0.0 ± 0.0
ArgIle: 4.208 ± 0.786
ArgLys: 2.104 ± 1.047
ArgLeu: 4.909 ± 2.788
ArgMet: 2.805 ± 1.395
ArgAsn: 8.415 ± 0.264
ArgPro: 2.805 ± 1.219
ArgGln: 1.403 ± 0.698
ArgArg: 4.208 ± 0.522
ArgSer: 2.104 ± 0.261
ArgThr: 4.208 ± 0.786
ArgVal: 5.61 ± 1.483
ArgTrp: 1.403 ± 0.61
ArgTyr: 2.805 ± 1.219
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.013 ± 0.874
SerCys: 0.701 ± 0.349
SerAsp: 4.208 ± 0.786
SerGlu: 0.0 ± 0.0
SerPhe: 0.701 ± 0.959
SerGly: 2.805 ± 0.088
SerHis: 1.403 ± 1.917
SerIle: 3.506 ± 1.744
SerLys: 3.506 ± 0.871
SerLeu: 4.208 ± 0.522
SerMet: 0.701 ± 0.349
SerAsn: 4.909 ± 1.135
SerPro: 2.805 ± 1.219
SerGln: 2.805 ± 1.395
SerArg: 5.61 ± 1.131
SerSer: 4.909 ± 1.48
SerThr: 2.104 ± 1.568
SerVal: 6.311 ± 2.09
SerTrp: 0.701 ± 0.349
SerTyr: 2.104 ± 0.261
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.61 ± 1.483
ThrCys: 0.701 ± 0.349
ThrAsp: 2.104 ± 1.047
ThrGlu: 4.208 ± 1.829
ThrPhe: 2.805 ± 0.088
ThrGly: 5.61 ± 0.176
ThrHis: 0.701 ± 0.349
ThrIle: 8.415 ± 0.264
ThrLys: 4.909 ± 1.48
ThrLeu: 2.805 ± 0.088
ThrMet: 0.701 ± 0.349
ThrAsn: 2.104 ± 1.568
ThrPro: 1.403 ± 0.698
ThrGln: 2.104 ± 0.261
ThrArg: 3.506 ± 1.744
ThrSer: 7.714 ± 0.085
ThrThr: 4.909 ± 1.48
ThrVal: 4.909 ± 2.788
ThrTrp: 0.701 ± 0.349
ThrTyr: 0.701 ± 0.349
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.61 ± 0.176
ValCys: 2.104 ± 0.261
ValAsp: 4.909 ± 2.442
ValGlu: 4.909 ± 2.442
ValPhe: 3.506 ± 0.871
ValGly: 1.403 ± 0.61
ValHis: 2.104 ± 2.876
ValIle: 3.506 ± 0.871
ValLys: 4.909 ± 0.173
ValLeu: 8.415 ± 0.264
ValMet: 1.403 ± 0.698
ValAsn: 2.104 ± 0.261
ValPro: 5.61 ± 0.176
ValGln: 3.506 ± 3.485
ValArg: 2.104 ± 0.261
ValSer: 3.506 ± 0.437
ValThr: 4.208 ± 1.829
ValVal: 6.311 ± 0.525
ValTrp: 0.701 ± 0.349
ValTyr: 2.805 ± 1.395
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.403 ± 0.698
TrpCys: 0.701 ± 0.959
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.701 ± 0.349
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.403 ± 0.698
TrpLys: 1.403 ± 0.61
TrpLeu: 0.701 ± 0.349
TrpMet: 1.403 ± 0.698
TrpAsn: 0.701 ± 0.959
TrpPro: 0.0 ± 0.0
TrpGln: 0.0 ± 0.0
TrpArg: 2.805 ± 0.088
TrpSer: 0.701 ± 0.349
TrpThr: 0.0 ± 0.0
TrpVal: 0.0 ± 0.0
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.701 ± 0.959
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.104 ± 0.261
TyrCys: 0.0 ± 0.0
TyrAsp: 2.805 ± 1.219
TyrGlu: 0.701 ± 0.349
TyrPhe: 0.701 ± 0.349
TyrGly: 2.805 ± 0.088
TyrHis: 1.403 ± 1.917
TyrIle: 2.805 ± 0.088
TyrLys: 4.208 ± 2.093
TyrLeu: 1.403 ± 0.698
TyrMet: 0.0 ± 0.0
TyrAsn: 0.701 ± 0.349
TyrPro: 2.805 ± 0.088
TyrGln: 0.0 ± 0.0
TyrArg: 1.403 ± 0.698
TyrSer: 2.104 ± 0.261
TyrThr: 2.104 ± 1.047
TyrVal: 4.208 ± 0.522
TyrTrp: 0.701 ± 0.349
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1427 amino acids)

Note: The error has been estimated by bootstrapping (100 resamplings) at the protein level.
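
A minimal sketch of what protein-level bootstrapping could look like is given below. The page does not describe the exact procedure, so several details are assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the error is taken as the population standard deviation of those estimates. The names permille and bootstrap_error are hypothetical.

```python
import random
import statistics
from collections import Counter


def permille(proteins, dipeptide):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 within each protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over resampled proteomes (assumed procedure)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(permille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Made-up toy sequences, not taken from the proteome.
error = bootstrap_error(["MKVLA", "MKAAK"], "MK")
```

With only two proteins, as in this proteome, each resample can contain just a few distinct combinations of proteins, so the resulting error estimates are coarse.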

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski