Amino acid dipeptide frequency for Hubei permutotetra-like virus 3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 amino acid dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
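The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build this page: the function names, the toy sequences, and the decision to skip non-standard residues (rather than map them to Xaa) are assumptions made for brevity. Dipeptides are counted as overlapping windows of two residues within each protein.

```python
from collections import Counter
from itertools import product

# The 20 standard residues as one-letter codes (Xaa is left out of this sketch).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1000 per mille spread over 400 dipeptides = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            pair = seq[i:i + 2]
            # Assumption: pairs containing a non-standard residue are skipped.
            if pair[0] in AMINO_ACIDS and pair[1] in AMINO_ACIDS:
                counts[pair] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(AMINO_ACIDS, repeat=2)
    }


def classify(freq_per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > EXPECTED_PER_MILLE:
        return "over"   # would be marked red in the table
    if freq_per_mille < EXPECTED_PER_MILLE:
        return "under"  # would be marked blue in the table
    return "expected"


# Hypothetical toy sequences, not taken from this proteome.
toy = ["MKKLAA", "AAKL"]
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], classify(freqs["AA"]))
print(freqs["WW"], classify(freqs["WW"]))
```

Comparing against a flat 2.5‰ is the simplest baseline; a baseline built from the single amino acid frequencies of the proteome would be a different, stricter test.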

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
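As a small worked example of the unit conversion, the helper names below are hypothetical and only illustrate the arithmetic; the AlaAla value is the one listed in the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0


def per_mille_to_percent(value):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value / 10.0


ala_ala = 7.682  # AlaAla frequency from the table, in per mille
print(per_mille_to_fraction(ala_ala))
print(per_mille_to_percent(ala_ala))
```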

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.682AlaAla: 7.682 ± 2.328
0.698AlaCys: 0.698 ± 1.388
2.793AlaAsp: 2.793 ± 1.458
6.983AlaGlu: 6.983 ± 2.553
3.492AlaPhe: 3.492 ± 2.31
4.19AlaGly: 4.19 ± 2.145
0.698AlaHis: 0.698 ± 1.388
6.983AlaIle: 6.983 ± 1.014
6.285AlaLys: 6.285 ± 2.235
9.078AlaLeu: 9.078 ± 2.158
1.397AlaMet: 1.397 ± 0.729
0.698AlaAsn: 0.698 ± 1.388
5.587AlaPro: 5.587 ± 4.76
1.397AlaGln: 1.397 ± 1.19
4.19AlaArg: 4.19 ± 5.137
5.587AlaSer: 5.587 ± 0.337
5.587AlaThr: 5.587 ± 2.248
5.587AlaVal: 5.587 ± 2.144
0.698AlaTrp: 0.698 ± 1.396
2.793AlaTyr: 2.793 ± 2.344
0.0AlaXaa: 0.0 ± 0.0
Cys
0.698CysAla: 0.698 ± 1.388
0.0CysCys: 0.0 ± 0.0
0.698CysAsp: 0.698 ± 1.388
0.698CysGlu: 0.698 ± 0.365
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.698CysHis: 0.698 ± 0.365
0.0CysIle: 0.0 ± 0.0
2.095CysLys: 2.095 ± 1.094
0.0CysLeu: 0.0 ± 0.0
0.698CysMet: 0.698 ± 0.365
0.698CysAsn: 0.698 ± 0.365
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
2.095CysArg: 2.095 ± 1.042
1.397CysSer: 1.397 ± 2.776
0.0CysThr: 0.0 ± 0.0
1.397CysVal: 1.397 ± 0.729
0.0CysTrp: 0.0 ± 0.0
1.397CysTyr: 1.397 ± 0.729
0.0CysXaa: 0.0 ± 0.0
Asp
4.19AspAla: 4.19 ± 1.349
0.0AspCys: 0.0 ± 0.0
2.793AspAsp: 2.793 ± 1.458
2.095AspGlu: 2.095 ± 1.072
2.793AspPhe: 2.793 ± 1.458
8.38AspGly: 8.38 ± 1.099
0.0AspHis: 0.0 ± 0.0
0.698AspIle: 0.698 ± 0.365
1.397AspLys: 1.397 ± 0.729
3.492AspLeu: 3.492 ± 1.823
1.397AspMet: 1.397 ± 0.85
2.095AspAsn: 2.095 ± 1.094
4.19AspPro: 4.19 ± 2.188
2.793AspGln: 2.793 ± 1.072
4.19AspArg: 4.19 ± 2.188
2.793AspSer: 2.793 ± 1.458
3.492AspThr: 3.492 ± 1.823
1.397AspVal: 1.397 ± 0.729
1.397AspTrp: 1.397 ± 0.729
1.397AspTyr: 1.397 ± 0.729
0.0AspXaa: 0.0 ± 0.0
Glu
5.587GluAla: 5.587 ± 2.917
0.698GluCys: 0.698 ± 0.365
2.095GluAsp: 2.095 ± 1.094
4.19GluGlu: 4.19 ± 1.395
1.397GluPhe: 1.397 ± 1.19
2.095GluGly: 2.095 ± 1.042
2.095GluHis: 2.095 ± 1.042
1.397GluIle: 1.397 ± 0.729
4.888GluLys: 4.888 ± 2.552
4.19GluLeu: 4.19 ± 1.395
1.397GluMet: 1.397 ± 0.729
0.698GluAsn: 0.698 ± 0.365
3.492GluPro: 3.492 ± 1.19
4.888GluGln: 4.888 ± 1.611
3.492GluArg: 3.492 ± 1.823
2.095GluSer: 2.095 ± 1.072
4.19GluThr: 4.19 ± 2.085
6.285GluVal: 6.285 ± 2.149
0.698GluTrp: 0.698 ± 0.365
5.587GluTyr: 5.587 ± 1.443
0.0GluXaa: 0.0 ± 0.0
Phe
3.492PheAla: 3.492 ± 2.236
0.698PheCys: 0.698 ± 0.365
4.888PheAsp: 4.888 ± 2.552
2.095PheGlu: 2.095 ± 1.042
1.397PhePhe: 1.397 ± 1.853
2.095PheGly: 2.095 ± 1.094
0.0PheHis: 0.0 ± 0.0
4.19PheIle: 4.19 ± 5.137
3.492PheLys: 3.492 ± 3.709
3.492PheLeu: 3.492 ± 2.236
1.397PheMet: 1.397 ± 0.729
0.698PheAsn: 0.698 ± 0.365
0.698PhePro: 0.698 ± 0.365
1.397PheGln: 1.397 ± 0.729
2.793PheArg: 2.793 ± 2.344
4.19PheSer: 4.19 ± 1.965
1.397PheThr: 1.397 ± 1.172
2.793PheVal: 2.793 ± 1.072
0.698PheTrp: 0.698 ± 0.365
0.698PheTyr: 0.698 ± 1.396
0.0PheXaa: 0.0 ± 0.0
Gly
4.888GlyAla: 4.888 ± 1.659
0.0GlyCys: 0.0 ± 0.0
4.19GlyAsp: 4.19 ± 1.395
2.793GlyGlu: 2.793 ± 1.124
0.698GlyPhe: 0.698 ± 0.365
3.492GlyGly: 3.492 ± 2.188
0.0GlyHis: 0.0 ± 0.0
2.095GlyIle: 2.095 ± 1.072
1.397GlyLys: 1.397 ± 0.729
7.682GlyLeu: 7.682 ± 4.367
1.397GlyMet: 1.397 ± 0.729
0.698GlyAsn: 0.698 ± 0.365
1.397GlyPro: 1.397 ± 0.729
0.0GlyGln: 0.0 ± 0.0
2.095GlyArg: 2.095 ± 1.042
5.587GlySer: 5.587 ± 1.951
3.492GlyThr: 3.492 ± 1.19
4.888GlyVal: 4.888 ± 0.047
0.0GlyTrp: 0.0 ± 0.0
0.698GlyTyr: 0.698 ± 0.365
0.0GlyXaa: 0.0 ± 0.0
His
2.095HisAla: 2.095 ± 1.042
0.0HisCys: 0.0 ± 0.0
0.698HisAsp: 0.698 ± 0.365
2.095HisGlu: 2.095 ± 1.094
0.698HisPhe: 0.698 ± 0.365
1.397HisGly: 1.397 ± 0.729
0.698HisHis: 0.698 ± 1.388
0.0HisIle: 0.0 ± 0.0
0.0HisLys: 0.0 ± 0.0
3.492HisLeu: 3.492 ± 2.188
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
2.095HisPro: 2.095 ± 1.072
0.698HisGln: 0.698 ± 0.365
0.0HisArg: 0.0 ± 0.0
2.095HisSer: 2.095 ± 2.544
0.0HisThr: 0.0 ± 0.0
1.397HisVal: 1.397 ± 1.19
0.0HisTrp: 0.0 ± 0.0
1.397HisTyr: 1.397 ± 0.729
0.0HisXaa: 0.0 ± 0.0
Ile
6.983IleAla: 6.983 ± 1.52
0.0IleCys: 0.0 ± 0.0
2.095IleAsp: 2.095 ± 1.094
2.095IleGlu: 2.095 ± 1.042
3.492IlePhe: 3.492 ± 3.342
0.698IleGly: 0.698 ± 0.365
0.698IleHis: 0.698 ± 0.365
4.888IleIle: 4.888 ± 1.659
3.492IleLys: 3.492 ± 1.19
4.888IleLeu: 4.888 ± 2.042
0.698IleMet: 0.698 ± 0.365
2.095IleAsn: 2.095 ± 1.094
2.095IlePro: 2.095 ± 1.072
2.095IleGln: 2.095 ± 1.094
2.793IleArg: 2.793 ± 2.344
2.095IleSer: 2.095 ± 1.489
4.19IleThr: 4.19 ± 1.349
4.19IleVal: 4.19 ± 1.997
0.0IleTrp: 0.0 ± 0.0
0.698IleTyr: 0.698 ± 1.388
0.0IleXaa: 0.0 ± 0.0
Lys
4.888LysAla: 4.888 ± 1.656
0.698LysCys: 0.698 ± 0.365
4.888LysAsp: 4.888 ± 1.611
6.285LysGlu: 6.285 ± 3.281
2.095LysPhe: 2.095 ± 1.042
2.793LysGly: 2.793 ± 1.458
2.793LysHis: 2.793 ± 1.458
0.698LysIle: 0.698 ± 1.388
8.38LysLys: 8.38 ± 4.375
3.492LysLeu: 3.492 ± 2.236
1.397LysMet: 1.397 ± 0.729
4.19LysAsn: 4.19 ± 2.188
2.793LysPro: 2.793 ± 1.072
2.793LysGln: 2.793 ± 1.458
5.587LysArg: 5.587 ± 3.299
2.793LysSer: 2.793 ± 1.032
4.888LysThr: 4.888 ± 3.418
3.492LysVal: 3.492 ± 2.188
1.397LysTrp: 1.397 ± 0.729
2.793LysTyr: 2.793 ± 1.072
0.0LysXaa: 0.0 ± 0.0
Leu
3.492LeuAla: 3.492 ± 0.76
0.0LeuCys: 0.0 ± 0.0
2.793LeuAsp: 2.793 ± 1.458
6.983LeuGlu: 6.983 ± 1.065
4.888LeuPhe: 4.888 ± 1.703
4.888LeuGly: 4.888 ± 4.938
1.397LeuHis: 1.397 ± 0.729
4.888LeuIle: 4.888 ± 2.042
9.777LeuLys: 9.777 ± 0.093
9.777LeuLeu: 9.777 ± 8.061
3.492LeuMet: 3.492 ± 1.823
4.888LeuAsn: 4.888 ± 0.047
4.888LeuPro: 4.888 ± 2.042
4.888LeuGln: 4.888 ± 0.047
5.587LeuArg: 5.587 ± 3.299
9.777LeuSer: 9.777 ± 8.289
4.19LeuThr: 4.19 ± 1.965
4.19LeuVal: 4.19 ± 3.648
0.698LeuTrp: 0.698 ± 1.396
5.587LeuTyr: 5.587 ± 3.034
0.0LeuXaa: 0.0 ± 0.0
Met
2.793MetAla: 2.793 ± 1.458
0.698MetCys: 0.698 ± 0.365
1.397MetAsp: 1.397 ± 0.729
2.793MetGlu: 2.793 ± 1.458
1.397MetPhe: 1.397 ± 1.19
0.0MetGly: 0.0 ± 0.0
0.698MetHis: 0.698 ± 0.365
1.397MetIle: 1.397 ± 0.729
1.397MetLys: 1.397 ± 0.729
1.397MetLeu: 1.397 ± 1.19
1.397MetMet: 1.397 ± 0.729
2.095MetAsn: 2.095 ± 1.042
3.492MetPro: 3.492 ± 1.19
1.397MetGln: 1.397 ± 1.172
0.698MetArg: 0.698 ± 0.365
0.0MetSer: 0.0 ± 0.0
2.793MetThr: 2.793 ± 1.124
0.698MetVal: 0.698 ± 1.396
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
4.19AsnAla: 4.19 ± 1.395
0.698AsnCys: 0.698 ± 1.388
1.397AsnAsp: 1.397 ± 0.729
2.793AsnGlu: 2.793 ± 1.458
0.698AsnPhe: 0.698 ± 1.396
2.793AsnGly: 2.793 ± 1.458
0.0AsnHis: 0.0 ± 0.0
2.095AsnIle: 2.095 ± 1.042
2.095AsnLys: 2.095 ± 1.094
1.397AsnLeu: 1.397 ± 2.776
1.397AsnMet: 1.397 ± 0.729
2.793AsnAsn: 2.793 ± 1.124
2.095AsnPro: 2.095 ± 1.094
3.492AsnGln: 3.492 ± 1.19
0.698AsnArg: 0.698 ± 0.365
1.397AsnSer: 1.397 ± 0.729
1.397AsnThr: 1.397 ± 0.729
0.698AsnVal: 0.698 ± 1.396
0.698AsnTrp: 0.698 ± 0.365
1.397AsnTyr: 1.397 ± 1.172
0.0AsnXaa: 0.0 ± 0.0
Pro
4.19ProAla: 4.19 ± 6.746
1.397ProCys: 1.397 ± 1.172
2.095ProAsp: 2.095 ± 1.094
1.397ProGlu: 1.397 ± 1.19
4.19ProPhe: 4.19 ± 2.188
0.0ProGly: 0.0 ± 0.0
2.793ProHis: 2.793 ± 1.032
2.095ProIle: 2.095 ± 1.094
1.397ProLys: 1.397 ± 0.729
6.285ProLeu: 6.285 ± 3.217
1.397ProMet: 1.397 ± 1.172
1.397ProAsn: 1.397 ± 0.729
6.983ProPro: 6.983 ± 3.175
3.492ProGln: 3.492 ± 1.144
2.793ProArg: 2.793 ± 1.072
5.587ProSer: 5.587 ± 1.951
4.19ProThr: 4.19 ± 1.965
4.888ProVal: 4.888 ± 0.047
0.698ProTrp: 0.698 ± 0.365
2.095ProTyr: 2.095 ± 1.094
0.0ProXaa: 0.0 ± 0.0
Gln
2.793GlnAla: 2.793 ± 1.072
1.397GlnCys: 1.397 ± 0.729
1.397GlnAsp: 1.397 ± 0.729
0.0GlnGlu: 0.0 ± 0.0
1.397GlnPhe: 1.397 ± 1.19
2.095GlnGly: 2.095 ± 1.072
0.0GlnHis: 0.0 ± 0.0
2.793GlnIle: 2.793 ± 1.458
1.397GlnLys: 1.397 ± 0.729
6.983GlnLeu: 6.983 ± 2.533
0.698GlnMet: 0.698 ± 0.365
0.0GlnAsn: 0.0 ± 0.0
1.397GlnPro: 1.397 ± 0.729
0.698GlnGln: 0.698 ± 1.388
1.397GlnArg: 1.397 ± 0.729
3.492GlnSer: 3.492 ± 1.144
4.19GlnThr: 4.19 ± 1.349
4.19GlnVal: 4.19 ± 1.395
0.0GlnTrp: 0.0 ± 0.0
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
2.793ArgAla: 2.793 ± 1.124
1.397ArgCys: 1.397 ± 0.729
5.587ArgAsp: 5.587 ± 2.144
4.888ArgGlu: 4.888 ± 2.042
2.095ArgPhe: 2.095 ± 1.094
2.095ArgGly: 2.095 ± 1.094
0.698ArgHis: 0.698 ± 0.365
2.793ArgIle: 2.793 ± 1.458
5.587ArgLys: 5.587 ± 2.144
2.793ArgLeu: 2.793 ± 1.032
1.397ArgMet: 1.397 ± 2.792
2.095ArgAsn: 2.095 ± 1.072
2.095ArgPro: 2.095 ± 1.072
2.095ArgGln: 2.095 ± 1.094
2.793ArgArg: 2.793 ± 1.032
2.793ArgSer: 2.793 ± 1.072
4.888ArgThr: 4.888 ± 0.047
0.698ArgVal: 0.698 ± 0.365
0.698ArgTrp: 0.698 ± 0.365
2.793ArgTyr: 2.793 ± 2.344
0.0ArgXaa: 0.0 ± 0.0
Ser
5.587SerAla: 5.587 ± 1.443
0.698SerCys: 0.698 ± 1.388
3.492SerAsp: 3.492 ± 1.144
2.095SerGlu: 2.095 ± 1.042
4.19SerPhe: 4.19 ± 1.349
2.095SerGly: 2.095 ± 1.072
1.397SerHis: 1.397 ± 1.172
6.285SerIle: 6.285 ± 4.906
1.397SerLys: 1.397 ± 0.729
9.777SerLeu: 9.777 ± 2.644
0.698SerMet: 0.698 ± 0.872
1.397SerAsn: 1.397 ± 0.729
5.587SerPro: 5.587 ± 0.337
1.397SerGln: 1.397 ± 0.729
5.587SerArg: 5.587 ± 2.917
8.38SerSer: 8.38 ± 3.929
4.19SerThr: 4.19 ± 1.349
3.492SerVal: 3.492 ± 1.144
2.793SerTrp: 2.793 ± 1.458
1.397SerTyr: 1.397 ± 2.776
0.0SerXaa: 0.0 ± 0.0
Thr
4.888ThrAla: 4.888 ± 1.703
0.698ThrCys: 0.698 ± 0.365
1.397ThrAsp: 1.397 ± 1.172
4.888ThrGlu: 4.888 ± 1.656
2.793ThrPhe: 2.793 ± 1.124
2.793ThrGly: 2.793 ± 1.124
2.095ThrHis: 2.095 ± 1.072
2.793ThrIle: 2.793 ± 2.344
8.38ThrLys: 8.38 ± 3.217
6.983ThrLeu: 6.983 ± 3.074
1.397ThrMet: 1.397 ± 1.19
2.095ThrAsn: 2.095 ± 1.072
6.285ThrPro: 6.285 ± 3.326
0.0ThrGln: 0.0 ± 0.0
1.397ThrArg: 1.397 ± 0.729
5.587ThrSer: 5.587 ± 1.907
5.587ThrThr: 5.587 ± 3.221
4.888ThrVal: 4.888 ± 3.352
1.397ThrTrp: 1.397 ± 0.729
1.397ThrTyr: 1.397 ± 0.729
0.0ThrXaa: 0.0 ± 0.0
Val
6.285ValAla: 6.285 ± 1.236
0.698ValCys: 0.698 ± 0.365
2.793ValAsp: 2.793 ± 1.458
4.888ValGlu: 4.888 ± 2.552
2.095ValPhe: 2.095 ± 1.042
2.095ValGly: 2.095 ± 1.042
0.698ValHis: 0.698 ± 1.396
1.397ValIle: 1.397 ± 0.729
2.793ValLys: 2.793 ± 1.072
4.888ValLeu: 4.888 ± 3.36
2.793ValMet: 2.793 ± 1.124
2.793ValAsn: 2.793 ± 1.032
3.492ValPro: 3.492 ± 1.144
2.793ValGln: 2.793 ± 1.124
2.095ValArg: 2.095 ± 1.094
4.19ValSer: 4.19 ± 1.395
4.888ValThr: 4.888 ± 3.36
5.587ValVal: 5.587 ± 2.144
2.095ValTrp: 2.095 ± 1.072
4.888ValTyr: 4.888 ± 3.352
0.0ValXaa: 0.0 ± 0.0
Trp
0.698TrpAla: 0.698 ± 1.396
0.0TrpCys: 0.0 ± 0.0
1.397TrpAsp: 1.397 ± 0.729
0.698TrpGlu: 0.698 ± 1.396
0.698TrpPhe: 0.698 ± 0.365
0.698TrpGly: 0.698 ± 1.396
0.698TrpHis: 0.698 ± 0.365
1.397TrpIle: 1.397 ± 1.19
2.793TrpLys: 2.793 ± 1.458
0.698TrpLeu: 0.698 ± 0.365
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
1.397TrpArg: 1.397 ± 0.729
0.0TrpSer: 0.0 ± 0.0
1.397TrpThr: 1.397 ± 0.729
2.095TrpVal: 2.095 ± 1.094
0.0TrpTrp: 0.0 ± 0.0
0.698TrpTyr: 0.698 ± 0.365
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.19TyrAla: 4.19 ± 0.396
2.095TyrCys: 2.095 ± 1.042
2.095TyrAsp: 2.095 ± 1.094
0.698TyrGlu: 0.698 ± 0.365
2.095TyrPhe: 2.095 ± 1.042
2.793TyrGly: 2.793 ± 1.458
0.698TyrHis: 0.698 ± 1.388
1.397TyrIle: 1.397 ± 1.172
0.698TyrLys: 0.698 ± 1.388
6.285TyrLeu: 6.285 ± 8.159
1.397TyrMet: 1.397 ± 0.729
2.793TyrAsn: 2.793 ± 1.124
0.698TyrPro: 0.698 ± 0.365
0.0TyrGln: 0.0 ± 0.0
1.397TyrArg: 1.397 ± 1.172
2.793TyrSer: 2.793 ± 1.458
2.793TyrThr: 2.793 ± 1.032
1.397TyrVal: 1.397 ± 0.729
1.397TyrTrp: 1.397 ± 1.19
0.698TyrTyr: 0.698 ± 1.388
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1433 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
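The page does not describe the bootstrap procedure in detail. One plausible reading, sketched below, is to resample whole proteins with replacement 100 times, recompute the dipeptide frequency for each resample, and report the standard deviation of those estimates. The function names, the fixed seed, and the use of the population standard deviation are assumptions of this sketch.

```python
import random
import statistics


def dipeptide_freq(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    count = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the frequency estimate under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_freq(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences, not taken from this proteome.
toy = ["MKKLAA", "AAKL", "KLKLAA"]
print(dipeptide_freq(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only 3 proteins, as here, each resample can differ strongly from the original set, which is consistent with several errors in the table being as large as the values themselves.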

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski