Amino acid dipepetide frequency for Pelargonium vein banding virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.869AlaAla: 8.869 ± 1.417
1.33AlaCys: 1.33 ± 0.634
3.548AlaAsp: 3.548 ± 1.003
7.095AlaGlu: 7.095 ± 6.977
3.548AlaPhe: 3.548 ± 1.003
1.774AlaGly: 1.774 ± 0.845
0.443AlaHis: 0.443 ± 0.211
5.322AlaIle: 5.322 ± 2.191
3.548AlaLys: 3.548 ± 2.71
7.982AlaLeu: 7.982 ± 4.682
1.774AlaMet: 1.774 ± 0.898
2.661AlaAsn: 2.661 ± 1.095
3.548AlaPro: 3.548 ± 1.69
7.539AlaGln: 7.539 ± 1.708
6.652AlaArg: 6.652 ± 1.898
3.548AlaSer: 3.548 ± 1.003
3.104AlaThr: 3.104 ± 3.687
3.548AlaVal: 3.548 ± 1.69
0.443AlaTrp: 0.443 ± 0.211
3.548AlaTyr: 3.548 ± 1.69
0.0AlaXaa: 0.0 ± 0.0
Cys
1.33CysAla: 1.33 ± 0.634
1.33CysCys: 1.33 ± 0.634
0.887CysAsp: 0.887 ± 0.422
0.443CysGlu: 0.443 ± 0.211
1.33CysPhe: 1.33 ± 0.634
1.33CysGly: 1.33 ± 0.634
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
1.33CysLys: 1.33 ± 0.634
0.0CysLeu: 0.0 ± 0.0
0.887CysMet: 0.887 ± 0.422
0.443CysAsn: 0.443 ± 0.211
0.887CysPro: 0.887 ± 0.422
0.443CysGln: 0.443 ± 0.211
1.774CysArg: 1.774 ± 0.845
1.774CysSer: 1.774 ± 1.323
0.0CysThr: 0.0 ± 0.0
0.443CysVal: 0.443 ± 0.211
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.548AspAla: 3.548 ± 1.69
1.33AspCys: 1.33 ± 0.634
4.435AspAsp: 4.435 ± 2.112
3.104AspGlu: 3.104 ± 1.479
3.104AspPhe: 3.104 ± 1.479
3.104AspGly: 3.104 ± 1.116
0.0AspHis: 0.0 ± 0.0
4.435AspIle: 4.435 ± 1.081
2.661AspLys: 2.661 ± 1.163
6.208AspLeu: 6.208 ± 4.244
0.443AspMet: 0.443 ± 0.211
2.217AspAsn: 2.217 ± 1.056
2.661AspPro: 2.661 ± 1.163
2.217AspGln: 2.217 ± 1.056
1.774AspArg: 1.774 ± 1.323
3.104AspSer: 3.104 ± 1.116
2.217AspThr: 2.217 ± 1.196
1.33AspVal: 1.33 ± 0.634
0.887AspTrp: 0.887 ± 0.422
1.774AspTyr: 1.774 ± 1.323
0.0AspXaa: 0.0 ± 0.0
Glu
7.982GluAla: 7.982 ± 6.536
0.887GluCys: 0.887 ± 0.422
4.435GluAsp: 4.435 ± 2.112
12.86GluGlu: 12.86 ± 2.362
0.887GluPhe: 0.887 ± 0.422
6.208GluGly: 6.208 ± 2.233
1.33GluHis: 1.33 ± 0.634
6.652GluIle: 6.652 ± 2.079
3.991GluLys: 3.991 ± 1.021
8.869GluLeu: 8.869 ± 1.216
1.33GluMet: 1.33 ± 0.634
1.33GluAsn: 1.33 ± 0.634
3.104GluPro: 3.104 ± 1.116
4.435GluGln: 4.435 ± 5.013
3.548GluArg: 3.548 ± 1.648
7.982GluSer: 7.982 ± 2.043
2.661GluThr: 2.661 ± 1.267
2.217GluVal: 2.217 ± 1.056
1.774GluTrp: 1.774 ± 0.845
0.887GluTyr: 0.887 ± 1.635
0.0GluXaa: 0.0 ± 0.0
Phe
1.33PheAla: 1.33 ± 0.634
1.33PheCys: 1.33 ± 0.634
0.887PheAsp: 0.887 ± 0.422
2.217PheGlu: 2.217 ± 1.196
0.443PhePhe: 0.443 ± 0.211
1.33PheGly: 1.33 ± 0.634
0.887PheHis: 0.887 ± 0.422
3.548PheIle: 3.548 ± 1.69
3.104PheLys: 3.104 ± 1.029
3.548PheLeu: 3.548 ± 2.71
0.443PheMet: 0.443 ± 0.211
0.443PheAsn: 0.443 ± 0.211
0.887PhePro: 0.887 ± 0.422
0.887PheGln: 0.887 ± 0.422
2.661PheArg: 2.661 ± 1.267
1.774PheSer: 1.774 ± 0.845
2.217PheThr: 2.217 ± 1.056
0.443PheVal: 0.443 ± 0.211
0.0PheTrp: 0.0 ± 0.0
1.33PheTyr: 1.33 ± 1.487
0.0PheXaa: 0.0 ± 0.0
Gly
4.435GlyAla: 4.435 ± 2.112
0.887GlyCys: 0.887 ± 0.422
1.33GlyAsp: 1.33 ± 1.487
5.322GlyGlu: 5.322 ± 1.431
2.661GlyPhe: 2.661 ± 1.163
3.548GlyGly: 3.548 ± 1.69
0.887GlyHis: 0.887 ± 0.422
2.217GlyIle: 2.217 ± 1.056
3.548GlyLys: 3.548 ± 1.108
6.652GlyLeu: 6.652 ± 1.898
0.0GlyMet: 0.0 ± 0.0
2.217GlyAsn: 2.217 ± 1.056
3.991GlyPro: 3.991 ± 1.14
0.887GlyGln: 0.887 ± 1.635
2.217GlyArg: 2.217 ± 1.056
4.435GlySer: 4.435 ± 4.322
2.661GlyThr: 2.661 ± 1.267
4.435GlyVal: 4.435 ± 1.208
1.33GlyTrp: 1.33 ± 0.634
1.774GlyTyr: 1.774 ± 0.845
0.0GlyXaa: 0.0 ± 0.0
His
0.887HisAla: 0.887 ± 0.422
0.443HisCys: 0.443 ± 0.211
1.33HisAsp: 1.33 ± 0.634
0.443HisGlu: 0.443 ± 0.211
0.443HisPhe: 0.443 ± 0.211
0.887HisGly: 0.887 ± 0.422
0.443HisHis: 0.443 ± 0.211
0.887HisIle: 0.887 ± 0.422
1.774HisLys: 1.774 ± 0.845
1.774HisLeu: 1.774 ± 1.323
0.0HisMet: 0.0 ± 0.209
0.887HisAsn: 0.887 ± 0.422
0.0HisPro: 0.0 ± 0.0
1.774HisGln: 1.774 ± 0.845
1.774HisArg: 1.774 ± 1.355
1.774HisSer: 1.774 ± 2.492
0.887HisThr: 0.887 ± 0.422
1.33HisVal: 1.33 ± 0.634
0.443HisTrp: 0.443 ± 0.211
1.33HisTyr: 1.33 ± 0.634
0.0HisXaa: 0.0 ± 0.0
Ile
5.765IleAla: 5.765 ± 0.598
1.33IleCys: 1.33 ± 0.634
1.774IleAsp: 1.774 ± 0.845
2.217IleGlu: 2.217 ± 1.056
0.443IlePhe: 0.443 ± 0.211
3.991IleGly: 3.991 ± 1.14
3.991IleHis: 3.991 ± 1.901
3.104IleIle: 3.104 ± 1.116
4.435IleLys: 4.435 ± 3.096
3.991IleLeu: 3.991 ± 2.514
0.443IleMet: 0.443 ± 0.211
4.435IleAsn: 4.435 ± 1.227
4.878IlePro: 4.878 ± 1.308
5.765IleGln: 5.765 ± 0.598
3.104IleArg: 3.104 ± 1.029
2.661IleSer: 2.661 ± 1.267
3.991IleThr: 3.991 ± 1.901
2.661IleVal: 2.661 ± 1.267
0.887IleTrp: 0.887 ± 0.422
1.774IleTyr: 1.774 ± 1.323
0.0IleXaa: 0.0 ± 0.0
Lys
7.539LysAla: 7.539 ± 1.828
1.774LysCys: 1.774 ± 0.845
4.435LysAsp: 4.435 ± 2.392
7.539LysGlu: 7.539 ± 1.708
1.774LysPhe: 1.774 ± 0.845
3.991LysGly: 3.991 ± 1.14
1.774LysHis: 1.774 ± 0.845
5.765LysIle: 5.765 ± 2.524
3.991LysLys: 3.991 ± 1.901
5.322LysLeu: 5.322 ± 5.909
1.774LysMet: 1.774 ± 0.845
1.774LysAsn: 1.774 ± 1.355
3.104LysPro: 3.104 ± 1.029
1.774LysGln: 1.774 ± 1.355
2.661LysArg: 2.661 ± 1.095
3.104LysSer: 3.104 ± 1.479
1.774LysThr: 1.774 ± 1.355
4.435LysVal: 4.435 ± 3.065
0.887LysTrp: 0.887 ± 1.63
1.774LysTyr: 1.774 ± 0.845
0.0LysXaa: 0.0 ± 0.0
Leu
4.878LeuAla: 4.878 ± 2.864
0.443LeuCys: 0.443 ± 0.211
5.765LeuAsp: 5.765 ± 2.524
8.426LeuGlu: 8.426 ± 2.664
0.887LeuPhe: 0.887 ± 0.422
5.765LeuGly: 5.765 ± 1.574
2.217LeuHis: 2.217 ± 2.281
4.435LeuIle: 4.435 ± 1.208
5.322LeuLys: 5.322 ± 1.431
6.208LeuLeu: 6.208 ± 5.478
2.217LeuMet: 2.217 ± 1.056
3.548LeuAsn: 3.548 ± 1.69
6.652LeuPro: 6.652 ± 3.169
3.548LeuGln: 3.548 ± 1.003
4.878LeuArg: 4.878 ± 2.864
6.652LeuSer: 6.652 ± 2.163
3.548LeuThr: 3.548 ± 2.646
7.982LeuVal: 7.982 ± 7.469
0.887LeuTrp: 0.887 ± 0.422
3.104LeuTyr: 3.104 ± 1.479
0.0LeuXaa: 0.0 ± 0.0
Met
3.104MetAla: 3.104 ± 1.479
0.443MetCys: 0.443 ± 0.211
1.33MetAsp: 1.33 ± 0.634
1.774MetGlu: 1.774 ± 0.845
0.443MetPhe: 0.443 ± 0.211
1.774MetGly: 1.774 ± 0.845
0.0MetHis: 0.0 ± 0.0
0.887MetIle: 0.887 ± 0.422
0.443MetLys: 0.443 ± 0.211
1.774MetLeu: 1.774 ± 0.845
1.33MetMet: 1.33 ± 0.634
0.887MetAsn: 0.887 ± 0.422
1.774MetPro: 1.774 ± 0.845
0.0MetGln: 0.0 ± 0.0
2.217MetArg: 2.217 ± 1.056
1.33MetSer: 1.33 ± 2.703
0.443MetThr: 0.443 ± 0.211
1.774MetVal: 1.774 ± 0.845
0.443MetTrp: 0.443 ± 0.211
0.443MetTyr: 0.443 ± 0.211
0.0MetXaa: 0.0 ± 0.0
Asn
0.887AsnAla: 0.887 ± 0.422
0.0AsnCys: 0.0 ± 0.0
1.774AsnAsp: 1.774 ± 0.845
1.774AsnGlu: 1.774 ± 1.323
3.104AsnPhe: 3.104 ± 1.116
2.661AsnGly: 2.661 ± 1.163
0.443AsnHis: 0.443 ± 0.211
2.217AsnIle: 2.217 ± 1.245
3.548AsnLys: 3.548 ± 1.69
3.991AsnLeu: 3.991 ± 1.901
1.33AsnMet: 1.33 ± 0.634
2.661AsnAsn: 2.661 ± 3.881
2.661AsnPro: 2.661 ± 1.267
1.33AsnGln: 1.33 ± 1.47
1.774AsnArg: 1.774 ± 0.845
2.217AsnSer: 2.217 ± 1.056
3.104AsnThr: 3.104 ± 1.479
2.661AsnVal: 2.661 ± 1.095
0.887AsnTrp: 0.887 ± 0.422
2.661AsnTyr: 2.661 ± 1.267
0.0AsnXaa: 0.0 ± 0.0
Pro
5.322ProAla: 5.322 ± 2.327
0.443ProCys: 0.443 ± 0.211
2.661ProAsp: 2.661 ± 1.163
5.322ProGlu: 5.322 ± 2.535
2.661ProPhe: 2.661 ± 1.267
3.548ProGly: 3.548 ± 2.71
1.33ProHis: 1.33 ± 0.634
3.548ProIle: 3.548 ± 1.69
4.435ProLys: 4.435 ± 1.227
2.217ProLeu: 2.217 ± 1.196
1.33ProMet: 1.33 ± 0.634
2.217ProAsn: 2.217 ± 1.056
7.539ProPro: 7.539 ± 2.254
1.33ProGln: 1.33 ± 0.634
3.548ProArg: 3.548 ± 1.69
1.774ProSer: 1.774 ± 0.845
3.548ProThr: 3.548 ± 1.003
1.774ProVal: 1.774 ± 0.845
0.443ProTrp: 0.443 ± 0.211
1.774ProTyr: 1.774 ± 0.845
0.0ProXaa: 0.0 ± 0.0
Gln
3.104GlnAla: 3.104 ± 3.676
0.0GlnCys: 0.0 ± 0.0
4.435GlnAsp: 4.435 ± 1.081
5.322GlnGlu: 5.322 ± 2.191
0.443GlnPhe: 0.443 ± 0.211
2.661GlnGly: 2.661 ± 2.974
0.887GlnHis: 0.887 ± 1.63
2.217GlnIle: 2.217 ± 1.056
4.435GlnLys: 4.435 ± 3.096
2.217GlnLeu: 2.217 ± 1.056
2.217GlnMet: 2.217 ± 1.056
2.217GlnAsn: 2.217 ± 1.245
1.774GlnPro: 1.774 ± 1.323
1.774GlnGln: 1.774 ± 0.845
3.104GlnArg: 3.104 ± 1.479
3.104GlnSer: 3.104 ± 1.479
1.33GlnThr: 1.33 ± 1.487
4.878GlnVal: 4.878 ± 4.11
0.887GlnTrp: 0.887 ± 0.422
0.887GlnTyr: 0.887 ± 0.422
0.0GlnXaa: 0.0 ± 0.0
Arg
3.548ArgAla: 3.548 ± 1.108
0.0ArgCys: 0.0 ± 0.0
2.217ArgAsp: 2.217 ± 1.245
2.217ArgGlu: 2.217 ± 1.196
1.774ArgPhe: 1.774 ± 1.323
3.104ArgGly: 3.104 ± 1.479
2.661ArgHis: 2.661 ± 1.267
1.774ArgIle: 1.774 ± 0.845
5.322ArgLys: 5.322 ± 2.327
8.426ArgLeu: 8.426 ± 1.367
2.661ArgMet: 2.661 ± 1.267
2.217ArgAsn: 2.217 ± 1.056
3.548ArgPro: 3.548 ± 1.108
0.887ArgGln: 0.887 ± 0.422
9.313ArgArg: 9.313 ± 1.106
3.548ArgSer: 3.548 ± 1.003
5.322ArgThr: 5.322 ± 1.3
3.991ArgVal: 3.991 ± 1.021
3.104ArgTrp: 3.104 ± 1.479
1.774ArgTyr: 1.774 ± 2.492
0.0ArgXaa: 0.0 ± 0.0
Ser
3.548SerAla: 3.548 ± 1.108
0.443SerCys: 0.443 ± 0.211
2.661SerAsp: 2.661 ± 1.267
4.435SerGlu: 4.435 ± 1.081
2.217SerPhe: 2.217 ± 1.056
2.661SerGly: 2.661 ± 1.267
0.887SerHis: 0.887 ± 0.422
3.104SerIle: 3.104 ± 1.116
5.322SerLys: 5.322 ± 3.969
2.217SerLeu: 2.217 ± 1.056
2.661SerMet: 2.661 ± 0.802
3.548SerAsn: 3.548 ± 1.69
4.878SerPro: 4.878 ± 1.017
4.878SerGln: 4.878 ± 2.902
4.435SerArg: 4.435 ± 3.096
3.991SerSer: 3.991 ± 2.594
4.878SerThr: 4.878 ± 2.324
3.991SerVal: 3.991 ± 3.268
0.443SerTrp: 0.443 ± 0.211
1.33SerTyr: 1.33 ± 0.634
0.0SerXaa: 0.0 ± 0.0
Thr
5.322ThrAla: 5.322 ± 4.14
0.0ThrCys: 0.0 ± 0.0
3.104ThrAsp: 3.104 ± 1.029
2.661ThrGlu: 2.661 ± 2.07
1.33ThrPhe: 1.33 ± 0.634
3.548ThrGly: 3.548 ± 1.69
1.33ThrHis: 1.33 ± 1.487
4.435ThrIle: 4.435 ± 2.112
0.887ThrLys: 0.887 ± 0.422
5.765ThrLeu: 5.765 ± 2.746
0.443ThrMet: 0.443 ± 0.211
1.33ThrAsn: 1.33 ± 0.634
1.33ThrPro: 1.33 ± 0.634
0.887ThrGln: 0.887 ± 0.422
3.548ThrArg: 3.548 ± 1.69
4.435ThrSer: 4.435 ± 2.112
4.435ThrThr: 4.435 ± 1.227
2.661ThrVal: 2.661 ± 1.095
0.443ThrTrp: 0.443 ± 0.211
1.774ThrTyr: 1.774 ± 1.323
0.0ThrXaa: 0.0 ± 0.0
Val
3.548ValAla: 3.548 ± 1.003
1.33ValCys: 1.33 ± 0.634
2.217ValAsp: 2.217 ± 1.056
4.435ValGlu: 4.435 ± 3.065
1.774ValPhe: 1.774 ± 0.845
1.774ValGly: 1.774 ± 0.845
0.443ValHis: 0.443 ± 0.211
3.991ValIle: 3.991 ± 2.514
6.652ValLys: 6.652 ± 6.035
5.765ValLeu: 5.765 ± 0.598
0.443ValMet: 0.443 ± 0.211
2.661ValAsn: 2.661 ± 1.267
1.33ValPro: 1.33 ± 0.634
3.104ValGln: 3.104 ± 2.789
4.435ValArg: 4.435 ± 1.081
2.661ValSer: 2.661 ± 2.939
1.774ValThr: 1.774 ± 0.845
3.548ValVal: 3.548 ± 1.69
0.0ValTrp: 0.0 ± 0.0
3.104ValTyr: 3.104 ± 1.116
0.0ValXaa: 0.0 ± 0.0
Trp
1.774TrpAla: 1.774 ± 0.845
0.0TrpCys: 0.0 ± 0.0
0.887TrpAsp: 0.887 ± 0.422
1.33TrpGlu: 1.33 ± 0.634
0.0TrpPhe: 0.0 ± 0.0
0.443TrpGly: 0.443 ± 0.211
0.0TrpHis: 0.0 ± 0.0
0.443TrpIle: 0.443 ± 0.211
0.443TrpLys: 0.443 ± 0.211
0.887TrpLeu: 0.887 ± 0.422
0.0TrpMet: 0.0 ± 0.0
1.33TrpAsn: 1.33 ± 0.634
0.887TrpPro: 0.887 ± 0.422
1.774TrpGln: 1.774 ± 0.845
1.774TrpArg: 1.774 ± 0.845
1.33TrpSer: 1.33 ± 0.634
0.443TrpThr: 0.443 ± 0.211
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.443TrpTyr: 0.443 ± 1.802
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.104TyrAla: 3.104 ± 1.116
0.887TyrCys: 0.887 ± 1.63
0.443TyrAsp: 0.443 ± 1.802
4.435TyrGlu: 4.435 ± 2.112
0.443TyrPhe: 0.443 ± 1.797
1.33TyrGly: 1.33 ± 0.634
0.0TyrHis: 0.0 ± 0.0
2.661TyrIle: 2.661 ± 1.163
1.774TyrLys: 1.774 ± 1.323
3.548TyrLeu: 3.548 ± 1.003
0.443TyrMet: 0.443 ± 0.211
2.661TyrAsn: 2.661 ± 1.095
1.33TyrPro: 1.33 ± 0.634
2.661TyrGln: 2.661 ± 1.267
2.217TyrArg: 2.217 ± 1.245
1.33TyrSer: 1.33 ± 0.634
1.33TyrThr: 1.33 ± 0.634
0.887TyrVal: 0.887 ± 0.422
0.0TyrTrp: 0.0 ± 0.0
1.774TyrTyr: 1.774 ± 0.845
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2256 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski