Amino acid dipeptide frequency for Sida yellow net virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
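
The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used to build this page: the function name `dipeptide_permille`, the toy sequences, and the choice to count only pairs inside a single protein are all assumptions.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids plus X (Xaa) for non-standard or unknown residues.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"


def dipeptide_permille(proteins):
    """Return the frequency of all 441 dipeptides in per mille.

    Dipeptides are counted as overlapping pairs inside each protein;
    pairs spanning two different proteins are not counted (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        # Map any letter outside the alphabet to X.
        seq = "".join(aa if aa in ALPHABET else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Expected per mille value if every dipeptide were equally likely.
EXPECTED_400 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%
EXPECTED_441 = 1000.0 / 441  # about 2.268 per mille when Xaa is included

# Hypothetical toy sequences, not the real Sida yellow net virus proteins.
toy = ["MKVLAAGK", "AAGKSS"]
freq = dipeptide_permille(toy)
overrepresented = sorted(dp for dp, f in freq.items() if f > EXPECTED_400)
```

Here `overrepresented` lists the dipeptides that would be marked in red under the uniform 2.5‰ expectation; every dipeptide absent from the toy set falls below it.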

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
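
As a small worked example of this conversion, the sketch below turns a few per mille values copied from the table on this page into fractions and compares them with the uniform expectation of 2.5‰. The helper names are illustrative assumptions.

```python
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def representation(value_permille, expected=UNIFORM_PERMILLE):
    """Classify a value as 'over', 'under' or 'expected' relative to the uniform case."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Values copied from the table on this page.
table_values = {"SerSer": 12.565, "AlaCys": 1.047, "AlaMet": 0.0}

summary = {
    name: (permille_to_fraction(value), representation(value))
    for name, value in table_values.items()
}
```

With these inputs, SerSer (12.565‰, a fraction of about 0.0126) is classified as overrepresented, while AlaCys and AlaMet are underrepresented.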

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.236 ± 2.93
AlaCys: 1.047 ± 0.912
AlaAsp: 1.047 ± 0.76
AlaGlu: 2.094 ± 1.103
AlaPhe: 1.047 ± 1.168
AlaGly: 2.094 ± 1.825
AlaHis: 1.047 ± 0.76
AlaIle: 1.047 ± 0.76
AlaLys: 5.236 ± 1.812
AlaLeu: 7.33 ± 2.125
AlaMet: 0.0 ± 0.0
AlaAsn: 2.094 ± 1.49
AlaPro: 4.188 ± 1.86
AlaGln: 1.047 ± 0.76
AlaArg: 4.188 ± 2.267
AlaSer: 6.283 ± 2.119
AlaThr: 4.188 ± 1.504
AlaVal: 1.047 ± 1.168
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.0 ± 0.0
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.047 ± 1.23
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.047 ± 0.912
CysPhe: 0.0 ± 0.0
CysGly: 1.047 ± 1.23
CysHis: 0.0 ± 0.0
CysIle: 3.141 ± 1.077
CysLys: 2.094 ± 0.839
CysLeu: 0.0 ± 0.0
CysMet: 1.047 ± 0.912
CysAsn: 0.0 ± 0.0
CysPro: 0.0 ± 0.0
CysGln: 0.0 ± 0.0
CysArg: 1.047 ± 0.76
CysSer: 4.188 ± 1.758
CysThr: 3.141 ± 1.077
CysVal: 1.047 ± 0.912
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.047 ± 0.76
AspCys: 1.047 ± 1.578
AspAsp: 3.141 ± 2.355
AspGlu: 3.141 ± 1.316
AspPhe: 2.094 ± 0.839
AspGly: 2.094 ± 1.52
AspHis: 0.0 ± 0.0
AspIle: 6.283 ± 3.374
AspLys: 3.141 ± 2.28
AspLeu: 4.188 ± 1.071
AspMet: 2.094 ± 1.532
AspAsn: 2.094 ± 1.352
AspPro: 1.047 ± 0.912
AspGln: 1.047 ± 1.23
AspArg: 4.188 ± 2.654
AspSer: 6.283 ± 1.107
AspThr: 2.094 ± 1.244
AspVal: 3.141 ± 1.316
AspTrp: 1.047 ± 0.76
AspTyr: 0.0 ± 0.0
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.141 ± 1.367
GluCys: 0.0 ± 0.0
GluAsp: 1.047 ± 1.168
GluGlu: 6.283 ± 3.523
GluPhe: 0.0 ± 0.0
GluGly: 4.188 ± 1.978
GluHis: 0.0 ± 0.0
GluIle: 2.094 ± 2.074
GluLys: 2.094 ± 1.52
GluLeu: 5.236 ± 1.798
GluMet: 1.047 ± 0.76
GluAsn: 7.33 ± 2.347
GluPro: 3.141 ± 1.316
GluGln: 2.094 ± 1.825
GluArg: 1.047 ± 0.76
GluSer: 1.047 ± 1.23
GluThr: 1.047 ± 0.76
GluVal: 2.094 ± 1.49
GluTrp: 4.188 ± 1.282
GluTyr: 3.141 ± 2.28
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 2.094 ± 0.839
PheAsp: 3.141 ± 1.077
PheGlu: 1.047 ± 0.76
PhePhe: 0.0 ± 0.0
PheGly: 2.094 ± 0.839
PheHis: 2.094 ± 1.52
PheIle: 3.141 ± 2.28
PheLys: 2.094 ± 2.336
PheLeu: 4.188 ± 2.094
PheMet: 0.0 ± 0.0
PheAsn: 4.188 ± 1.504
PhePro: 1.047 ± 0.76
PheGln: 6.283 ± 1.349
PheArg: 2.094 ± 2.074
PheSer: 2.094 ± 1.687
PheThr: 3.141 ± 1.655
PheVal: 0.0 ± 0.0
PheTrp: 3.141 ± 1.952
PheTyr: 2.094 ± 1.825
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.094 ± 1.52
GlyCys: 2.094 ± 1.352
GlyAsp: 4.188 ± 1.758
GlyGlu: 4.188 ± 2.094
GlyPhe: 1.047 ± 1.23
GlyGly: 5.236 ± 3.328
GlyHis: 2.094 ± 1.244
GlyIle: 2.094 ± 1.352
GlyLys: 7.33 ± 3.17
GlyLeu: 0.0 ± 0.0
GlyMet: 0.0 ± 0.0
GlyAsn: 2.094 ± 1.687
GlyPro: 3.141 ± 1.58
GlyGln: 5.236 ± 1.63
GlyArg: 3.141 ± 1.655
GlySer: 4.188 ± 1.758
GlyThr: 5.236 ± 2.716
GlyVal: 3.141 ± 2.141
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.047 ± 0.76
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.047 ± 0.912
HisCys: 1.047 ± 1.23
HisAsp: 2.094 ± 1.352
HisGlu: 1.047 ± 0.76
HisPhe: 1.047 ± 0.76
HisGly: 1.047 ± 1.23
HisHis: 1.047 ± 1.23
HisIle: 2.094 ± 1.888
HisLys: 2.094 ± 1.327
HisLeu: 3.141 ± 1.491
HisMet: 0.0 ± 0.0
HisAsn: 4.188 ± 2.094
HisPro: 1.047 ± 0.76
HisGln: 2.094 ± 0.839
HisArg: 3.141 ± 2.419
HisSer: 1.047 ± 1.168
HisThr: 2.094 ± 1.825
HisVal: 6.283 ± 2.065
HisTrp: 1.047 ± 0.76
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 0.0 ± 0.0
IleCys: 1.047 ± 0.76
IleAsp: 4.188 ± 1.758
IleGlu: 1.047 ± 0.76
IlePhe: 2.094 ± 1.52
IleGly: 2.094 ± 1.244
IleHis: 0.0 ± 0.0
IleIle: 2.094 ± 1.49
IleLys: 5.236 ± 0.952
IleLeu: 1.047 ± 0.912
IleMet: 0.0 ± 0.0
IleAsn: 2.094 ± 1.631
IlePro: 2.094 ± 2.46
IleGln: 5.236 ± 2.142
IleArg: 6.283 ± 3.464
IleSer: 5.236 ± 3.315
IleThr: 7.33 ± 3.483
IleVal: 1.047 ± 0.76
IleTrp: 1.047 ± 1.168
IleTyr: 3.141 ± 1.955
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.283 ± 2.037
LysCys: 0.0 ± 0.0
LysAsp: 2.094 ± 1.52
LysGlu: 4.188 ± 3.039
LysPhe: 7.33 ± 2.125
LysGly: 2.094 ± 1.49
LysHis: 1.047 ± 0.76
LysIle: 3.141 ± 0.953
LysLys: 1.047 ± 0.76
LysLeu: 5.236 ± 1.633
LysMet: 0.0 ± 0.0
LysAsn: 3.141 ± 1.58
LysPro: 1.047 ± 0.912
LysGln: 1.047 ± 0.76
LysArg: 4.188 ± 2.731
LysSer: 5.236 ± 1.812
LysThr: 2.094 ± 1.49
LysVal: 8.377 ± 4.918
LysTrp: 0.0 ± 0.0
LysTyr: 2.094 ± 0.839
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 0.0 ± 0.0
LeuCys: 0.0 ± 0.0
LeuAsp: 7.33 ± 2.795
LeuGlu: 2.094 ± 2.074
LeuPhe: 3.141 ± 1.655
LeuGly: 4.188 ± 1.071
LeuHis: 4.188 ± 2.094
LeuIle: 5.236 ± 2.559
LeuLys: 6.283 ± 2.631
LeuLeu: 3.141 ± 1.58
LeuMet: 1.047 ± 1.578
LeuAsn: 6.283 ± 3.185
LeuPro: 2.094 ± 1.244
LeuGln: 3.141 ± 1.761
LeuArg: 5.236 ± 1.843
LeuSer: 3.141 ± 2.28
LeuThr: 2.094 ± 1.103
LeuVal: 4.188 ± 1.504
LeuTrp: 0.0 ± 0.0
LeuTyr: 6.283 ± 3.051
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.094 ± 1.825
MetCys: 1.047 ± 0.912
MetAsp: 3.141 ± 1.955
MetGlu: 0.0 ± 0.0
MetPhe: 2.094 ± 1.825
MetGly: 1.047 ± 1.578
MetHis: 1.047 ± 0.912
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 2.094 ± 2.074
MetMet: 2.094 ± 2.074
MetAsn: 0.0 ± 0.0
MetPro: 2.094 ± 0.839
MetGln: 2.094 ± 1.244
MetArg: 0.0 ± 0.0
MetSer: 1.047 ± 0.76
MetThr: 0.0 ± 0.0
MetVal: 0.0 ± 0.0
MetTrp: 1.047 ± 0.76
MetTyr: 2.094 ± 1.327
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.188 ± 1.978
AsnCys: 2.094 ± 1.244
AsnAsp: 3.141 ± 1.077
AsnGlu: 3.141 ± 1.58
AsnPhe: 2.094 ± 1.327
AsnGly: 4.188 ± 1.586
AsnHis: 7.33 ± 3.782
AsnIle: 2.094 ± 0.839
AsnLys: 3.141 ± 1.077
AsnLeu: 3.141 ± 1.491
AsnMet: 1.047 ± 1.679
AsnAsn: 3.141 ± 0.953
AsnPro: 4.188 ± 1.586
AsnGln: 0.0 ± 0.0
AsnArg: 4.188 ± 1.43
AsnSer: 6.283 ± 1.52
AsnThr: 0.0 ± 0.0
AsnVal: 3.141 ± 1.491
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.094 ± 1.52
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.047 ± 1.578
ProCys: 2.094 ± 1.352
ProAsp: 3.141 ± 1.316
ProGlu: 3.141 ± 1.655
ProPhe: 1.047 ± 0.76
ProGly: 0.0 ± 0.0
ProHis: 2.094 ± 1.52
ProIle: 0.0 ± 0.0
ProLys: 2.094 ± 1.825
ProLeu: 4.188 ± 2.094
ProMet: 4.188 ± 1.704
ProAsn: 1.047 ± 0.76
ProPro: 2.094 ± 1.244
ProGln: 6.283 ± 3.65
ProArg: 7.33 ± 2.521
ProSer: 3.141 ± 1.952
ProThr: 3.141 ± 2.974
ProVal: 3.141 ± 1.077
ProTrp: 1.047 ± 0.76
ProTyr: 1.047 ± 0.912
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.094 ± 1.327
GlnCys: 2.094 ± 1.244
GlnAsp: 1.047 ± 1.23
GlnGlu: 3.141 ± 1.077
GlnPhe: 4.188 ± 3.039
GlnGly: 4.188 ± 1.758
GlnHis: 2.094 ± 1.888
GlnIle: 2.094 ± 1.103
GlnLys: 1.047 ± 0.76
GlnLeu: 3.141 ± 4.735
GlnMet: 1.047 ± 0.912
GlnAsn: 1.047 ± 1.23
GlnPro: 2.094 ± 2.46
GlnGln: 2.094 ± 1.49
GlnArg: 1.047 ± 0.912
GlnSer: 6.283 ± 2.465
GlnThr: 3.141 ± 2.28
GlnVal: 4.188 ± 1.504
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.094 ± 0.839
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.141 ± 1.404
ArgCys: 1.047 ± 0.76
ArgAsp: 3.141 ± 2.737
ArgGlu: 4.188 ± 1.758
ArgPhe: 5.236 ± 3.215
ArgGly: 8.377 ± 3.359
ArgHis: 2.094 ± 1.327
ArgIle: 5.236 ± 1.638
ArgLys: 3.141 ± 0.953
ArgLeu: 3.141 ± 1.404
ArgMet: 1.047 ± 1.205
ArgAsn: 1.047 ± 0.76
ArgPro: 4.188 ± 1.679
ArgGln: 2.094 ± 1.687
ArgArg: 7.33 ± 5.99
ArgSer: 4.188 ± 1.86
ArgThr: 7.33 ± 2.169
ArgVal: 3.141 ± 0.953
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.094 ± 2.336
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.283 ± 3.561
SerCys: 2.094 ± 0.839
SerAsp: 3.141 ± 1.077
SerGlu: 1.047 ± 0.912
SerPhe: 4.188 ± 1.758
SerGly: 4.188 ± 1.561
SerHis: 1.047 ± 0.912
SerIle: 4.188 ± 2.135
SerLys: 3.141 ± 1.367
SerLeu: 5.236 ± 1.798
SerMet: 0.0 ± 0.0
SerAsn: 7.33 ± 2.163
SerPro: 7.33 ± 2.065
SerGln: 1.047 ± 0.76
SerArg: 5.236 ± 1.403
SerSer: 12.565 ± 7.419
SerThr: 6.283 ± 6.166
SerVal: 2.094 ± 1.352
SerTrp: 2.094 ± 0.839
SerTyr: 4.188 ± 1.1
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.236 ± 1.848
ThrCys: 0.0 ± 0.0
ThrAsp: 2.094 ± 1.631
ThrGlu: 3.141 ± 3.137
ThrPhe: 1.047 ± 1.578
ThrGly: 3.141 ± 0.953
ThrHis: 4.188 ± 2.705
ThrIle: 2.094 ± 1.103
ThrLys: 3.141 ± 1.761
ThrLeu: 4.188 ± 1.679
ThrMet: 1.047 ± 0.76
ThrAsn: 5.236 ± 1.364
ThrPro: 4.188 ± 1.56
ThrGln: 1.047 ± 0.76
ThrArg: 4.188 ± 1.071
ThrSer: 6.283 ± 4.971
ThrThr: 4.188 ± 4.673
ThrVal: 3.141 ± 2.419
ThrTrp: 1.047 ± 1.578
ThrTyr: 2.094 ± 1.103
ThrXaa: 0.0 ± 0.0
Val
ValAla: 0.0 ± 0.0
ValCys: 0.0 ± 0.0
ValAsp: 1.047 ± 0.76
ValGlu: 3.141 ± 1.491
ValPhe: 2.094 ± 1.352
ValGly: 3.141 ± 1.58
ValHis: 2.094 ± 2.46
ValIle: 3.141 ± 2.141
ValLys: 4.188 ± 2.44
ValLeu: 3.141 ± 1.817
ValMet: 3.141 ± 1.955
ValAsn: 4.188 ± 1.679
ValPro: 4.188 ± 1.282
ValGln: 4.188 ± 2.519
ValArg: 3.141 ± 2.328
ValSer: 3.141 ± 1.316
ValThr: 1.047 ± 0.912
ValVal: 2.094 ± 1.352
ValTrp: 2.094 ± 1.327
ValTyr: 7.33 ± 2.517
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.094 ± 1.52
TrpCys: 0.0 ± 0.0
TrpAsp: 0.0 ± 0.0
TrpGlu: 1.047 ± 1.168
TrpPhe: 0.0 ± 0.0
TrpGly: 1.047 ± 0.76
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 2.094 ± 0.839
TrpLeu: 1.047 ± 0.912
TrpMet: 1.047 ± 0.912
TrpAsn: 1.047 ± 1.23
TrpPro: 0.0 ± 0.0
TrpGln: 1.047 ± 0.76
TrpArg: 2.094 ± 1.352
TrpSer: 0.0 ± 0.0
TrpThr: 2.094 ± 1.103
TrpVal: 2.094 ± 0.839
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.047 ± 1.578
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.188 ± 2.747
TyrCys: 0.0 ± 0.0
TyrAsp: 1.047 ± 0.912
TyrGlu: 2.094 ± 1.825
TyrPhe: 4.188 ± 1.1
TyrGly: 2.094 ± 0.839
TyrHis: 3.141 ± 2.141
TyrIle: 3.141 ± 0.953
TyrLys: 1.047 ± 0.76
TyrLeu: 6.283 ± 2.4
TyrMet: 2.094 ± 1.256
TyrAsn: 2.094 ± 0.839
TyrPro: 2.094 ± 1.49
TyrGln: 1.047 ± 0.76
TyrArg: 2.094 ± 1.825
TyrSer: 1.047 ± 0.76
TyrThr: 1.047 ± 1.168
TyrVal: 3.141 ± 2.557
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (956 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
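
Bootstrapping at the protein level can be sketched as below: whole proteins are resampled with replacement 100 times, and the spread of the resulting frequencies is taken as the error. This is only an illustration under stated assumptions. The function names, the toy sequences, the fixed seed, and the use of the standard deviation as the error measure are not taken from the source.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins, dipeptide):
    """Per mille frequency of one dipeptide over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over resamples of whole proteins."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome; the statistics on this page use 5 real proteins.
toy = ["MKVLAAGK", "AAGKSS", "MSSSTR", "MAAKL", "MGKSSA"]
value = dipeptide_permille(toy, "SS")
error = bootstrap_error(toy, "SS")
```

A dipeptide that occurs in none of the proteins has a frequency of 0 in every resample, so its error is also 0, matching the 0.0 ± 0.0 entries in the table.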

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski