Amino acid dipeptide frequency for Cassava virus X

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
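As a rough illustration of what such a calculation involves, the Python sketch below counts overlapping adjacent pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice to normalise by the total number of pairs are assumptions made for this example; the database may normalise differently, so the sketch is not expected to reproduce the table exactly.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: sequences use one-letter amino acid codes, and pairs are
    counted inside each protein only (never across two proteins).
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence (overlapping pairs).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Normalise by the total number of observed pairs and scale to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not real Cassava virus X sequences.
freqs = dipeptide_frequencies(["MAAL", "AAK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

With real data, the input would be the protein sequences of the proteome, and the one-letter pairs would be mapped to the three-letter labels used in the table (for example `AL` to AlaLeu).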

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
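The arithmetic behind the expected frequency and the per mille scale can be checked with a few lines of Python. This is only a sketch: the helper names `expected_per_mille`, `to_fraction` and `classify` are invented here, and comparing each value against the uniform expectation is an assumption about how the red and blue marking is decided.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def to_fraction(per_mille):
    """Convert a per mille value to a plain fraction."""
    return per_mille * 1e-3


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "expected"


print(expected_per_mille(20))            # 400 dipeptides
print(round(expected_per_mille(21), 3))  # 441 dipeptides, Xaa included
print(to_fraction(11.918))               # AlaLeu value from the table
print(classify(11.918), classify(0.542))
```

Under this assumed rule, AlaLeu (11.918‰) lies above the uniform expectation of 2.5‰ and AlaCys (0.542‰) lies below it.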

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
7.584AlaAla: 7.584 ± 2.508
0.542AlaCys: 0.542 ± 0.85
3.25AlaAsp: 3.25 ± 0.551
2.167AlaGlu: 2.167 ± 1.472
4.334AlaPhe: 4.334 ± 3.795
5.959AlaGly: 5.959 ± 1.706
2.709AlaHis: 2.709 ± 1.884
4.875AlaIle: 4.875 ± 2.491
6.501AlaLys: 6.501 ± 2.5
11.918AlaLeu: 11.918 ± 1.809
2.167AlaMet: 2.167 ± 1.106
2.709AlaAsn: 2.709 ± 1.382
4.334AlaPro: 4.334 ± 1.18
2.167AlaGln: 2.167 ± 0.634
3.792AlaArg: 3.792 ± 1.157
7.042AlaSer: 7.042 ± 1.706
5.417AlaThr: 5.417 ± 1.785
5.959AlaVal: 5.959 ± 2.719
0.542AlaTrp: 0.542 ± 0.917
4.334AlaTyr: 4.334 ± 1.399
0.0AlaXaa: 0.0 ± 0.0
Cys
1.625CysAla: 1.625 ± 2.141
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.625CysGlu: 1.625 ± 0.644
1.625CysPhe: 1.625 ± 0.829
0.542CysGly: 0.542 ± 0.276
0.0CysHis: 0.0 ± 0.0
1.083CysIle: 1.083 ± 1.357
1.083CysLys: 1.083 ± 1.357
0.542CysLeu: 0.542 ± 0.276
0.0CysMet: 0.0 ± 0.0
0.542CysAsn: 0.542 ± 0.917
1.625CysPro: 1.625 ± 1.257
0.542CysGln: 0.542 ± 0.276
0.542CysArg: 0.542 ± 1.501
2.709CysSer: 2.709 ± 1.598
0.542CysThr: 0.542 ± 1.501
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.542CysTyr: 0.542 ± 1.501
0.0CysXaa: 0.0 ± 0.0
Asp
3.792AspAla: 3.792 ± 1.221
1.083AspCys: 1.083 ± 0.553
3.25AspAsp: 3.25 ± 0.935
4.334AspGlu: 4.334 ± 1.522
3.792AspPhe: 3.792 ± 1.157
1.083AspGly: 1.083 ± 0.553
0.542AspHis: 0.542 ± 0.276
5.959AspIle: 5.959 ± 1.363
2.167AspLys: 2.167 ± 0.699
4.875AspLeu: 4.875 ± 0.992
1.625AspMet: 1.625 ± 0.629
2.167AspAsn: 2.167 ± 1.213
3.25AspPro: 3.25 ± 1.08
1.083AspGln: 1.083 ± 0.553
3.25AspArg: 3.25 ± 1.08
4.875AspSer: 4.875 ± 1.138
1.083AspThr: 1.083 ± 0.701
2.167AspVal: 2.167 ± 1.106
1.083AspTrp: 1.083 ± 0.553
2.167AspTyr: 2.167 ± 0.699
0.0AspXaa: 0.0 ± 0.0
Glu
3.25GluAla: 3.25 ± 0.551
0.542GluCys: 0.542 ± 0.276
3.25GluAsp: 3.25 ± 0.935
3.25GluGlu: 3.25 ± 0.551
1.625GluPhe: 1.625 ± 0.829
4.875GluGly: 4.875 ± 1.528
0.542GluHis: 0.542 ± 0.276
2.709GluIle: 2.709 ± 1.382
5.417GluLys: 5.417 ± 2.037
4.875GluLeu: 4.875 ± 1.777
0.542GluMet: 0.542 ± 0.276
2.167GluAsn: 2.167 ± 1.552
4.875GluPro: 4.875 ± 0.992
1.083GluGln: 1.083 ± 1.346
3.792GluArg: 3.792 ± 1.935
2.167GluSer: 2.167 ± 1.106
3.25GluThr: 3.25 ± 1.658
5.417GluVal: 5.417 ± 2.214
1.083GluTrp: 1.083 ± 0.553
0.542GluTyr: 0.542 ± 0.85
0.0GluXaa: 0.0 ± 0.0
Phe
1.625PheAla: 1.625 ± 0.629
1.083PheCys: 1.083 ± 0.736
3.25PheAsp: 3.25 ± 1.258
3.25PheGlu: 3.25 ± 1.287
3.25PhePhe: 3.25 ± 0.551
1.625PheGly: 1.625 ± 1.534
1.625PheHis: 1.625 ± 1.257
2.709PheIle: 2.709 ± 0.672
1.083PheLys: 1.083 ± 0.553
4.334PheLeu: 4.334 ± 3.853
1.083PheMet: 1.083 ± 0.553
2.167PheAsn: 2.167 ± 1.106
3.25PhePro: 3.25 ± 1.08
2.709PheGln: 2.709 ± 1.341
2.167PheArg: 2.167 ± 0.699
2.709PheSer: 2.709 ± 1.231
2.709PheThr: 2.709 ± 0.749
1.625PheVal: 1.625 ± 1.098
0.542PheTrp: 0.542 ± 0.276
0.542PheTyr: 0.542 ± 0.276
0.0PheXaa: 0.0 ± 0.0
Gly
5.959GlyAla: 5.959 ± 1.706
1.083GlyCys: 1.083 ± 0.553
3.25GlyAsp: 3.25 ± 1.309
2.167GlyGlu: 2.167 ± 1.403
2.709GlyPhe: 2.709 ± 1.709
3.792GlyGly: 3.792 ± 1.19
1.083GlyHis: 1.083 ± 0.553
2.167GlyIle: 2.167 ± 0.634
3.792GlyLys: 3.792 ± 1.028
3.25GlyLeu: 3.25 ± 1.395
0.542GlyMet: 0.542 ± 1.323
2.167GlyAsn: 2.167 ± 1.82
1.083GlyPro: 1.083 ± 0.553
2.167GlyGln: 2.167 ± 0.867
0.542GlyArg: 0.542 ± 0.276
2.709GlySer: 2.709 ± 1.382
3.25GlyThr: 3.25 ± 0.935
2.709GlyVal: 2.709 ± 1.388
0.542GlyTrp: 0.542 ± 0.276
1.625GlyTyr: 1.625 ± 0.829
0.0GlyXaa: 0.0 ± 0.0
His
2.167HisAla: 2.167 ± 0.699
0.542HisCys: 0.542 ± 0.917
1.083HisAsp: 1.083 ± 0.553
2.167HisGlu: 2.167 ± 1.106
1.625HisPhe: 1.625 ± 0.644
1.625HisGly: 1.625 ± 0.644
1.625HisHis: 1.625 ± 0.829
0.542HisIle: 0.542 ± 1.501
0.542HisLys: 0.542 ± 0.276
4.875HisLeu: 4.875 ± 1.875
0.0HisMet: 0.0 ± 0.0
1.083HisAsn: 1.083 ± 0.701
2.167HisPro: 2.167 ± 1.213
1.625HisGln: 1.625 ± 0.644
2.709HisArg: 2.709 ± 1.195
2.167HisSer: 2.167 ± 1.358
1.083HisThr: 1.083 ± 0.701
1.083HisVal: 1.083 ± 0.553
0.0HisTrp: 0.0 ± 0.0
1.083HisTyr: 1.083 ± 0.553
0.0HisXaa: 0.0 ± 0.0
Ile
3.792IleAla: 3.792 ± 1.503
1.625IleCys: 1.625 ± 2.848
0.542IleAsp: 0.542 ± 0.276
4.334IleGlu: 4.334 ± 1.096
3.792IlePhe: 3.792 ± 2.455
1.625IleGly: 1.625 ± 1.64
2.167IleHis: 2.167 ± 1.403
3.792IleIle: 3.792 ± 2.382
5.959IleLys: 5.959 ± 2.3
7.042IleLeu: 7.042 ± 2.221
1.083IleMet: 1.083 ± 0.553
4.875IleAsn: 4.875 ± 1.651
3.25IlePro: 3.25 ± 1.658
2.709IleGln: 2.709 ± 1.341
4.334IleArg: 4.334 ± 1.096
3.792IleSer: 3.792 ± 1.221
1.625IleThr: 1.625 ± 0.829
2.167IleVal: 2.167 ± 1.106
0.0IleTrp: 0.0 ± 0.0
0.542IleTyr: 0.542 ± 0.276
0.0IleXaa: 0.0 ± 0.0
Lys
5.417LysAla: 5.417 ± 1.021
0.0LysCys: 0.0 ± 0.0
4.334LysAsp: 4.334 ± 2.211
4.334LysGlu: 4.334 ± 1.398
1.625LysPhe: 1.625 ± 1.64
1.625LysGly: 1.625 ± 0.629
1.625LysHis: 1.625 ± 0.829
3.792LysIle: 3.792 ± 1.277
2.167LysLys: 2.167 ± 1.106
8.126LysLeu: 8.126 ± 1.716
1.083LysMet: 1.083 ± 0.736
0.542LysAsn: 0.542 ± 0.276
4.875LysPro: 4.875 ± 1.02
3.25LysGln: 3.25 ± 1.048
1.083LysArg: 1.083 ± 0.553
3.792LysSer: 3.792 ± 1.157
5.417LysThr: 5.417 ± 1.282
2.709LysVal: 2.709 ± 1.341
0.542LysTrp: 0.542 ± 0.276
0.542LysTyr: 0.542 ± 0.85
0.0LysXaa: 0.0 ± 0.0
Leu
9.751LeuAla: 9.751 ± 3.053
0.542LeuCys: 0.542 ± 0.276
7.042LeuAsp: 7.042 ± 1.883
4.875LeuGlu: 4.875 ± 1.651
3.25LeuPhe: 3.25 ± 1.658
3.25LeuGly: 3.25 ± 0.935
3.792LeuHis: 3.792 ± 1.935
5.959LeuIle: 5.959 ± 2.906
8.667LeuLys: 8.667 ± 1.879
9.751LeuLeu: 9.751 ± 2.879
0.542LeuMet: 0.542 ± 0.85
4.334LeuAsn: 4.334 ± 0.683
5.959LeuPro: 5.959 ± 2.174
5.959LeuGln: 5.959 ± 0.957
2.709LeuArg: 2.709 ± 1.317
8.667LeuSer: 8.667 ± 2.839
7.584LeuThr: 7.584 ± 2.381
3.792LeuVal: 3.792 ± 3.599
2.167LeuTrp: 2.167 ± 0.634
2.167LeuTyr: 2.167 ± 1.106
0.0LeuXaa: 0.0 ± 0.0
Met
2.167MetAla: 2.167 ± 0.634
1.083MetCys: 1.083 ± 1.346
0.542MetAsp: 0.542 ± 0.85
0.542MetGlu: 0.542 ± 0.276
0.542MetPhe: 0.542 ± 0.276
2.167MetGly: 2.167 ± 0.634
0.0MetHis: 0.0 ± 0.0
1.083MetIle: 1.083 ± 0.736
0.542MetLys: 0.542 ± 0.276
3.25MetLeu: 3.25 ± 0.935
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
1.625MetPro: 1.625 ± 1.257
1.083MetGln: 1.083 ± 0.553
2.709MetArg: 2.709 ± 0.749
1.083MetSer: 1.083 ± 0.553
0.542MetThr: 0.542 ± 0.276
1.083MetVal: 1.083 ± 0.553
0.0MetTrp: 0.0 ± 0.0
0.542MetTyr: 0.542 ± 0.276
0.0MetXaa: 0.0 ± 0.0
Asn
3.792AsnAla: 3.792 ± 1.157
1.083AsnCys: 1.083 ± 0.553
1.625AsnAsp: 1.625 ± 0.829
1.625AsnGlu: 1.625 ± 0.829
0.0AsnPhe: 0.0 ± 0.0
0.542AsnGly: 0.542 ± 1.501
2.167AsnHis: 2.167 ± 1.358
1.625AsnIle: 1.625 ± 0.629
2.709AsnLys: 2.709 ± 0.847
2.709AsnLeu: 2.709 ± 0.749
0.542AsnMet: 0.542 ± 1.617
1.625AsnAsn: 1.625 ± 0.829
1.625AsnPro: 1.625 ± 1.534
1.625AsnGln: 1.625 ± 0.829
3.792AsnArg: 3.792 ± 2.672
3.25AsnSer: 3.25 ± 2.208
2.709AsnThr: 2.709 ± 0.749
1.083AsnVal: 1.083 ± 0.553
0.0AsnTrp: 0.0 ± 0.0
2.709AsnTyr: 2.709 ± 1.382
0.0AsnXaa: 0.0 ± 0.0
Pro
3.792ProAla: 3.792 ± 2.07
1.083ProCys: 1.083 ± 1.346
4.875ProAsp: 4.875 ± 2.019
5.417ProGlu: 5.417 ± 1.114
2.167ProPhe: 2.167 ± 0.699
2.709ProGly: 2.709 ± 1.382
2.167ProHis: 2.167 ± 1.358
3.792ProIle: 3.792 ± 1.935
1.625ProLys: 1.625 ± 0.629
4.334ProLeu: 4.334 ± 2.191
1.625ProMet: 1.625 ± 0.829
3.25ProAsn: 3.25 ± 4.07
7.042ProPro: 7.042 ± 3.528
3.25ProGln: 3.25 ± 1.309
2.709ProArg: 2.709 ± 1.382
4.334ProSer: 4.334 ± 1.734
5.417ProThr: 5.417 ± 1.498
3.25ProVal: 3.25 ± 1.048
1.625ProTrp: 1.625 ± 0.829
1.625ProTyr: 1.625 ± 0.829
0.0ProXaa: 0.0 ± 0.0
Gln
6.501GlnAla: 6.501 ± 3.166
1.083GlnCys: 1.083 ± 1.357
2.709GlnAsp: 2.709 ± 1.231
2.709GlnGlu: 2.709 ± 0.847
1.083GlnPhe: 1.083 ± 0.553
1.083GlnGly: 1.083 ± 0.736
0.542GlnHis: 0.542 ± 0.276
3.792GlnIle: 3.792 ± 1.221
2.709GlnLys: 2.709 ± 1.382
6.501GlnLeu: 6.501 ± 0.988
0.542GlnMet: 0.542 ± 0.276
1.625GlnAsn: 1.625 ± 0.644
1.625GlnPro: 1.625 ± 0.829
1.083GlnGln: 1.083 ± 0.553
0.542GlnArg: 0.542 ± 0.917
2.167GlnSer: 2.167 ± 1.403
2.709GlnThr: 2.709 ± 0.749
0.542GlnVal: 0.542 ± 0.276
0.542GlnTrp: 0.542 ± 0.276
1.083GlnTyr: 1.083 ± 0.701
0.0GlnXaa: 0.0 ± 0.0
Arg
5.417ArgAla: 5.417 ± 1.498
0.542ArgCys: 0.542 ± 0.276
3.25ArgAsp: 3.25 ± 1.309
2.709ArgGlu: 2.709 ± 1.709
3.25ArgPhe: 3.25 ± 1.395
3.25ArgGly: 3.25 ± 2.821
2.167ArgHis: 2.167 ± 0.867
1.083ArgIle: 1.083 ± 0.553
2.709ArgLys: 2.709 ± 1.382
3.25ArgLeu: 3.25 ± 1.658
1.083ArgMet: 1.083 ± 0.553
2.167ArgAsn: 2.167 ± 0.699
3.25ArgPro: 3.25 ± 1.287
2.167ArgGln: 2.167 ± 1.106
1.625ArgArg: 1.625 ± 1.745
3.25ArgSer: 3.25 ± 1.448
3.792ArgThr: 3.792 ± 2.455
3.25ArgVal: 3.25 ± 1.658
0.542ArgTrp: 0.542 ± 0.276
1.625ArgTyr: 1.625 ± 0.829
0.0ArgXaa: 0.0 ± 0.0
Ser
2.167SerAla: 2.167 ± 2.379
1.083SerCys: 1.083 ± 1.357
4.334SerAsp: 4.334 ± 1.951
2.709SerGlu: 2.709 ± 1.382
2.709SerPhe: 2.709 ± 1.195
3.25SerGly: 3.25 ± 2.589
1.625SerHis: 1.625 ± 0.644
3.792SerIle: 3.792 ± 1.935
3.25SerLys: 3.25 ± 0.551
7.042SerLeu: 7.042 ± 1.404
1.625SerMet: 1.625 ± 0.642
2.167SerAsn: 2.167 ± 0.634
2.167SerPro: 2.167 ± 1.472
3.792SerGln: 3.792 ± 1.436
6.501SerArg: 6.501 ± 1.306
6.501SerSer: 6.501 ± 1.228
4.334SerThr: 4.334 ± 2.127
5.417SerVal: 5.417 ± 2.27
0.542SerTrp: 0.542 ± 0.917
3.792SerTyr: 3.792 ± 1.19
0.0SerXaa: 0.0 ± 0.0
Thr
8.667ThrAla: 8.667 ± 4.697
0.542ThrCys: 0.542 ± 0.85
2.709ThrAsp: 2.709 ± 1.382
3.792ThrGlu: 3.792 ± 1.277
2.167ThrPhe: 2.167 ± 1.106
2.167ThrGly: 2.167 ± 1.106
2.709ThrHis: 2.709 ± 0.672
3.792ThrIle: 3.792 ± 1.935
1.625ThrLys: 1.625 ± 1.64
5.959ThrLeu: 5.959 ± 1.657
3.792ThrMet: 3.792 ± 1.238
1.625ThrAsn: 1.625 ± 0.829
7.042ThrPro: 7.042 ± 0.852
1.625ThrGln: 1.625 ± 1.745
2.167ThrArg: 2.167 ± 0.634
2.709ThrSer: 2.709 ± 0.749
9.209ThrThr: 9.209 ± 4.065
3.792ThrVal: 3.792 ± 1.315
0.0ThrTrp: 0.0 ± 0.0
2.709ThrTyr: 2.709 ± 1.231
0.0ThrXaa: 0.0 ± 0.0
Val
3.792ValAla: 3.792 ± 2.961
0.542ValCys: 0.542 ± 1.501
2.167ValAsp: 2.167 ± 1.403
3.25ValGlu: 3.25 ± 1.658
1.625ValPhe: 1.625 ± 1.257
3.25ValGly: 3.25 ± 1.309
1.625ValHis: 1.625 ± 0.644
4.875ValIle: 4.875 ± 2.253
2.167ValLys: 2.167 ± 0.634
3.25ValLeu: 3.25 ± 1.658
1.625ValMet: 1.625 ± 0.829
0.542ValAsn: 0.542 ± 0.917
4.334ValPro: 4.334 ± 0.683
2.167ValGln: 2.167 ± 1.403
2.709ValArg: 2.709 ± 0.749
0.0ValSer: 0.0 ± 0.0
5.959ValThr: 5.959 ± 2.062
9.209ValVal: 9.209 ± 2.036
0.542ValTrp: 0.542 ± 0.917
2.709ValTyr: 2.709 ± 1.317
0.0ValXaa: 0.0 ± 0.0
Trp
1.625TrpAla: 1.625 ± 0.629
0.542TrpCys: 0.542 ± 0.276
1.083TrpAsp: 1.083 ± 0.736
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
1.625TrpLeu: 1.625 ± 0.829
0.0TrpMet: 0.0 ± 0.0
0.542TrpAsn: 0.542 ± 0.917
1.083TrpPro: 1.083 ± 0.553
0.0TrpGln: 0.0 ± 0.0
1.625TrpArg: 1.625 ± 0.829
0.0TrpSer: 0.0 ± 0.0
1.083TrpThr: 1.083 ± 0.736
1.083TrpVal: 1.083 ± 0.553
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.417TyrAla: 5.417 ± 1.91
0.542TyrCys: 0.542 ± 1.501
1.083TyrAsp: 1.083 ± 0.553
0.0TyrGlu: 0.0 ± 0.0
2.167TyrPhe: 2.167 ± 1.213
2.167TyrGly: 2.167 ± 0.699
1.083TyrHis: 1.083 ± 0.701
1.625TyrIle: 1.625 ± 0.829
2.167TyrLys: 2.167 ± 0.699
3.25TyrLeu: 3.25 ± 1.287
0.542TyrMet: 0.542 ± 0.276
1.083TyrAsn: 1.083 ± 0.553
1.625TyrPro: 1.625 ± 1.534
1.083TyrGln: 1.083 ± 1.357
1.083TyrArg: 1.083 ± 0.553
4.875TyrSer: 4.875 ± 1.777
1.083TyrThr: 1.083 ± 0.553
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
1.083TyrTyr: 1.083 ± 0.736
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1847 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
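A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is taken as the error. The function names, the fixed seed, and the use of the population standard deviation are assumptions made for this example; the exact procedure used by the database is not described here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one sequence (one-letter codes)."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread (per mille) of one dipeptide frequency over bootstrap rounds."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the same count.
        sample = [rng.choice(per_protein) for _ in proteins]
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Toy input, not real Cassava virus X sequences.
toy = ["MAALKAAL", "MKKLAA", "MSTTAALL", "MAVVK"]
print(round(bootstrap_error(toy, "AA"), 3))
```

With only 4 proteins in the proteome, each resample draws from very few units, which is consistent with errors in the table that are sometimes larger than the values themselves.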

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski