Amino acid dipepetide frequency for Cowpea mosaic virus (strain SB) (CPMV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.16AlaAla: 5.16 ± 0.253
0.774AlaCys: 0.774 ± 0.529
3.612AlaAsp: 3.612 ± 0.417
5.16AlaGlu: 5.16 ± 0.579
4.128AlaPhe: 4.128 ± 0.595
5.16AlaGly: 5.16 ± 0.911
1.548AlaHis: 1.548 ± 0.318
5.934AlaIle: 5.934 ± 0.731
3.096AlaLys: 3.096 ± 0.43
3.612AlaLeu: 3.612 ± 1.79
2.58AlaMet: 2.58 ± 0.629
3.612AlaAsn: 3.612 ± 0.441
4.128AlaPro: 4.128 ± 1.056
4.902AlaGln: 4.902 ± 0.646
1.29AlaArg: 1.29 ± 0.205
4.902AlaSer: 4.902 ± 0.562
5.676AlaThr: 5.676 ± 1.574
4.128AlaVal: 4.128 ± 0.79
1.29AlaTrp: 1.29 ± 0.481
2.064AlaTyr: 2.064 ± 1.411
0.0AlaXaa: 0.0 ± 0.0
Cys
2.58CysAla: 2.58 ± 0.289
1.806CysCys: 1.806 ± 0.386
2.322CysAsp: 2.322 ± 0.237
1.29CysGlu: 1.29 ± 0.205
0.774CysPhe: 0.774 ± 0.554
2.838CysGly: 2.838 ± 0.502
0.258CysHis: 0.258 ± 0.176
0.0CysIle: 0.0 ± 0.0
1.806CysLys: 1.806 ± 0.556
1.29CysLeu: 1.29 ± 0.617
0.516CysMet: 0.516 ± 0.328
1.032CysAsn: 1.032 ± 0.041
1.032CysPro: 1.032 ± 0.041
0.774CysGln: 0.774 ± 0.153
0.774CysArg: 0.774 ± 0.258
2.838CysSer: 2.838 ± 0.271
0.774CysThr: 0.774 ± 0.153
2.064CysVal: 2.064 ± 0.081
0.774CysTrp: 0.774 ± 0.529
0.516CysTyr: 0.516 ± 0.189
0.0CysXaa: 0.0 ± 0.0
Asp
2.58AspAla: 2.58 ± 0.41
1.29AspCys: 1.29 ± 0.205
2.58AspAsp: 2.58 ± 0.41
2.322AspGlu: 2.322 ± 0.237
5.16AspPhe: 5.16 ± 0.253
3.612AspGly: 3.612 ± 0.441
0.258AspHis: 0.258 ± 0.176
3.096AspIle: 3.096 ± 0.613
2.58AspLys: 2.58 ± 1.085
3.87AspLeu: 3.87 ± 0.143
1.29AspMet: 1.29 ± 0.205
2.322AspAsn: 2.322 ± 0.237
3.096AspPro: 3.096 ± 0.122
1.032AspGln: 1.032 ± 0.706
0.258AspArg: 0.258 ± 0.531
3.354AspSer: 3.354 ± 0.788
2.064AspThr: 2.064 ± 0.634
4.902AspVal: 4.902 ± 0.748
2.064AspTrp: 2.064 ± 0.335
0.516AspTyr: 0.516 ± 0.353
0.0AspXaa: 0.0 ± 0.0
Glu
4.902GluAla: 4.902 ± 0.968
1.806GluCys: 1.806 ± 0.556
2.322GluAsp: 2.322 ± 0.909
4.386GluGlu: 4.386 ± 0.307
3.612GluPhe: 3.612 ± 0.152
3.096GluGly: 3.096 ± 0.122
0.0GluHis: 0.0 ± 0.0
2.064GluIle: 2.064 ± 0.081
7.74GluLys: 7.74 ± 1.23
4.128GluLeu: 4.128 ± 2.143
1.548GluMet: 1.548 ± 0.168
1.548GluAsn: 1.548 ± 0.307
1.548GluPro: 1.548 ± 0.38
2.322GluGln: 2.322 ± 0.237
1.032GluArg: 1.032 ± 0.572
3.612GluSer: 3.612 ± 1.617
2.58GluThr: 2.58 ± 0.749
4.128GluVal: 4.128 ± 0.595
0.516GluTrp: 0.516 ± 0.353
3.096GluTyr: 3.096 ± 0.761
0.0GluXaa: 0.0 ± 0.0
Phe
3.87PheAla: 3.87 ± 0.767
1.806PheCys: 1.806 ± 0.261
4.386PheAsp: 4.386 ± 0.307
3.612PheGlu: 3.612 ± 1.0
1.548PhePhe: 1.548 ± 0.515
3.096PheGly: 3.096 ± 0.761
1.032PheHis: 1.032 ± 0.041
2.322PheIle: 2.322 ± 0.237
2.322PheLys: 2.322 ± 0.237
6.45PheLeu: 6.45 ± 1.113
0.0PheMet: 0.0 ± 0.0
1.806PheAsn: 1.806 ± 0.139
3.354PhePro: 3.354 ± 1.114
0.774PheGln: 0.774 ± 0.529
3.87PheArg: 3.87 ± 1.485
3.096PheSer: 3.096 ± 1.341
1.806PheThr: 1.806 ± 0.261
4.644PheVal: 4.644 ± 0.27
0.258PheTrp: 0.258 ± 0.176
1.29PheTyr: 1.29 ± 0.225
0.0PheXaa: 0.0 ± 0.0
Gly
5.16GlyAla: 5.16 ± 1.246
3.612GlyCys: 3.612 ± 0.614
3.096GlyAsp: 3.096 ± 0.613
3.612GlyGlu: 3.612 ± 1.112
4.386GlyPhe: 4.386 ± 0.142
5.676GlyGly: 5.676 ± 0.177
0.774GlyHis: 0.774 ± 0.679
5.418GlyIle: 5.418 ± 0.344
5.676GlyLys: 5.676 ± 0.177
2.838GlyLeu: 2.838 ± 1.597
1.032GlyMet: 1.032 ± 0.041
3.87GlyAsn: 3.87 ± 0.767
1.548GlyPro: 1.548 ± 0.307
1.29GlyGln: 1.29 ± 0.481
3.096GlyArg: 3.096 ± 0.122
4.128GlySer: 4.128 ± 0.595
1.806GlyThr: 1.806 ± 0.139
5.418GlyVal: 5.418 ± 0.995
1.032GlyTrp: 1.032 ± 0.041
0.516GlyTyr: 0.516 ± 0.353
0.0GlyXaa: 0.0 ± 0.0
His
1.29HisAla: 1.29 ± 0.481
0.0HisCys: 0.0 ± 0.0
0.774HisAsp: 0.774 ± 0.153
0.516HisGlu: 0.516 ± 0.513
1.806HisPhe: 1.806 ± 0.139
0.258HisGly: 0.258 ± 0.176
0.258HisHis: 0.258 ± 0.331
1.806HisIle: 1.806 ± 0.139
0.258HisLys: 0.258 ± 0.176
1.032HisLeu: 1.032 ± 0.041
0.774HisMet: 0.774 ± 0.434
0.258HisAsn: 0.258 ± 0.176
0.258HisPro: 0.258 ± 0.176
0.258HisGln: 0.258 ± 0.176
1.548HisArg: 1.548 ± 0.38
1.032HisSer: 1.032 ± 0.706
2.064HisThr: 2.064 ± 1.312
2.58HisVal: 2.58 ± 0.748
0.0HisTrp: 0.0 ± 0.0
1.548HisTyr: 1.548 ± 1.058
0.0HisXaa: 0.0 ± 0.0
Ile
3.354IleAla: 3.354 ± 0.442
2.064IleCys: 2.064 ± 0.634
0.774IleAsp: 0.774 ± 0.153
2.838IleGlu: 2.838 ± 0.135
1.806IlePhe: 1.806 ± 0.261
4.128IleGly: 4.128 ± 0.162
1.548IleHis: 1.548 ± 0.984
2.838IleIle: 2.838 ± 0.787
2.322IleLys: 2.322 ± 0.46
5.16IleLeu: 5.16 ± 0.472
1.548IleMet: 1.548 ± 0.307
3.096IleAsn: 3.096 ± 0.122
3.612IlePro: 3.612 ± 0.441
2.322IleGln: 2.322 ± 0.46
2.322IleArg: 2.322 ± 0.237
5.934IleSer: 5.934 ± 0.394
1.806IleThr: 1.806 ± 0.5
4.644IleVal: 4.644 ± 0.794
0.516IleTrp: 0.516 ± 0.353
1.548IleTyr: 1.548 ± 0.307
0.0IleXaa: 0.0 ± 0.0
Lys
3.87LysAla: 3.87 ± 0.143
1.29LysCys: 1.29 ± 0.481
3.096LysAsp: 3.096 ± 0.122
5.418LysGlu: 5.418 ± 0.995
2.58LysPhe: 2.58 ± 0.642
2.58LysGly: 2.58 ± 1.085
1.548LysHis: 1.548 ± 0.38
2.838LysIle: 2.838 ± 0.356
2.064LysLys: 2.064 ± 0.413
5.676LysLeu: 5.676 ± 0.941
2.322LysMet: 2.322 ± 0.237
2.064LysAsn: 2.064 ± 0.732
1.548LysPro: 1.548 ± 0.307
1.806LysGln: 1.806 ± 0.556
3.612LysArg: 3.612 ± 0.526
3.354LysSer: 3.354 ± 0.402
3.354LysThr: 3.354 ± 0.271
3.354LysVal: 3.354 ± 1.614
0.774LysTrp: 0.774 ± 0.153
1.806LysTyr: 1.806 ± 0.139
0.0LysXaa: 0.0 ± 0.0
Leu
7.74LeuAla: 7.74 ± 0.571
1.29LeuCys: 1.29 ± 0.762
4.128LeuAsp: 4.128 ± 1.267
5.676LeuGlu: 5.676 ± 1.845
2.064LeuPhe: 2.064 ± 0.732
4.386LeuGly: 4.386 ± 0.142
1.29LeuHis: 1.29 ± 0.481
3.354LeuIle: 3.354 ± 0.271
5.934LeuLys: 5.934 ± 0.536
10.062LeuLeu: 10.062 ± 1.521
3.612LeuMet: 3.612 ± 0.441
3.87LeuAsn: 3.87 ± 0.443
5.16LeuPro: 5.16 ± 1.143
0.774LeuGln: 0.774 ± 0.529
4.386LeuArg: 4.386 ± 0.427
7.998LeuSer: 7.998 ± 1.289
2.58LeuThr: 2.58 ± 0.155
6.45LeuVal: 6.45 ± 2.373
1.032LeuTrp: 1.032 ± 0.399
4.128LeuTyr: 4.128 ± 0.571
0.0LeuXaa: 0.0 ± 0.0
Met
2.322MetAla: 2.322 ± 0.237
0.0MetCys: 0.0 ± 0.0
0.774MetAsp: 0.774 ± 0.529
2.064MetGlu: 2.064 ± 0.081
0.258MetPhe: 0.258 ± 0.259
2.58MetGly: 2.58 ± 0.289
0.516MetHis: 0.516 ± 0.353
1.806MetIle: 1.806 ± 0.139
0.516MetLys: 0.516 ± 0.353
2.064MetLeu: 2.064 ± 0.634
0.774MetMet: 0.774 ± 0.529
1.032MetAsn: 1.032 ± 0.656
2.322MetPro: 2.322 ± 0.46
1.548MetGln: 1.548 ± 1.058
1.806MetArg: 1.806 ± 0.139
3.096MetSer: 3.096 ± 0.122
1.806MetThr: 1.806 ± 0.139
2.838MetVal: 2.838 ± 1.261
0.516MetTrp: 0.516 ± 0.328
0.774MetTyr: 0.774 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
3.096AsnAla: 3.096 ± 0.761
1.29AsnCys: 1.29 ± 0.205
1.548AsnAsp: 1.548 ± 1.058
1.032AsnGlu: 1.032 ± 0.041
3.096AsnPhe: 3.096 ± 0.122
2.58AsnGly: 2.58 ± 0.289
0.516AsnHis: 0.516 ± 0.328
2.838AsnIle: 2.838 ± 0.135
2.064AsnLys: 2.064 ± 0.081
5.418AsnLeu: 5.418 ± 0.344
1.806AsnMet: 1.806 ± 0.139
1.548AsnAsn: 1.548 ± 0.168
5.16AsnPro: 5.16 ± 1.923
1.29AsnGln: 1.29 ± 0.506
2.064AsnArg: 2.064 ± 0.081
3.096AsnSer: 3.096 ± 1.289
1.806AsnThr: 1.806 ± 0.139
4.644AsnVal: 4.644 ± 0.588
1.032AsnTrp: 1.032 ± 0.656
0.774AsnTyr: 0.774 ± 0.529
0.0AsnXaa: 0.0 ± 0.0
Pro
2.322ProAla: 2.322 ± 0.909
1.548ProCys: 1.548 ± 0.307
0.774ProAsp: 0.774 ± 0.529
4.128ProGlu: 4.128 ± 0.79
2.064ProPhe: 2.064 ± 0.081
2.58ProGly: 2.58 ± 0.289
1.032ProHis: 1.032 ± 0.041
1.548ProIle: 1.548 ± 0.307
1.806ProLys: 1.806 ± 0.5
4.386ProLeu: 4.386 ± 0.762
2.064ProMet: 2.064 ± 0.732
3.354ProAsn: 3.354 ± 1.792
2.58ProPro: 2.58 ± 0.961
1.548ProGln: 1.548 ± 0.307
2.064ProArg: 2.064 ± 0.634
4.128ProSer: 4.128 ± 0.162
3.87ProThr: 3.87 ± 0.143
3.612ProVal: 3.612 ± 1.316
0.516ProTrp: 0.516 ± 0.328
2.322ProTyr: 2.322 ± 0.193
0.0ProXaa: 0.0 ± 0.0
Gln
2.064GlnAla: 2.064 ± 0.732
0.258GlnCys: 0.258 ± 0.531
1.548GlnAsp: 1.548 ± 0.307
1.29GlnGlu: 1.29 ± 0.205
2.58GlnPhe: 2.58 ± 0.289
4.128GlnGly: 4.128 ± 0.79
0.516GlnHis: 0.516 ± 0.353
1.548GlnIle: 1.548 ± 0.38
0.774GlnLys: 0.774 ± 0.153
3.354GlnLeu: 3.354 ± 0.936
1.806GlnMet: 1.806 ± 0.556
1.29GlnAsn: 1.29 ± 0.481
2.58GlnPro: 2.58 ± 0.961
2.838GlnGln: 2.838 ± 0.585
0.774GlnArg: 0.774 ± 0.554
3.096GlnSer: 3.096 ± 0.761
1.548GlnThr: 1.548 ± 0.984
3.612GlnVal: 3.612 ± 0.441
0.258GlnTrp: 0.258 ± 0.176
1.806GlnTyr: 1.806 ± 0.261
0.0GlnXaa: 0.0 ± 0.0
Arg
2.322ArgAla: 2.322 ± 0.894
0.516ArgCys: 0.516 ± 0.189
2.064ArgAsp: 2.064 ± 0.732
1.29ArgGlu: 1.29 ± 0.205
3.354ArgPhe: 3.354 ± 0.271
4.128ArgGly: 4.128 ± 1.613
0.774ArgHis: 0.774 ± 0.554
1.548ArgIle: 1.548 ± 0.38
4.128ArgLys: 4.128 ± 0.825
1.548ArgLeu: 1.548 ± 0.38
1.29ArgMet: 1.29 ± 0.481
1.806ArgAsn: 1.806 ± 0.547
0.774ArgPro: 0.774 ± 0.153
2.322ArgGln: 2.322 ± 0.46
3.612ArgArg: 3.612 ± 1.234
4.128ArgSer: 4.128 ± 1.817
3.87ArgThr: 3.87 ± 0.443
4.386ArgVal: 4.386 ± 0.762
0.516ArgTrp: 0.516 ± 0.328
2.064ArgTyr: 2.064 ± 0.413
0.0ArgXaa: 0.0 ± 0.0
Ser
5.676SerAla: 5.676 ± 0.677
2.322SerCys: 2.322 ± 0.193
4.128SerAsp: 4.128 ± 1.625
2.838SerGlu: 2.838 ± 1.261
3.354SerPhe: 3.354 ± 0.788
6.708SerGly: 6.708 ± 0.884
1.806SerHis: 1.806 ± 0.139
4.386SerIle: 4.386 ± 1.437
4.128SerLys: 4.128 ± 0.277
8.514SerLeu: 8.514 ± 1.181
2.58SerMet: 2.58 ± 0.289
3.096SerAsn: 3.096 ± 0.122
3.612SerPro: 3.612 ± 0.94
5.934SerGln: 5.934 ± 0.187
4.902SerArg: 4.902 ± 0.417
3.87SerSer: 3.87 ± 0.79
3.87SerThr: 3.87 ± 1.151
4.386SerVal: 4.386 ± 0.307
1.806SerTrp: 1.806 ± 0.139
1.548SerTyr: 1.548 ± 0.168
0.0SerXaa: 0.0 ± 0.0
Thr
4.644ThrAla: 4.644 ± 1.595
1.806ThrCys: 1.806 ± 0.901
2.064ThrAsp: 2.064 ± 0.634
2.58ThrGlu: 2.58 ± 1.03
3.87ThrPhe: 3.87 ± 1.442
2.064ThrGly: 2.064 ± 0.634
1.29ThrHis: 1.29 ± 0.481
4.128ThrIle: 4.128 ± 0.162
3.096ThrLys: 3.096 ± 0.122
4.644ThrLeu: 4.644 ± 0.689
0.516ThrMet: 0.516 ± 0.353
2.322ThrAsn: 2.322 ± 0.237
3.096ThrPro: 3.096 ± 0.613
1.29ThrGln: 1.29 ± 0.481
2.064ThrArg: 2.064 ± 0.355
3.612ThrSer: 3.612 ± 0.94
3.87ThrThr: 3.87 ± 1.442
4.902ThrVal: 4.902 ± 0.748
2.322ThrTrp: 2.322 ± 0.193
1.032ThrTyr: 1.032 ± 0.041
0.0ThrXaa: 0.0 ± 0.0
Val
7.74ValAla: 7.74 ± 0.579
1.29ValCys: 1.29 ± 0.205
2.58ValAsp: 2.58 ± 0.41
3.612ValGlu: 3.612 ± 0.441
2.58ValPhe: 2.58 ± 0.41
3.354ValGly: 3.354 ± 0.271
1.29ValHis: 1.29 ± 0.12
4.386ValIle: 4.386 ± 0.307
2.322ValLys: 2.322 ± 0.815
6.708ValLeu: 6.708 ± 0.275
1.548ValMet: 1.548 ± 0.468
4.386ValAsn: 4.386 ± 0.427
2.064ValPro: 2.064 ± 1.411
4.128ValGln: 4.128 ± 0.482
5.418ValArg: 5.418 ± 0.806
7.74ValSer: 7.74 ± 1.901
7.74ValThr: 7.74 ± 0.868
5.16ValVal: 5.16 ± 0.468
0.774ValTrp: 0.774 ± 0.529
2.58ValTyr: 2.58 ± 0.642
0.0ValXaa: 0.0 ± 0.0
Trp
1.032TrpAla: 1.032 ± 0.041
0.0TrpCys: 0.0 ± 0.0
1.806TrpAsp: 1.806 ± 0.556
0.0TrpGlu: 0.0 ± 0.0
0.774TrpPhe: 0.774 ± 0.258
0.258TrpGly: 0.258 ± 0.176
0.774TrpHis: 0.774 ± 0.529
1.032TrpIle: 1.032 ± 0.041
1.29TrpLys: 1.29 ± 0.205
1.032TrpLeu: 1.032 ± 0.041
0.774TrpMet: 0.774 ± 0.153
1.548TrpAsn: 1.548 ± 0.984
0.258TrpPro: 0.258 ± 0.176
0.0TrpGln: 0.0 ± 0.0
1.032TrpArg: 1.032 ± 0.368
2.838TrpSer: 2.838 ± 0.135
0.774TrpThr: 0.774 ± 0.531
0.258TrpVal: 0.258 ± 0.176
0.0TrpTrp: 0.0 ± 0.0
0.774TrpTyr: 0.774 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.29TyrAla: 1.29 ± 0.205
1.29TyrCys: 1.29 ± 0.225
3.87TyrAsp: 3.87 ± 0.286
2.322TyrGlu: 2.322 ± 0.909
1.548TyrPhe: 1.548 ± 0.484
0.774TyrGly: 0.774 ± 0.529
1.29TyrHis: 1.29 ± 0.205
1.29TyrIle: 1.29 ± 0.617
1.032TyrLys: 1.032 ± 0.399
4.128TyrLeu: 4.128 ± 1.312
0.774TyrMet: 0.774 ± 0.529
2.838TyrAsn: 2.838 ± 1.261
0.516TyrPro: 0.516 ± 0.353
0.516TyrGln: 0.516 ± 0.353
0.516TyrArg: 0.516 ± 0.353
3.612TyrSer: 3.612 ± 0.152
1.548TyrThr: 1.548 ± 0.307
1.29TyrVal: 1.29 ± 0.205
0.258TyrTrp: 0.258 ± 0.176
0.774TyrTyr: 0.774 ± 0.529
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3877 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski