Amino acid dipepetide frequency for Carnation Italian ringspot virus (CIRV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.817AlaAla: 8.817 ± 2.327
0.0AlaCys: 0.0 ± 0.0
1.556AlaAsp: 1.556 ± 0.797
2.075AlaGlu: 2.075 ± 0.363
2.075AlaPhe: 2.075 ± 0.821
6.743AlaGly: 6.743 ± 1.054
1.556AlaHis: 1.556 ± 1.006
6.224AlaIle: 6.224 ± 1.15
6.743AlaLys: 6.743 ± 2.285
6.743AlaLeu: 6.743 ± 1.11
3.112AlaMet: 3.112 ± 1.145
4.149AlaAsn: 4.149 ± 1.459
1.556AlaPro: 1.556 ± 0.797
1.556AlaGln: 1.556 ± 0.582
5.705AlaArg: 5.705 ± 2.542
5.187AlaSer: 5.187 ± 1.204
4.668AlaThr: 4.668 ± 1.29
5.705AlaVal: 5.705 ± 1.023
2.075AlaTrp: 2.075 ± 0.363
1.556AlaTyr: 1.556 ± 0.45
0.0AlaXaa: 0.0 ± 0.0
Cys
2.075CysAla: 2.075 ± 0.363
1.037CysCys: 1.037 ± 0.511
0.0CysAsp: 0.0 ± 0.0
2.075CysGlu: 2.075 ± 0.611
1.037CysPhe: 1.037 ± 0.511
0.519CysGly: 0.519 ± 0.335
0.0CysHis: 0.0 ± 0.0
0.519CysIle: 0.519 ± 0.335
0.0CysLys: 0.0 ± 0.0
0.519CysLeu: 0.519 ± 0.335
0.0CysMet: 0.0 ± 0.0
1.037CysAsn: 1.037 ± 0.511
1.037CysPro: 1.037 ± 0.547
0.519CysGln: 0.519 ± 0.335
0.519CysArg: 0.519 ± 0.335
0.519CysSer: 0.519 ± 0.335
1.037CysThr: 1.037 ± 0.79
2.593CysVal: 2.593 ± 1.081
0.0CysTrp: 0.0 ± 0.0
0.519CysTyr: 0.519 ± 0.542
0.0CysXaa: 0.0 ± 0.0
Asp
2.075AspAla: 2.075 ± 0.66
1.556AspCys: 1.556 ± 0.582
1.037AspAsp: 1.037 ± 0.411
3.112AspGlu: 3.112 ± 1.042
1.556AspPhe: 1.556 ± 0.644
4.149AspGly: 4.149 ± 1.172
0.0AspHis: 0.0 ± 0.0
1.556AspIle: 1.556 ± 0.743
1.556AspLys: 1.556 ± 0.45
6.224AspLeu: 6.224 ± 0.887
1.556AspMet: 1.556 ± 0.45
1.556AspAsn: 1.556 ± 0.688
0.519AspPro: 0.519 ± 0.335
1.556AspGln: 1.556 ± 0.585
5.187AspArg: 5.187 ± 0.938
3.112AspSer: 3.112 ± 0.911
3.631AspThr: 3.631 ± 0.904
3.631AspVal: 3.631 ± 0.714
2.593AspTrp: 2.593 ± 1.043
1.037AspTyr: 1.037 ± 0.511
0.0AspXaa: 0.0 ± 0.0
Glu
2.593GluAla: 2.593 ± 0.479
0.0GluCys: 0.0 ± 0.0
6.224GluAsp: 6.224 ± 1.724
2.593GluGlu: 2.593 ± 0.778
1.037GluPhe: 1.037 ± 0.67
3.112GluGly: 3.112 ± 0.653
0.519GluHis: 0.519 ± 0.335
1.556GluIle: 1.556 ± 0.644
3.631GluLys: 3.631 ± 1.36
4.668GluLeu: 4.668 ± 1.478
1.037GluMet: 1.037 ± 0.995
1.037GluAsn: 1.037 ± 0.511
2.593GluPro: 2.593 ± 0.535
1.556GluGln: 1.556 ± 0.681
4.668GluArg: 4.668 ± 1.507
7.261GluSer: 7.261 ± 2.092
4.149GluThr: 4.149 ± 1.154
5.187GluVal: 5.187 ± 1.23
0.519GluTrp: 0.519 ± 0.605
1.037GluTyr: 1.037 ± 0.557
0.0GluXaa: 0.0 ± 0.0
Phe
0.519PheAla: 0.519 ± 0.335
1.037PheCys: 1.037 ± 0.67
1.556PheAsp: 1.556 ± 0.797
0.519PheGlu: 0.519 ± 0.542
0.0PhePhe: 0.0 ± 0.0
5.187PheGly: 5.187 ± 0.921
0.519PheHis: 0.519 ± 0.335
0.519PheIle: 0.519 ± 0.453
2.075PheLys: 2.075 ± 2.42
3.112PheLeu: 3.112 ± 1.173
1.037PheMet: 1.037 ± 0.411
1.556PheAsn: 1.556 ± 0.45
0.519PhePro: 0.519 ± 0.453
2.075PheGln: 2.075 ± 0.801
3.112PheArg: 3.112 ± 0.453
3.112PheSer: 3.112 ± 1.532
1.556PheThr: 1.556 ± 0.45
2.075PheVal: 2.075 ± 0.363
0.519PheTrp: 0.519 ± 0.335
2.075PheTyr: 2.075 ± 0.801
0.0PheXaa: 0.0 ± 0.0
Gly
4.149GlyAla: 4.149 ± 1.609
3.112GlyCys: 3.112 ± 1.042
3.112GlyAsp: 3.112 ± 1.06
2.593GlyGlu: 2.593 ± 0.705
2.075GlyPhe: 2.075 ± 1.161
6.743GlyGly: 6.743 ± 2.395
0.0GlyHis: 0.0 ± 0.0
6.224GlyIle: 6.224 ± 1.15
5.187GlyLys: 5.187 ± 0.583
7.78GlyLeu: 7.78 ± 1.299
2.075GlyMet: 2.075 ± 0.577
4.149GlyAsn: 4.149 ± 0.451
1.037GlyPro: 1.037 ± 0.411
0.519GlyGln: 0.519 ± 0.453
4.668GlyArg: 4.668 ± 1.359
7.78GlySer: 7.78 ± 1.768
4.149GlyThr: 4.149 ± 1.844
9.336GlyVal: 9.336 ± 0.792
0.519GlyTrp: 0.519 ± 0.335
4.149GlyTyr: 4.149 ± 1.605
0.0GlyXaa: 0.0 ± 0.0
His
1.556HisAla: 1.556 ± 0.45
2.075HisCys: 2.075 ± 1.01
1.037HisAsp: 1.037 ± 0.511
0.519HisGlu: 0.519 ± 0.335
0.519HisPhe: 0.519 ± 0.453
1.037HisGly: 1.037 ± 0.67
0.519HisHis: 0.519 ± 0.335
0.519HisIle: 0.519 ± 0.335
1.037HisLys: 1.037 ± 0.67
2.075HisLeu: 2.075 ± 0.726
1.556HisMet: 1.556 ± 0.582
1.037HisAsn: 1.037 ± 0.67
0.519HisPro: 0.519 ± 0.335
0.519HisGln: 0.519 ± 0.335
1.556HisArg: 1.556 ± 0.648
1.037HisSer: 1.037 ± 0.411
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.519HisTyr: 0.519 ± 0.453
0.0HisXaa: 0.0 ± 0.0
Ile
3.631IleAla: 3.631 ± 1.144
1.037IleCys: 1.037 ± 0.511
1.037IleAsp: 1.037 ± 0.411
2.593IleGlu: 2.593 ± 1.05
1.037IlePhe: 1.037 ± 0.67
3.631IleGly: 3.631 ± 0.746
0.0IleHis: 0.0 ± 0.0
1.556IleIle: 1.556 ± 0.644
1.556IleLys: 1.556 ± 0.585
3.112IleLeu: 3.112 ± 1.164
2.075IleMet: 2.075 ± 0.685
3.112IleAsn: 3.112 ± 0.815
4.149IlePro: 4.149 ± 0.733
1.037IleGln: 1.037 ± 0.547
2.593IleArg: 2.593 ± 1.437
2.593IleSer: 2.593 ± 1.134
4.149IleThr: 4.149 ± 1.665
3.112IleVal: 3.112 ± 0.844
0.0IleTrp: 0.0 ± 0.0
1.037IleTyr: 1.037 ± 0.511
0.0IleXaa: 0.0 ± 0.0
Lys
4.668LysAla: 4.668 ± 1.377
0.0LysCys: 0.0 ± 0.0
4.149LysAsp: 4.149 ± 0.726
3.631LysGlu: 3.631 ± 1.013
0.519LysPhe: 0.519 ± 0.453
5.187LysGly: 5.187 ± 1.312
0.519LysHis: 0.519 ± 0.335
3.112LysIle: 3.112 ± 1.004
1.556LysLys: 1.556 ± 0.932
8.299LysLeu: 8.299 ± 1.475
2.075LysMet: 2.075 ± 0.713
2.593LysAsn: 2.593 ± 0.408
2.075LysPro: 2.075 ± 1.05
1.037LysGln: 1.037 ± 0.547
3.631LysArg: 3.631 ± 0.749
1.037LysSer: 1.037 ± 0.511
3.112LysThr: 3.112 ± 1.004
6.224LysVal: 6.224 ± 1.689
1.556LysTrp: 1.556 ± 0.743
1.037LysTyr: 1.037 ± 0.511
0.519LysXaa: 0.519 ± 0.335
Leu
6.743LeuAla: 6.743 ± 1.995
1.037LeuCys: 1.037 ± 0.796
4.149LeuAsp: 4.149 ± 1.321
4.149LeuGlu: 4.149 ± 1.009
2.075LeuPhe: 2.075 ± 0.886
7.261LeuGly: 7.261 ± 1.568
3.112LeuHis: 3.112 ± 0.911
4.668LeuIle: 4.668 ± 1.14
7.78LeuLys: 7.78 ± 1.41
6.224LeuLeu: 6.224 ± 1.545
2.593LeuMet: 2.593 ± 0.535
2.593LeuAsn: 2.593 ± 0.578
8.299LeuPro: 8.299 ± 1.124
2.075LeuGln: 2.075 ± 1.184
4.149LeuArg: 4.149 ± 1.665
6.224LeuSer: 6.224 ± 1.319
7.261LeuThr: 7.261 ± 1.345
5.705LeuVal: 5.705 ± 1.679
1.037LeuTrp: 1.037 ± 0.511
2.075LeuTyr: 2.075 ± 1.102
0.0LeuXaa: 0.0 ± 0.0
Met
1.556MetAla: 1.556 ± 1.146
0.519MetCys: 0.519 ± 0.335
2.075MetAsp: 2.075 ± 0.631
3.631MetGlu: 3.631 ± 0.924
1.037MetPhe: 1.037 ± 0.511
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.519MetIle: 0.519 ± 0.453
1.556MetLys: 1.556 ± 0.598
3.112MetLeu: 3.112 ± 0.9
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
0.0MetPro: 0.0 ± 0.0
0.519MetGln: 0.519 ± 0.542
1.037MetArg: 1.037 ± 0.557
3.112MetSer: 3.112 ± 0.583
2.593MetThr: 2.593 ± 1.147
2.593MetVal: 2.593 ± 0.969
0.0MetTrp: 0.0 ± 0.0
1.556MetTyr: 1.556 ± 0.585
0.0MetXaa: 0.0 ± 0.0
Asn
5.187AsnAla: 5.187 ± 1.112
2.075AsnCys: 2.075 ± 0.631
2.075AsnAsp: 2.075 ± 1.593
1.037AsnGlu: 1.037 ± 0.557
2.075AsnPhe: 2.075 ± 0.577
2.593AsnGly: 2.593 ± 0.9
0.519AsnHis: 0.519 ± 0.335
1.556AsnIle: 1.556 ± 0.598
3.631AsnLys: 3.631 ± 0.867
2.075AsnLeu: 2.075 ± 0.878
1.037AsnMet: 1.037 ± 0.511
1.556AsnAsn: 1.556 ± 0.797
2.593AsnPro: 2.593 ± 1.374
1.037AsnGln: 1.037 ± 1.21
1.556AsnArg: 1.556 ± 0.582
3.631AsnSer: 3.631 ± 1.718
2.593AsnThr: 2.593 ± 0.578
3.631AsnVal: 3.631 ± 0.906
0.0AsnTrp: 0.0 ± 0.0
3.112AsnTyr: 3.112 ± 0.704
0.0AsnXaa: 0.0 ± 0.0
Pro
3.631ProAla: 3.631 ± 0.746
0.0ProCys: 0.0 ± 0.0
2.075ProAsp: 2.075 ± 0.611
2.593ProGlu: 2.593 ± 0.705
1.556ProPhe: 1.556 ± 0.681
2.075ProGly: 2.075 ± 0.801
0.0ProHis: 0.0 ± 0.0
1.556ProIle: 1.556 ± 0.644
2.075ProLys: 2.075 ± 0.744
3.112ProLeu: 3.112 ± 0.982
0.0ProMet: 0.0 ± 0.0
0.519ProAsn: 0.519 ± 0.335
1.037ProPro: 1.037 ± 0.67
2.593ProGln: 2.593 ± 0.854
5.187ProArg: 5.187 ± 1.932
5.705ProSer: 5.705 ± 0.929
2.075ProThr: 2.075 ± 0.886
5.705ProVal: 5.705 ± 1.11
1.037ProTrp: 1.037 ± 0.688
2.075ProTyr: 2.075 ± 1.022
0.0ProXaa: 0.0 ± 0.0
Gln
2.075GlnAla: 2.075 ± 0.655
0.0GlnCys: 0.0 ± 0.0
0.519GlnAsp: 0.519 ± 0.605
3.112GlnGlu: 3.112 ± 1.004
1.556GlnPhe: 1.556 ± 0.585
2.075GlnGly: 2.075 ± 0.974
2.075GlnHis: 2.075 ± 0.726
1.037GlnIle: 1.037 ± 0.411
0.519GlnLys: 0.519 ± 0.335
2.593GlnLeu: 2.593 ± 0.815
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
1.556GlnPro: 1.556 ± 0.598
1.556GlnGln: 1.556 ± 0.45
1.556GlnArg: 1.556 ± 0.582
0.519GlnSer: 0.519 ± 0.542
1.556GlnThr: 1.556 ± 0.45
3.631GlnVal: 3.631 ± 0.687
0.0GlnTrp: 0.0 ± 0.0
1.037GlnTyr: 1.037 ± 0.411
0.0GlnXaa: 0.0 ± 0.0
Arg
5.187ArgAla: 5.187 ± 1.029
0.0ArgCys: 0.0 ± 0.0
4.149ArgAsp: 4.149 ± 1.256
4.668ArgGlu: 4.668 ± 0.933
4.668ArgPhe: 4.668 ± 1.507
4.668ArgGly: 4.668 ± 1.0
1.037ArgHis: 1.037 ± 0.67
3.631ArgIle: 3.631 ± 0.971
3.631ArgLys: 3.631 ± 1.576
6.743ArgLeu: 6.743 ± 1.117
0.519ArgMet: 0.519 ± 0.335
4.149ArgAsn: 4.149 ± 1.448
3.112ArgPro: 3.112 ± 1.532
1.037ArgGln: 1.037 ± 0.905
4.149ArgArg: 4.149 ± 0.733
1.556ArgSer: 1.556 ± 0.932
6.743ArgThr: 6.743 ± 0.922
6.224ArgVal: 6.224 ± 0.718
1.556ArgTrp: 1.556 ± 0.681
3.631ArgTyr: 3.631 ± 0.925
0.0ArgXaa: 0.0 ± 0.0
Ser
4.149SerAla: 4.149 ± 0.726
0.0SerCys: 0.0 ± 0.0
2.593SerAsp: 2.593 ± 1.022
2.593SerGlu: 2.593 ± 0.479
1.037SerPhe: 1.037 ± 0.67
6.224SerGly: 6.224 ± 2.814
1.556SerHis: 1.556 ± 0.797
2.593SerIle: 2.593 ± 1.032
4.668SerLys: 4.668 ± 1.559
6.743SerLeu: 6.743 ± 1.688
0.0SerMet: 0.0 ± 0.0
3.112SerAsn: 3.112 ± 1.441
3.631SerPro: 3.631 ± 1.013
2.075SerGln: 2.075 ± 0.363
6.743SerArg: 6.743 ± 1.353
3.631SerSer: 3.631 ± 1.595
3.631SerThr: 3.631 ± 1.403
7.78SerVal: 7.78 ± 2.216
2.075SerTrp: 2.075 ± 1.921
3.631SerTyr: 3.631 ± 0.896
0.0SerXaa: 0.0 ± 0.0
Thr
5.187ThrAla: 5.187 ± 2.157
1.556ThrCys: 1.556 ± 0.582
3.112ThrAsp: 3.112 ± 0.844
3.631ThrGlu: 3.631 ± 1.482
2.593ThrPhe: 2.593 ± 1.374
7.78ThrGly: 7.78 ± 1.081
2.075ThrHis: 2.075 ± 1.022
1.556ThrIle: 1.556 ± 0.648
3.112ThrLys: 3.112 ± 1.004
6.224ThrLeu: 6.224 ± 1.672
2.593ThrMet: 2.593 ± 1.233
3.112ThrAsn: 3.112 ± 1.799
2.075ThrPro: 2.075 ± 0.917
1.556ThrGln: 1.556 ± 0.797
4.668ThrArg: 4.668 ± 0.4
4.149ThrSer: 4.149 ± 2.1
5.187ThrThr: 5.187 ± 0.817
6.224ThrVal: 6.224 ± 2.046
0.0ThrTrp: 0.0 ± 0.0
3.112ThrTyr: 3.112 ± 0.453
0.0ThrXaa: 0.0 ± 0.0
Val
9.855ValAla: 9.855 ± 2.385
0.519ValCys: 0.519 ± 0.335
3.112ValAsp: 3.112 ± 1.164
8.817ValGlu: 8.817 ± 2.181
4.149ValPhe: 4.149 ± 0.464
7.78ValGly: 7.78 ± 1.01
3.631ValHis: 3.631 ± 1.126
2.593ValIle: 2.593 ± 0.578
4.668ValLys: 4.668 ± 2.037
4.668ValLeu: 4.668 ± 0.4
2.075ValMet: 2.075 ± 0.952
5.187ValAsn: 5.187 ± 1.643
5.705ValPro: 5.705 ± 0.905
1.556ValGln: 1.556 ± 0.598
6.224ValArg: 6.224 ± 0.951
4.668ValSer: 4.668 ± 2.416
6.224ValThr: 6.224 ± 2.085
3.112ValVal: 3.112 ± 0.947
0.0ValTrp: 0.0 ± 0.0
1.556ValTyr: 1.556 ± 0.45
0.0ValXaa: 0.0 ± 0.0
Trp
1.037TrpAla: 1.037 ± 0.411
0.0TrpCys: 0.0 ± 0.0
1.037TrpAsp: 1.037 ± 0.547
0.519TrpGlu: 0.519 ± 0.335
0.0TrpPhe: 0.0 ± 0.0
1.556TrpGly: 1.556 ± 0.688
0.0TrpHis: 0.0 ± 0.0
1.037TrpIle: 1.037 ± 0.511
1.037TrpLys: 1.037 ± 0.557
1.556TrpLeu: 1.556 ± 0.45
0.0TrpMet: 0.0 ± 0.0
1.037TrpAsn: 1.037 ± 0.688
0.0TrpPro: 0.0 ± 0.0
1.037TrpGln: 1.037 ± 0.557
1.037TrpArg: 1.037 ± 0.547
1.556TrpSer: 1.556 ± 0.582
1.037TrpThr: 1.037 ± 1.21
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.112TyrAla: 3.112 ± 0.919
0.0TyrCys: 0.0 ± 0.0
2.593TyrAsp: 2.593 ± 0.705
1.037TyrGlu: 1.037 ± 0.557
2.075TyrPhe: 2.075 ± 0.363
0.519TyrGly: 0.519 ± 0.453
0.0TyrHis: 0.0 ± 0.0
0.519TyrIle: 0.519 ± 0.542
0.519TyrLys: 0.519 ± 0.542
3.631TyrLeu: 3.631 ± 0.332
1.556TyrMet: 1.556 ± 0.797
2.075TyrAsn: 2.075 ± 0.655
2.593TyrPro: 2.593 ± 1.043
1.556TyrGln: 1.556 ± 0.644
3.112TyrArg: 3.112 ± 1.164
2.075TyrSer: 2.075 ± 0.748
4.149TyrThr: 4.149 ± 0.941
3.631TyrVal: 3.631 ± 0.858
0.0TyrTrp: 0.0 ± 0.0
1.556TyrTyr: 1.556 ± 0.582
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.519XaaGly: 0.519 ± 0.335
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (1929 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski