Amino acid dipeptide frequency for Spinach severe curly top virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
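As an illustration of what such a calculation could look like, here is a minimal Python sketch. It assumes overlapping windows of two residues counted within each protein separately and sequences given in one-letter code; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the exact counting rules used by Proteome-pI may differ.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        # Slide a window of width 2 along each protein.
        # Assumption: pairs never span the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale relative frequencies to per mille.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not taken from the proteome.
toy = ["MKKLA", "AKK"]
freqs = dipeptide_per_mille(toy)
```

The table below uses three-letter codes (e.g. LysLys), so keys such as "KK" would need to be mapped accordingly when comparing.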

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
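The expectation under equal probabilities, and the over/under classification it implies, can be sketched as follows. This is an assumption-laden illustration: the helper `classify` is hypothetical, it uses the 400-dipeptide expectation as the threshold, and it ignores the reported uncertainties; the colouring rule actually used on the page may be different. The three example values are taken from the table.

```python
N_STANDARD = 20

# Expected per-mille frequency if every dipeptide were equally likely.
expected_400 = 1000.0 / N_STANDARD ** 2          # 20 x 20 dipeptides
expected_441 = 1000.0 / (N_STANDARD + 1) ** 2    # 21 x 21 when Xaa is included

def classify(value_per_mille, expected=expected_400):
    """Label a frequency relative to the uniform expectation (hypothetical rule)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"

# Values in per mille copied from the table.
examples = {"LysLys": 13.258, "AlaCys": 0.0, "AlaAsp": 2.841}
labels = {pair: classify(value) for pair, value in examples.items()}
```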

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.894AlaAla: 1.894 ± 0.811
0.0AlaCys: 0.0 ± 0.0
2.841AlaAsp: 2.841 ± 1.375
0.947AlaGlu: 0.947 ± 1.085
1.894AlaPhe: 1.894 ± 1.458
3.788AlaGly: 3.788 ± 1.407
0.0AlaHis: 0.0 ± 0.0
2.841AlaIle: 2.841 ± 1.76
1.894AlaLys: 1.894 ± 1.023
1.894AlaLeu: 1.894 ± 0.811
1.894AlaMet: 1.894 ± 1.131
3.788AlaAsn: 3.788 ± 2.676
2.841AlaPro: 2.841 ± 1.443
0.947AlaGln: 0.947 ± 0.669
0.947AlaArg: 0.947 ± 0.669
0.947AlaSer: 0.947 ± 0.669
0.947AlaThr: 0.947 ± 0.78
5.682AlaVal: 5.682 ± 2.106
0.0AlaTrp: 0.0 ± 0.0
0.0AlaTyr: 0.0 ± 0.0
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
2.841CysAsp: 2.841 ± 1.958
0.947CysGlu: 0.947 ± 0.669
1.894CysPhe: 1.894 ± 1.37
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
1.894CysIle: 1.894 ± 2.021
1.894CysLys: 1.894 ± 1.023
1.894CysLeu: 1.894 ± 1.13
0.0CysMet: 0.0 ± 0.0
0.947CysAsn: 0.947 ± 0.669
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
1.894CysArg: 1.894 ± 0.811
2.841CysSer: 2.841 ± 2.281
0.947CysThr: 0.947 ± 1.046
0.0CysVal: 0.0 ± 0.0
1.894CysTrp: 1.894 ± 0.932
1.894CysTyr: 1.894 ± 1.109
0.0CysXaa: 0.0 ± 0.0
Asp
0.947AspAla: 0.947 ± 0.669
0.947AspCys: 0.947 ± 0.669
3.788AspAsp: 3.788 ± 1.855
4.735AspGlu: 4.735 ± 1.853
4.735AspPhe: 4.735 ± 2.09
1.894AspGly: 1.894 ± 0.811
0.0AspHis: 0.0 ± 0.0
5.682AspIle: 5.682 ± 1.242
2.841AspLys: 2.841 ± 0.885
0.0AspLeu: 0.0 ± 0.0
6.629AspMet: 6.629 ± 2.314
3.788AspAsn: 3.788 ± 1.37
0.947AspPro: 0.947 ± 0.669
1.894AspGln: 1.894 ± 2.021
1.894AspArg: 1.894 ± 1.237
2.841AspSer: 2.841 ± 0.904
1.894AspThr: 1.894 ± 1.237
3.788AspVal: 3.788 ± 1.407
2.841AspTrp: 2.841 ± 1.306
1.894AspTyr: 1.894 ± 0.811
0.0AspXaa: 0.0 ± 0.0
Glu
2.841GluAla: 2.841 ± 1.109
0.0GluCys: 0.0 ± 0.0
1.894GluAsp: 1.894 ± 1.169
8.523GluGlu: 8.523 ± 7.255
0.0GluPhe: 0.0 ± 0.0
4.735GluGly: 4.735 ± 1.853
1.894GluHis: 1.894 ± 1.382
4.735GluIle: 4.735 ± 2.601
0.947GluLys: 0.947 ± 1.011
2.841GluLeu: 2.841 ± 1.325
0.947GluMet: 0.947 ± 0.669
1.894GluAsn: 1.894 ± 0.811
1.894GluPro: 1.894 ± 1.023
0.947GluGln: 0.947 ± 1.011
3.788GluArg: 3.788 ± 1.11
5.682GluSer: 5.682 ± 2.118
0.947GluThr: 0.947 ± 0.669
7.576GluVal: 7.576 ± 2.883
1.894GluTrp: 1.894 ± 1.023
0.947GluTyr: 0.947 ± 0.669
0.0GluXaa: 0.0 ± 0.0
Phe
1.894PheAla: 1.894 ± 1.451
0.947PheCys: 0.947 ± 1.046
2.841PheAsp: 2.841 ± 1.109
0.0PheGlu: 0.0 ± 0.0
1.894PhePhe: 1.894 ± 1.023
1.894PheGly: 1.894 ± 0.932
0.947PheHis: 0.947 ± 0.88
3.788PheIle: 3.788 ± 1.09
5.682PheLys: 5.682 ± 2.071
6.629PheLeu: 6.629 ± 3.323
1.894PheMet: 1.894 ± 1.866
2.841PheAsn: 2.841 ± 1.375
3.788PhePro: 3.788 ± 0.976
2.841PheGln: 2.841 ± 2.329
1.894PheArg: 1.894 ± 1.023
0.947PheSer: 0.947 ± 0.88
0.0PheThr: 0.0 ± 0.0
1.894PheVal: 1.894 ± 1.023
0.947PheTrp: 0.947 ± 0.78
2.841PheTyr: 2.841 ± 1.408
0.0PheXaa: 0.0 ± 0.0
Gly
1.894GlyAla: 1.894 ± 0.811
0.947GlyCys: 0.947 ± 1.011
3.788GlyAsp: 3.788 ± 1.855
2.841GlyGlu: 2.841 ± 1.825
1.894GlyPhe: 1.894 ± 1.109
6.629GlyGly: 6.629 ± 2.673
0.947GlyHis: 0.947 ± 0.669
1.894GlyIle: 1.894 ± 0.811
7.576GlyLys: 7.576 ± 3.243
0.947GlyLeu: 0.947 ± 1.085
0.947GlyMet: 0.947 ± 0.78
1.894GlyAsn: 1.894 ± 1.169
0.947GlyPro: 0.947 ± 1.011
4.735GlyGln: 4.735 ± 1.025
0.947GlyArg: 0.947 ± 1.011
6.629GlySer: 6.629 ± 2.58
4.735GlyThr: 4.735 ± 1.987
4.735GlyVal: 4.735 ± 1.254
0.0GlyTrp: 0.0 ± 0.0
0.0GlyTyr: 0.0 ± 0.0
0.0GlyXaa: 0.0 ± 0.0
His
0.947HisAla: 0.947 ± 0.669
1.894HisCys: 1.894 ± 1.023
0.947HisAsp: 0.947 ± 0.669
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.947HisGly: 0.947 ± 0.88
1.894HisHis: 1.894 ± 0.941
0.947HisIle: 0.947 ± 0.88
0.0HisLys: 0.0 ± 0.0
4.735HisLeu: 4.735 ± 1.336
0.0HisMet: 0.0 ± 0.0
3.788HisAsn: 3.788 ± 2.676
1.894HisPro: 1.894 ± 1.338
0.947HisGln: 0.947 ± 1.046
0.0HisArg: 0.0 ± 0.0
0.0HisSer: 0.0 ± 0.0
0.947HisThr: 0.947 ± 0.88
0.0HisVal: 0.0 ± 0.0
1.894HisTrp: 1.894 ± 0.811
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.947IleAla: 0.947 ± 0.669
0.947IleCys: 0.947 ± 1.046
2.841IleAsp: 2.841 ± 1.958
2.841IleGlu: 2.841 ± 1.376
4.735IlePhe: 4.735 ± 1.134
3.788IleGly: 3.788 ± 0.976
0.0IleHis: 0.0 ± 0.0
4.735IleIle: 4.735 ± 1.698
6.629IleLys: 6.629 ± 1.849
6.629IleLeu: 6.629 ± 1.983
0.947IleMet: 0.947 ± 1.046
0.947IleAsn: 0.947 ± 0.78
6.629IlePro: 6.629 ± 1.608
3.788IleGln: 3.788 ± 2.259
3.788IleArg: 3.788 ± 1.351
4.735IleSer: 4.735 ± 1.507
5.682IleThr: 5.682 ± 1.759
2.841IleVal: 2.841 ± 1.6
0.947IleTrp: 0.947 ± 1.011
0.947IleTyr: 0.947 ± 1.011
0.0IleXaa: 0.0 ± 0.0
Lys
2.841LysAla: 2.841 ± 1.266
1.894LysCys: 1.894 ± 1.458
9.47LysAsp: 9.47 ± 3.861
5.682LysGlu: 5.682 ± 1.844
1.894LysPhe: 1.894 ± 2.093
1.894LysGly: 1.894 ± 1.023
1.894LysHis: 1.894 ± 1.338
2.841LysIle: 2.841 ± 1.408
13.258LysLys: 13.258 ± 5.678
6.629LysLeu: 6.629 ± 1.88
2.841LysMet: 2.841 ± 1.606
2.841LysAsn: 2.841 ± 0.904
5.682LysPro: 5.682 ± 1.434
1.894LysGln: 1.894 ± 1.03
3.788LysArg: 3.788 ± 1.37
2.841LysSer: 2.841 ± 2.007
6.629LysThr: 6.629 ± 1.526
2.841LysVal: 2.841 ± 1.178
2.841LysTrp: 2.841 ± 1.414
7.576LysTyr: 7.576 ± 2.135
0.0LysXaa: 0.0 ± 0.0
Leu
2.841LeuAla: 2.841 ± 1.178
0.947LeuCys: 0.947 ± 0.669
2.841LeuAsp: 2.841 ± 1.376
3.788LeuGlu: 3.788 ± 1.339
2.841LeuPhe: 2.841 ± 3.256
2.841LeuGly: 2.841 ± 0.904
0.947LeuHis: 0.947 ± 0.669
4.735LeuIle: 4.735 ± 2.33
5.682LeuLys: 5.682 ± 1.921
5.682LeuLeu: 5.682 ± 2.701
2.841LeuMet: 2.841 ± 1.6
2.841LeuAsn: 2.841 ± 1.711
1.894LeuPro: 1.894 ± 1.13
3.788LeuGln: 3.788 ± 1.466
3.788LeuArg: 3.788 ± 2.485
8.523LeuSer: 8.523 ± 3.154
4.735LeuThr: 4.735 ± 2.036
0.0LeuVal: 0.0 ± 0.0
0.0LeuTrp: 0.0 ± 0.0
4.735LeuTyr: 4.735 ± 1.388
0.0LeuXaa: 0.0 ± 0.0
Met
0.947MetAla: 0.947 ± 0.78
0.947MetCys: 0.947 ± 1.046
0.0MetAsp: 0.0 ± 0.0
1.894MetGlu: 1.894 ± 1.37
0.947MetPhe: 0.947 ± 1.046
1.894MetGly: 1.894 ± 1.169
0.0MetHis: 0.0 ± 0.0
2.841MetIle: 2.841 ± 1.606
0.947MetLys: 0.947 ± 0.88
1.894MetLeu: 1.894 ± 1.559
1.894MetMet: 1.894 ± 1.451
0.947MetAsn: 0.947 ± 0.78
3.788MetPro: 3.788 ± 1.317
4.735MetGln: 4.735 ± 1.626
0.0MetArg: 0.0 ± 0.0
2.841MetSer: 2.841 ± 1.6
2.841MetThr: 2.841 ± 2.01
0.947MetVal: 0.947 ± 1.085
0.0MetTrp: 0.0 ± 0.0
1.894MetTyr: 1.894 ± 1.559
0.0MetXaa: 0.0 ± 0.0
Asn
4.735AsnAla: 4.735 ± 1.549
3.788AsnCys: 3.788 ± 1.006
2.841AsnAsp: 2.841 ± 1.606
2.841AsnGlu: 2.841 ± 1.266
0.947AsnPhe: 0.947 ± 1.085
0.947AsnGly: 0.947 ± 0.78
0.947AsnHis: 0.947 ± 1.046
2.841AsnIle: 2.841 ± 1.266
3.788AsnLys: 3.788 ± 1.802
1.894AsnLeu: 1.894 ± 1.338
0.947AsnMet: 0.947 ± 0.78
1.894AsnAsn: 1.894 ± 0.811
2.841AsnPro: 2.841 ± 1.375
0.947AsnGln: 0.947 ± 0.88
0.947AsnArg: 0.947 ± 0.78
0.947AsnSer: 0.947 ± 0.669
3.788AsnThr: 3.788 ± 1.777
7.576AsnVal: 7.576 ± 3.243
0.0AsnTrp: 0.0 ± 0.0
8.523AsnTyr: 8.523 ± 2.057
0.0AsnXaa: 0.0 ± 0.0
Pro
0.947ProAla: 0.947 ± 1.011
0.0ProCys: 0.0 ± 0.0
3.788ProAsp: 3.788 ± 1.129
3.788ProGlu: 3.788 ± 2.004
1.894ProPhe: 1.894 ± 0.932
2.841ProGly: 2.841 ± 1.178
0.947ProHis: 0.947 ± 0.669
4.735ProIle: 4.735 ± 1.93
3.788ProLys: 3.788 ± 1.947
2.841ProLeu: 2.841 ± 1.266
1.894ProMet: 1.894 ± 1.37
2.841ProAsn: 2.841 ± 1.443
0.947ProPro: 0.947 ± 0.669
2.841ProGln: 2.841 ± 1.375
3.788ProArg: 3.788 ± 1.947
6.629ProSer: 6.629 ± 2.104
3.788ProThr: 3.788 ± 2.086
0.947ProVal: 0.947 ± 0.669
0.947ProTrp: 0.947 ± 0.669
0.947ProTyr: 0.947 ± 1.046
0.0ProXaa: 0.0 ± 0.0
Gln
2.841GlnAla: 2.841 ± 1.443
0.947GlnCys: 0.947 ± 0.669
1.894GlnAsp: 1.894 ± 2.021
3.788GlnGlu: 3.788 ± 1.426
2.841GlnPhe: 2.841 ± 2.281
0.947GlnGly: 0.947 ± 0.78
2.841GlnHis: 2.841 ± 1.375
1.894GlnIle: 1.894 ± 1.023
4.735GlnLys: 4.735 ± 2.338
2.841GlnLeu: 2.841 ± 1.375
0.947GlnMet: 0.947 ± 0.88
1.894GlnAsn: 1.894 ± 1.338
1.894GlnPro: 1.894 ± 0.941
0.947GlnGln: 0.947 ± 1.085
1.894GlnArg: 1.894 ± 1.03
5.682GlnSer: 5.682 ± 2.377
2.841GlnThr: 2.841 ± 1.414
1.894GlnVal: 1.894 ± 1.451
0.947GlnTrp: 0.947 ± 0.78
1.894GlnTyr: 1.894 ± 0.811
0.0GlnXaa: 0.0 ± 0.0
Arg
0.0ArgAla: 0.0 ± 0.0
1.894ArgCys: 1.894 ± 1.109
3.788ArgAsp: 3.788 ± 1.777
0.0ArgGlu: 0.0 ± 0.0
3.788ArgPhe: 3.788 ± 1.09
1.894ArgGly: 1.894 ± 2.171
1.894ArgHis: 1.894 ± 0.811
2.841ArgIle: 2.841 ± 0.885
5.682ArgLys: 5.682 ± 2.386
2.841ArgLeu: 2.841 ± 1.178
0.0ArgMet: 0.0 ± 0.797
0.947ArgAsn: 0.947 ± 0.78
1.894ArgPro: 1.894 ± 1.338
0.947ArgGln: 0.947 ± 0.669
4.735ArgArg: 4.735 ± 3.073
5.682ArgSer: 5.682 ± 1.408
2.841ArgThr: 2.841 ± 1.695
1.894ArgVal: 1.894 ± 1.338
0.947ArgTrp: 0.947 ± 0.669
0.947ArgTyr: 0.947 ± 1.011
0.0ArgXaa: 0.0 ± 0.0
Ser
2.841SerAla: 2.841 ± 0.885
0.947SerCys: 0.947 ± 1.011
0.947SerAsp: 0.947 ± 1.046
3.788SerGlu: 3.788 ± 1.466
4.735SerPhe: 4.735 ± 1.693
6.629SerGly: 6.629 ± 3.522
0.947SerHis: 0.947 ± 1.046
4.735SerIle: 4.735 ± 2.245
4.735SerLys: 4.735 ± 1.877
5.682SerLeu: 5.682 ± 1.824
1.894SerMet: 1.894 ± 1.559
6.629SerAsn: 6.629 ± 1.655
2.841SerPro: 2.841 ± 2.641
2.841SerGln: 2.841 ± 2.339
4.735SerArg: 4.735 ± 2.249
8.523SerSer: 8.523 ± 3.272
7.576SerThr: 7.576 ± 3.816
4.735SerVal: 4.735 ± 1.891
0.0SerTrp: 0.0 ± 0.0
1.894SerTyr: 1.894 ± 0.941
0.0SerXaa: 0.0 ± 0.0
Thr
0.947ThrAla: 0.947 ± 0.88
0.947ThrCys: 0.947 ± 0.669
1.894ThrAsp: 1.894 ± 1.559
2.841ThrGlu: 2.841 ± 3.139
2.841ThrPhe: 2.841 ± 1.235
4.735ThrGly: 4.735 ± 1.975
1.894ThrHis: 1.894 ± 0.811
2.841ThrIle: 2.841 ± 2.033
1.894ThrLys: 1.894 ± 1.023
5.682ThrLeu: 5.682 ± 1.725
1.894ThrMet: 1.894 ± 1.109
3.788ThrAsn: 3.788 ± 1.754
4.735ThrPro: 4.735 ± 1.93
4.735ThrGln: 4.735 ± 2.167
3.788ThrArg: 3.788 ± 2.262
3.788ThrSer: 3.788 ± 2.87
4.735ThrThr: 4.735 ± 2.712
3.788ThrVal: 3.788 ± 2.297
0.947ThrTrp: 0.947 ± 0.669
3.788ThrTyr: 3.788 ± 1.622
0.0ThrXaa: 0.0 ± 0.0
Val
2.841ValAla: 2.841 ± 1.325
1.894ValCys: 1.894 ± 1.503
2.841ValAsp: 2.841 ± 1.533
0.947ValGlu: 0.947 ± 0.669
2.841ValPhe: 2.841 ± 0.904
1.894ValGly: 1.894 ± 1.559
0.947ValHis: 0.947 ± 0.669
4.735ValIle: 4.735 ± 1.395
7.576ValLys: 7.576 ± 1.92
1.894ValLeu: 1.894 ± 1.338
0.947ValMet: 0.947 ± 0.905
4.735ValAsn: 4.735 ± 1.252
1.894ValPro: 1.894 ± 0.941
3.788ValGln: 3.788 ± 1.175
1.894ValArg: 1.894 ± 1.559
4.735ValSer: 4.735 ± 2.036
1.894ValThr: 1.894 ± 1.559
3.788ValVal: 3.788 ± 1.55
1.894ValTrp: 1.894 ± 1.109
3.788ValTyr: 3.788 ± 1.351
0.0ValXaa: 0.0 ± 0.0
Trp
1.894TrpAla: 1.894 ± 1.338
0.947TrpCys: 0.947 ± 1.011
0.0TrpAsp: 0.0 ± 0.0
2.841TrpGlu: 2.841 ± 2.042
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.947TrpIle: 0.947 ± 0.78
4.735TrpLys: 4.735 ± 1.891
0.947TrpLeu: 0.947 ± 1.085
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.947TrpPro: 0.947 ± 1.046
0.947TrpGln: 0.947 ± 0.669
0.947TrpArg: 0.947 ± 1.046
0.947TrpSer: 0.947 ± 0.669
2.841TrpThr: 2.841 ± 1.266
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.947TyrAla: 0.947 ± 0.78
0.947TyrCys: 0.947 ± 0.669
1.894TyrAsp: 1.894 ± 1.559
1.894TyrGlu: 1.894 ± 1.109
3.788TyrPhe: 3.788 ± 0.969
4.735TyrGly: 4.735 ± 2.188
2.841TyrHis: 2.841 ± 1.266
2.841TyrIle: 2.841 ± 1.506
3.788TyrLys: 3.788 ± 1.407
1.894TyrLeu: 1.894 ± 1.338
1.894TyrMet: 1.894 ± 1.421
4.735TyrAsn: 4.735 ± 1.758
2.841TyrPro: 2.841 ± 1.178
1.894TyrGln: 1.894 ± 1.169
0.947TyrArg: 0.947 ± 0.78
1.894TyrSer: 1.894 ± 1.023
1.894TyrThr: 1.894 ± 2.093
2.841TyrVal: 2.841 ± 1.711
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1057 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
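A protein-level bootstrap of this kind might look roughly like the sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each replicate, and that the error is the standard deviation across replicates; the function names, the fixed seed, and the toy sequences are all hypothetical, and the database may use a different estimator.

```python
import random
import statistics
from collections import Counter

def pair_per_mille(sequences, pair):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(resample, pair))
    return statistics.pstdev(estimates)

# Hypothetical toy sequences in one-letter code, not taken from the proteome.
toy = ["MKKLA", "AKK", "GGSKK"]
kk_error = bootstrap_error(toy, "KK")
```

With only a handful of proteins, as here, each resample can change the composition considerably, which is one reason such errors can be large relative to the values themselves.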

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski