Amino acid dipepetide frequency for Grapevine virus F

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.466AlaAla: 4.466 ± 1.358
0.812AlaCys: 0.812 ± 0.421
3.248AlaAsp: 3.248 ± 1.028
4.872AlaGlu: 4.872 ± 0.832
2.436AlaPhe: 2.436 ± 1.264
5.684AlaGly: 5.684 ± 2.176
1.624AlaHis: 1.624 ± 0.92
2.436AlaIle: 2.436 ± 0.601
6.496AlaLys: 6.496 ± 1.833
5.278AlaLeu: 5.278 ± 2.219
1.218AlaMet: 1.218 ± 0.587
2.436AlaAsn: 2.436 ± 0.807
2.842AlaPro: 2.842 ± 1.475
2.03AlaGln: 2.03 ± 1.687
5.684AlaArg: 5.684 ± 1.216
1.624AlaSer: 1.624 ± 0.817
4.466AlaThr: 4.466 ± 1.272
4.06AlaVal: 4.06 ± 1.406
0.812AlaTrp: 0.812 ± 0.659
2.436AlaTyr: 2.436 ± 0.601
0.0AlaXaa: 0.0 ± 0.0
Cys
0.812CysAla: 0.812 ± 0.421
0.0CysCys: 0.0 ± 0.0
0.812CysAsp: 0.812 ± 0.421
1.218CysGlu: 1.218 ± 0.587
1.218CysPhe: 1.218 ± 0.632
1.624CysGly: 1.624 ± 0.92
0.0CysHis: 0.0 ± 0.0
1.218CysIle: 1.218 ± 0.632
1.624CysLys: 1.624 ± 1.178
1.624CysLeu: 1.624 ± 0.801
0.812CysMet: 0.812 ± 0.421
0.812CysAsn: 0.812 ± 0.847
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
1.624CysSer: 1.624 ± 0.843
2.03CysThr: 2.03 ± 1.054
2.436CysVal: 2.436 ± 1.007
0.0CysTrp: 0.0 ± 0.0
0.812CysTyr: 0.812 ± 0.847
0.0CysXaa: 0.0 ± 0.0
Asp
4.06AspAla: 4.06 ± 1.746
1.624AspCys: 1.624 ± 1.317
2.842AspAsp: 2.842 ± 0.91
5.278AspGlu: 5.278 ± 1.608
2.03AspPhe: 2.03 ± 0.686
4.466AspGly: 4.466 ± 0.951
2.03AspHis: 2.03 ± 1.054
2.03AspIle: 2.03 ± 0.603
2.842AspLys: 2.842 ± 0.659
4.872AspLeu: 4.872 ± 2.403
1.218AspMet: 1.218 ± 0.632
1.218AspAsn: 1.218 ± 1.272
1.624AspPro: 1.624 ± 0.843
0.812AspGln: 0.812 ± 0.421
4.06AspArg: 4.06 ± 1.436
3.248AspSer: 3.248 ± 1.562
2.842AspThr: 2.842 ± 0.995
4.872AspVal: 4.872 ± 1.384
1.624AspTrp: 1.624 ± 0.644
2.436AspTyr: 2.436 ± 0.867
0.406AspXaa: 0.406 ± 0.211
Glu
6.09GluAla: 6.09 ± 1.632
0.812GluCys: 0.812 ± 0.847
4.466GluAsp: 4.466 ± 1.128
9.744GluGlu: 9.744 ± 1.969
3.248GluPhe: 3.248 ± 1.021
5.278GluGly: 5.278 ± 1.217
1.624GluHis: 1.624 ± 0.817
4.06GluIle: 4.06 ± 2.246
4.466GluLys: 4.466 ± 2.318
6.496GluLeu: 6.496 ± 1.935
2.436GluMet: 2.436 ± 0.944
2.436GluAsn: 2.436 ± 0.601
3.248GluPro: 3.248 ± 1.196
1.624GluGln: 1.624 ± 1.295
2.842GluArg: 2.842 ± 0.615
3.248GluSer: 3.248 ± 1.202
3.248GluThr: 3.248 ± 1.995
9.338GluVal: 9.338 ± 1.841
0.406GluTrp: 0.406 ± 0.211
3.248GluTyr: 3.248 ± 1.208
0.0GluXaa: 0.0 ± 0.0
Phe
2.842PheAla: 2.842 ± 1.22
0.812PheCys: 0.812 ± 0.576
2.436PheAsp: 2.436 ± 1.173
2.436PheGlu: 2.436 ± 1.007
2.03PhePhe: 2.03 ± 0.734
3.248PheGly: 3.248 ± 1.14
1.218PheHis: 1.218 ± 0.787
2.436PheIle: 2.436 ± 0.743
2.436PheLys: 2.436 ± 1.007
2.842PheLeu: 2.842 ± 1.026
3.248PheMet: 3.248 ± 1.653
1.624PheAsn: 1.624 ± 0.843
0.812PhePro: 0.812 ± 0.576
0.0PheGln: 0.0 ± 0.0
4.466PheArg: 4.466 ± 2.67
3.248PheSer: 3.248 ± 1.093
1.624PheThr: 1.624 ± 0.843
2.436PheVal: 2.436 ± 1.041
0.0PheTrp: 0.0 ± 0.0
0.812PheTyr: 0.812 ± 0.421
0.0PheXaa: 0.0 ± 0.0
Gly
3.248GlyAla: 3.248 ± 1.255
0.406GlyCys: 0.406 ± 0.706
4.06GlyAsp: 4.06 ± 2.107
2.842GlyGlu: 2.842 ± 1.712
1.218GlyPhe: 1.218 ± 0.952
2.436GlyGly: 2.436 ± 1.23
0.812GlyHis: 0.812 ± 0.421
3.248GlyIle: 3.248 ± 0.704
3.654GlyLys: 3.654 ± 1.51
6.09GlyLeu: 6.09 ± 1.372
0.812GlyMet: 0.812 ± 0.421
3.248GlyAsn: 3.248 ± 1.14
2.436GlyPro: 2.436 ± 0.601
1.624GlyGln: 1.624 ± 0.514
6.496GlyArg: 6.496 ± 2.249
6.09GlySer: 6.09 ± 1.255
4.466GlyThr: 4.466 ± 0.617
6.496GlyVal: 6.496 ± 3.99
0.812GlyTrp: 0.812 ± 0.421
3.654GlyTyr: 3.654 ± 1.494
0.0GlyXaa: 0.0 ± 0.0
His
2.436HisAla: 2.436 ± 1.954
0.406HisCys: 0.406 ± 0.211
1.624HisAsp: 1.624 ± 0.843
2.03HisGlu: 2.03 ± 0.603
0.406HisPhe: 0.406 ± 0.211
2.436HisGly: 2.436 ± 1.007
1.624HisHis: 1.624 ± 0.514
2.03HisIle: 2.03 ± 1.054
2.03HisLys: 2.03 ± 0.682
1.624HisLeu: 1.624 ± 0.92
0.0HisMet: 0.0 ± 0.0
1.218HisAsn: 1.218 ± 0.787
1.624HisPro: 1.624 ± 0.514
0.0HisGln: 0.0 ± 0.0
1.218HisArg: 1.218 ± 0.632
1.624HisSer: 1.624 ± 0.644
1.624HisThr: 1.624 ± 0.843
0.406HisVal: 0.406 ± 0.211
0.812HisTrp: 0.812 ± 0.421
0.406HisTyr: 0.406 ± 0.764
0.0HisXaa: 0.0 ± 0.0
Ile
3.248IleAla: 3.248 ± 1.139
0.406IleCys: 0.406 ± 0.211
3.654IleAsp: 3.654 ± 1.366
2.436IleGlu: 2.436 ± 1.18
2.03IlePhe: 2.03 ± 0.682
2.436IleGly: 2.436 ± 1.281
1.218IleHis: 1.218 ± 0.632
0.812IleIle: 0.812 ± 0.421
4.872IleLys: 4.872 ± 0.994
4.06IleLeu: 4.06 ± 2.107
2.842IleMet: 2.842 ± 2.486
2.03IleAsn: 2.03 ± 0.678
1.624IlePro: 1.624 ± 0.843
0.812IleGln: 0.812 ± 0.576
2.436IleArg: 2.436 ± 0.926
6.902IleSer: 6.902 ± 2.406
3.654IleThr: 3.654 ± 1.366
2.842IleVal: 2.842 ± 1.622
0.406IleTrp: 0.406 ± 0.211
2.03IleTyr: 2.03 ± 0.791
0.0IleXaa: 0.0 ± 0.0
Lys
4.872LysAla: 4.872 ± 1.384
1.624LysCys: 1.624 ± 0.92
4.872LysAsp: 4.872 ± 2.086
5.278LysGlu: 5.278 ± 1.527
2.436LysPhe: 2.436 ± 0.867
4.872LysGly: 4.872 ± 2.032
1.218LysHis: 1.218 ± 0.503
3.248LysIle: 3.248 ± 1.093
4.872LysLys: 4.872 ± 2.306
8.526LysLeu: 8.526 ± 1.516
2.842LysMet: 2.842 ± 0.584
2.436LysAsn: 2.436 ± 1.574
1.218LysPro: 1.218 ± 0.787
1.624LysGln: 1.624 ± 0.598
2.842LysArg: 2.842 ± 0.671
3.248LysSer: 3.248 ± 1.202
4.06LysThr: 4.06 ± 2.107
5.278LysVal: 5.278 ± 0.521
0.0LysTrp: 0.0 ± 0.0
2.03LysTyr: 2.03 ± 0.734
0.0LysXaa: 0.0 ± 0.0
Leu
4.466LeuAla: 4.466 ± 1.311
2.03LeuCys: 2.03 ± 1.054
6.902LeuAsp: 6.902 ± 3.006
5.278LeuGlu: 5.278 ± 1.012
4.872LeuPhe: 4.872 ± 1.164
6.09LeuGly: 6.09 ± 3.775
2.436LeuHis: 2.436 ± 1.233
5.278LeuIle: 5.278 ± 1.097
6.496LeuLys: 6.496 ± 2.055
5.278LeuLeu: 5.278 ± 1.133
1.624LeuMet: 1.624 ± 0.514
3.654LeuAsn: 3.654 ± 1.51
3.248LeuPro: 3.248 ± 1.202
5.278LeuGln: 5.278 ± 1.648
6.09LeuArg: 6.09 ± 1.837
9.338LeuSer: 9.338 ± 3.577
2.842LeuThr: 2.842 ± 1.026
6.902LeuVal: 6.902 ± 0.79
0.406LeuTrp: 0.406 ± 0.211
4.06LeuTyr: 4.06 ± 1.578
0.0LeuXaa: 0.0 ± 0.0
Met
2.436MetAla: 2.436 ± 1.093
1.624MetCys: 1.624 ± 1.072
1.624MetAsp: 1.624 ± 0.514
1.218MetGlu: 1.218 ± 0.503
2.436MetPhe: 2.436 ± 0.807
3.654MetGly: 3.654 ± 1.326
1.218MetHis: 1.218 ± 0.587
0.406MetIle: 0.406 ± 0.211
0.406MetLys: 0.406 ± 0.211
2.842MetLeu: 2.842 ± 0.965
0.0MetMet: 0.0 ± 0.0
1.218MetAsn: 1.218 ± 0.977
3.248MetPro: 3.248 ± 1.558
0.406MetGln: 0.406 ± 0.764
2.436MetArg: 2.436 ± 1.173
2.842MetSer: 2.842 ± 2.145
0.406MetThr: 0.406 ± 0.211
2.03MetVal: 2.03 ± 0.603
0.0MetTrp: 0.0 ± 0.0
1.624MetTyr: 1.624 ± 0.817
0.0MetXaa: 0.0 ± 0.0
Asn
1.624AsnAla: 1.624 ± 0.843
0.812AsnCys: 0.812 ± 0.421
1.624AsnAsp: 1.624 ± 2.316
2.436AsnGlu: 2.436 ± 1.007
1.218AsnPhe: 1.218 ± 0.617
2.436AsnGly: 2.436 ± 0.743
0.0AsnHis: 0.0 ± 0.0
2.03AsnIle: 2.03 ± 0.965
4.466AsnLys: 4.466 ± 2.366
4.466AsnLeu: 4.466 ± 0.47
1.218AsnMet: 1.218 ± 0.587
0.812AsnAsn: 0.812 ± 0.659
1.624AsnPro: 1.624 ± 0.843
0.406AsnGln: 0.406 ± 0.211
3.248AsnArg: 3.248 ± 1.14
3.654AsnSer: 3.654 ± 1.51
3.248AsnThr: 3.248 ± 0.777
2.842AsnVal: 2.842 ± 0.995
0.0AsnTrp: 0.0 ± 0.0
2.03AsnTyr: 2.03 ± 0.83
0.0AsnXaa: 0.0 ± 0.0
Pro
1.218ProAla: 1.218 ± 0.632
0.406ProCys: 0.406 ± 0.211
1.624ProAsp: 1.624 ± 0.843
4.872ProGlu: 4.872 ± 1.406
1.624ProPhe: 1.624 ± 0.817
2.436ProGly: 2.436 ± 0.743
0.812ProHis: 0.812 ± 0.576
3.654ProIle: 3.654 ± 1.546
1.218ProLys: 1.218 ± 1.272
2.03ProLeu: 2.03 ± 1.054
0.406ProMet: 0.406 ± 0.211
2.03ProAsn: 2.03 ± 1.054
1.624ProPro: 1.624 ± 0.843
1.218ProGln: 1.218 ± 0.587
1.624ProArg: 1.624 ± 0.829
0.812ProSer: 0.812 ± 0.421
1.624ProThr: 1.624 ± 0.843
2.436ProVal: 2.436 ± 0.703
0.812ProTrp: 0.812 ± 0.421
2.03ProTyr: 2.03 ± 0.734
0.0ProXaa: 0.0 ± 0.0
Gln
1.624GlnAla: 1.624 ± 2.161
0.406GlnCys: 0.406 ± 0.211
0.406GlnAsp: 0.406 ± 0.211
2.842GlnGlu: 2.842 ± 0.965
0.406GlnPhe: 0.406 ± 0.211
1.218GlnGly: 1.218 ± 0.632
0.812GlnHis: 0.812 ± 0.576
2.436GlnIle: 2.436 ± 1.173
1.218GlnLys: 1.218 ± 0.632
1.624GlnLeu: 1.624 ± 1.577
2.03GlnMet: 2.03 ± 1.218
1.624GlnAsn: 1.624 ± 0.92
0.812GlnPro: 0.812 ± 0.576
1.218GlnGln: 1.218 ± 0.503
1.218GlnArg: 1.218 ± 0.632
0.406GlnSer: 0.406 ± 0.211
2.436GlnThr: 2.436 ± 1.264
2.436GlnVal: 2.436 ± 1.173
0.0GlnTrp: 0.0 ± 0.0
0.406GlnTyr: 0.406 ± 0.211
0.0GlnXaa: 0.0 ± 0.0
Arg
6.09ArgAla: 6.09 ± 2.028
1.218ArgCys: 1.218 ± 0.503
1.218ArgAsp: 1.218 ± 0.632
3.654ArgGlu: 3.654 ± 0.753
2.842ArgPhe: 2.842 ± 0.995
4.06ArgGly: 4.06 ± 1.916
0.406ArgHis: 0.406 ± 0.211
2.03ArgIle: 2.03 ± 0.603
4.06ArgLys: 4.06 ± 0.924
9.744ArgLeu: 9.744 ± 3.186
3.654ArgMet: 3.654 ± 1.753
2.436ArgAsn: 2.436 ± 1.007
1.218ArgPro: 1.218 ± 1.406
2.436ArgGln: 2.436 ± 1.173
3.654ArgArg: 3.654 ± 4.851
3.248ArgSer: 3.248 ± 0.685
3.248ArgThr: 3.248 ± 2.237
6.09ArgVal: 6.09 ± 3.397
0.406ArgTrp: 0.406 ± 0.211
2.436ArgTyr: 2.436 ± 0.743
0.0ArgXaa: 0.0 ± 0.0
Ser
3.654SerAla: 3.654 ± 0.838
2.436SerCys: 2.436 ± 1.264
6.09SerAsp: 6.09 ± 1.303
6.496SerGlu: 6.496 ± 0.744
1.624SerPhe: 1.624 ± 0.644
3.654SerGly: 3.654 ± 1.896
2.436SerHis: 2.436 ± 1.264
3.654SerIle: 3.654 ± 0.939
5.278SerLys: 5.278 ± 1.358
6.902SerLeu: 6.902 ± 1.924
2.842SerMet: 2.842 ± 0.671
1.218SerAsn: 1.218 ± 0.632
1.218SerPro: 1.218 ± 0.787
2.436SerGln: 2.436 ± 0.807
3.654SerArg: 3.654 ± 3.861
3.654SerSer: 3.654 ± 2.252
1.624SerThr: 1.624 ± 0.514
4.466SerVal: 4.466 ± 0.917
0.406SerTrp: 0.406 ± 0.211
2.03SerTyr: 2.03 ± 0.734
0.0SerXaa: 0.0 ± 0.0
Thr
2.842ThrAla: 2.842 ± 0.584
1.218ThrCys: 1.218 ± 0.632
2.842ThrAsp: 2.842 ± 0.615
3.654ThrGlu: 3.654 ± 1.76
4.06ThrPhe: 4.06 ± 1.578
2.436ThrGly: 2.436 ± 1.007
1.218ThrHis: 1.218 ± 0.632
3.248ThrIle: 3.248 ± 0.704
3.248ThrLys: 3.248 ± 1.208
5.684ThrLeu: 5.684 ± 2.2
1.218ThrMet: 1.218 ± 0.787
2.842ThrAsn: 2.842 ± 1.246
1.218ThrPro: 1.218 ± 0.787
0.406ThrGln: 0.406 ± 0.211
4.466ThrArg: 4.466 ± 1.502
1.624ThrSer: 1.624 ± 0.598
2.436ThrThr: 2.436 ± 0.614
5.278ThrVal: 5.278 ± 1.767
0.406ThrTrp: 0.406 ± 0.211
2.436ThrTyr: 2.436 ± 1.264
0.0ThrXaa: 0.0 ± 0.0
Val
5.684ValAla: 5.684 ± 2.066
1.218ValCys: 1.218 ± 0.787
3.654ValAsp: 3.654 ± 1.13
6.902ValGlu: 6.902 ± 1.562
4.06ValPhe: 4.06 ± 1.084
4.06ValGly: 4.06 ± 1.753
3.654ValHis: 3.654 ± 0.941
4.466ValIle: 4.466 ± 0.726
6.496ValLys: 6.496 ± 1.782
6.902ValLeu: 6.902 ± 1.75
2.03ValMet: 2.03 ± 1.5
3.654ValAsn: 3.654 ± 1.759
1.218ValPro: 1.218 ± 0.952
2.842ValGln: 2.842 ± 0.965
5.278ValArg: 5.278 ± 1.091
4.06ValSer: 4.06 ± 1.578
3.248ValThr: 3.248 ± 1.739
6.09ValVal: 6.09 ± 2.246
0.406ValTrp: 0.406 ± 0.211
2.03ValTyr: 2.03 ± 1.411
0.0ValXaa: 0.0 ± 0.0
Trp
0.812TrpAla: 0.812 ± 0.421
0.406TrpCys: 0.406 ± 0.211
0.0TrpAsp: 0.0 ± 0.0
0.812TrpGlu: 0.812 ± 0.659
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.406TrpHis: 0.406 ± 0.211
0.0TrpIle: 0.0 ± 0.0
0.406TrpLys: 0.406 ± 0.211
0.812TrpLeu: 0.812 ± 0.659
0.406TrpMet: 0.406 ± 0.211
0.0TrpAsn: 0.0 ± 0.0
0.812TrpPro: 0.812 ± 0.421
0.0TrpGln: 0.0 ± 0.0
0.406TrpArg: 0.406 ± 0.211
1.218TrpSer: 1.218 ± 0.632
1.218TrpThr: 1.218 ± 0.632
0.0TrpVal: 0.0 ± 0.0
0.406TrpTrp: 0.406 ± 0.211
0.406TrpTyr: 0.406 ± 0.211
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.03TyrAla: 2.03 ± 1.054
0.0TyrCys: 0.0 ± 0.0
2.436TyrAsp: 2.436 ± 0.601
4.06TyrGlu: 4.06 ± 0.902
0.812TyrPhe: 0.812 ± 0.576
1.624TyrGly: 1.624 ± 0.829
0.812TyrHis: 0.812 ± 0.576
2.03TyrIle: 2.03 ± 0.717
0.812TyrLys: 0.812 ± 0.421
4.872TyrLeu: 4.872 ± 1.254
0.812TyrMet: 0.812 ± 0.687
3.248TyrAsn: 3.248 ± 1.065
2.842TyrPro: 2.842 ± 0.671
0.406TyrGln: 0.406 ± 0.706
2.03TyrArg: 2.03 ± 0.734
4.06TyrSer: 4.06 ± 0.961
2.436TyrThr: 2.436 ± 0.743
1.218TyrVal: 1.218 ± 0.632
0.406TyrTrp: 0.406 ± 0.211
1.218TyrTyr: 1.218 ± 0.632
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.406XaaAla: 0.406 ± 0.211
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2464 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski