Amino acid dipeptide frequency for Eupatorium vein clearing virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
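
As a rough illustration of such a calculation, the following Python sketch counts overlapping adjacent residue pairs and reports them in per mille. It is not the code used to build this page: the function name dipeptide_per_mille, the use of one-letter codes, the mapping of every non-standard letter to X (Xaa), and the choice not to count pairs across protein boundaries are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Hypothetical helper: pairs are overlapping and never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the standard alphabet becomes X (Xaa).
        seq = "".join(aa if aa in alphabet else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    letters = alphabet + "X"
    # Report all 441 combinations, including those never observed.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(letters, repeat=2)
    }


if __name__ == "__main__":
    freqs = dipeptide_per_mille(["MKKLLE", "MKSSA"])
    for name in ("MK", "KK", "WW"):
        print(name, round(freqs[name], 3))
```

The dictionary always holds 441 keys, so unobserved dipeptides appear with a frequency of 0.0, as in the table on this page.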

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
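
The unit conversion and the comparison with the uniform expectation can be sketched as follows. This is only an illustration: the names per_mille_to_fraction and representation are hypothetical, and it assumes the reference value is 1000/400 = 2.5‰ with no allowance for the error estimate; the page does not state exactly how the red and blue marking is decided. The three example values are copied from the table on this page.

```python
N_DIPEPTIDES = 400  # 20 x 20 standard amino acids (441 if Xaa is included)
EXPECTED_PER_MILLE = 1000.0 / N_DIPEPTIDES  # 2.5 per mille, i.e. 0.25 %


def per_mille_to_fraction(value):
    """Convert a table value given in per mille to a plain fraction."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Label a per-mille value relative to the uniform expectation.

    Assumption: a simple comparison that ignores the bootstrap error.
    """
    if value > expected:
        return "over"   # corresponds to red marking
    if value < expected:
        return "under"  # corresponds to blue marking
    return "expected"


if __name__ == "__main__":
    # Values copied from the table on this page (per mille).
    examples = {"LysLys": 11.474, "AlaAla": 3.972, "CysCys": 0.883}
    for name, value in examples.items():
        fold = value / EXPECTED_PER_MILLE
        print(name, per_mille_to_fraction(value), representation(value), round(fold, 2))
```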

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.972 ± 1.831
AlaCys: 0.441 ± 0.362
AlaAsp: 2.648 ± 0.893
AlaGlu: 3.53 ± 0.805
AlaPhe: 2.207 ± 0.612
AlaGly: 1.765 ± 0.462
AlaHis: 2.207 ± 0.586
AlaIle: 1.765 ± 0.355
AlaLys: 3.53 ± 1.619
AlaLeu: 5.296 ± 0.63
AlaMet: 1.324 ± 0.63
AlaAsn: 4.854 ± 1.911
AlaPro: 2.207 ± 1.01
AlaGln: 0.0 ± 0.0
AlaArg: 1.324 ± 0.825
AlaSer: 3.972 ± 1.289
AlaThr: 1.765 ± 0.604
AlaVal: 3.089 ± 0.873
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.089 ± 1.032
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.883 ± 0.39
CysAsp: 0.0 ± 0.0
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.441 ± 0.763
CysHis: 0.0 ± 0.0
CysIle: 1.324 ± 1.085
CysLys: 1.324 ± 0.36
CysLeu: 1.765 ± 0.517
CysMet: 0.0 ± 0.0
CysAsn: 1.765 ± 1.13
CysPro: 3.089 ± 1.309
CysGln: 0.0 ± 0.0
CysArg: 0.441 ± 0.362
CysSer: 0.441 ± 0.362
CysThr: 0.883 ± 0.772
CysVal: 0.883 ± 0.481
CysTrp: 0.441 ± 0.386
CysTyr: 0.883 ± 0.363
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.089 ± 0.716
AspCys: 1.324 ± 0.686
AspAsp: 5.296 ± 1.885
AspGlu: 4.854 ± 2.362
AspPhe: 1.765 ± 0.726
AspGly: 1.765 ± 1.009
AspHis: 1.324 ± 0.36
AspIle: 4.854 ± 0.851
AspLys: 5.296 ± 1.96
AspLeu: 2.648 ± 1.367
AspMet: 0.883 ± 0.363
AspAsn: 2.648 ± 1.197
AspPro: 2.648 ± 1.34
AspGln: 2.648 ± 1.291
AspArg: 3.089 ± 2.123
AspSer: 2.648 ± 0.777
AspThr: 2.648 ± 1.169
AspVal: 1.324 ± 0.453
AspTrp: 0.0 ± 0.0
AspTyr: 1.765 ± 0.644
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.413 ± 1.982
GluCys: 0.0 ± 0.0
GluAsp: 3.972 ± 0.889
GluGlu: 8.385 ± 3.642
GluPhe: 2.648 ± 1.283
GluGly: 2.648 ± 1.133
GluHis: 2.648 ± 0.722
GluIle: 7.061 ± 1.651
GluLys: 5.296 ± 2.222
GluLeu: 10.15 ± 1.955
GluMet: 1.765 ± 0.766
GluAsn: 6.178 ± 2.109
GluPro: 2.207 ± 1.532
GluGln: 3.089 ± 1.011
GluArg: 1.765 ± 0.653
GluSer: 3.089 ± 1.115
GluThr: 2.648 ± 0.721
GluVal: 2.648 ± 0.752
GluTrp: 0.883 ± 0.772
GluTyr: 1.765 ± 0.355
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.0 ± 0.0
PheCys: 1.324 ± 0.566
PheAsp: 2.648 ± 0.944
PheGlu: 2.207 ± 0.816
PhePhe: 1.324 ± 0.824
PheGly: 0.441 ± 0.362
PheHis: 2.207 ± 1.407
PheIle: 3.089 ± 1.04
PheLys: 2.648 ± 1.785
PheLeu: 5.296 ± 2.327
PheMet: 1.765 ± 0.734
PheAsn: 2.207 ± 1.287
PhePro: 2.207 ± 1.309
PheGln: 2.207 ± 1.047
PheArg: 1.765 ± 0.817
PheSer: 4.413 ± 1.174
PheThr: 2.648 ± 0.522
PheVal: 1.324 ± 0.785
PheTrp: 0.441 ± 0.362
PheTyr: 1.765 ± 1.111
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.648 ± 0.976
GlyCys: 0.441 ± 0.386
GlyAsp: 1.324 ± 0.67
GlyGlu: 4.854 ± 2.154
GlyPhe: 2.207 ± 0.709
GlyGly: 2.207 ± 0.612
GlyHis: 0.883 ± 0.39
GlyIle: 5.296 ± 1.158
GlyLys: 5.296 ± 0.901
GlyLeu: 4.413 ± 1.999
GlyMet: 0.441 ± 0.529
GlyAsn: 3.53 ± 1.531
GlyPro: 2.648 ± 1.459
GlyGln: 1.324 ± 0.744
GlyArg: 1.324 ± 0.67
GlySer: 2.207 ± 0.912
GlyThr: 4.413 ± 1.626
GlyVal: 0.883 ± 0.504
GlyTrp: 0.0 ± 0.0
GlyTyr: 2.207 ± 0.976
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.883 ± 0.504
HisCys: 0.441 ± 0.362
HisAsp: 1.324 ± 1.048
HisGlu: 2.648 ± 0.727
HisPhe: 0.883 ± 0.481
HisGly: 0.0 ± 0.0
HisHis: 0.441 ± 0.414
HisIle: 2.648 ± 0.92
HisLys: 2.207 ± 0.715
HisLeu: 2.207 ± 0.447
HisMet: 0.883 ± 0.724
HisAsn: 1.324 ± 0.744
HisPro: 0.441 ± 0.386
HisGln: 0.441 ± 0.414
HisArg: 1.324 ± 0.689
HisSer: 3.53 ± 0.968
HisThr: 0.0 ± 0.0
HisVal: 0.441 ± 0.362
HisTrp: 0.0 ± 0.0
HisTyr: 1.324 ± 0.645
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.207 ± 0.694
IleCys: 1.765 ± 0.653
IleAsp: 4.854 ± 2.623
IleGlu: 2.648 ± 0.832
IlePhe: 3.972 ± 0.989
IleGly: 4.854 ± 1.842
IleHis: 3.089 ± 1.334
IleIle: 5.296 ± 1.792
IleLys: 7.061 ± 2.206
IleLeu: 5.737 ± 2.276
IleMet: 1.765 ± 1.078
IleAsn: 4.854 ± 1.842
IlePro: 6.178 ± 1.161
IleGln: 3.972 ± 1.399
IleArg: 2.648 ± 0.766
IleSer: 4.413 ± 1.21
IleThr: 3.972 ± 1.031
IleVal: 2.207 ± 1.137
IleTrp: 0.0 ± 0.0
IleTyr: 3.972 ± 1.342
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.854 ± 1.703
LysCys: 1.324 ± 1.157
LysAsp: 6.178 ± 2.735
LysGlu: 3.972 ± 1.504
LysPhe: 3.089 ± 0.687
LysGly: 7.061 ± 1.579
LysHis: 0.441 ± 0.362
LysIle: 8.385 ± 1.449
LysLys: 11.474 ± 3.013
LysLeu: 8.385 ± 2.317
LysMet: 1.324 ± 0.68
LysAsn: 2.207 ± 0.838
LysPro: 7.502 ± 1.077
LysGln: 3.972 ± 1.414
LysArg: 2.207 ± 0.816
LysSer: 10.15 ± 2.213
LysThr: 5.737 ± 1.535
LysVal: 7.944 ± 1.906
LysTrp: 0.441 ± 0.386
LysTyr: 2.207 ± 0.586
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.854 ± 0.976
LeuCys: 0.883 ± 0.504
LeuAsp: 5.296 ± 1.113
LeuGlu: 7.502 ± 1.755
LeuPhe: 2.207 ± 0.976
LeuGly: 6.62 ± 1.18
LeuHis: 2.207 ± 0.709
LeuIle: 7.502 ± 1.256
LeuLys: 9.267 ± 1.853
LeuLeu: 7.061 ± 1.334
LeuMet: 1.324 ± 0.36
LeuAsn: 4.854 ± 1.718
LeuPro: 2.648 ± 1.118
LeuGln: 3.53 ± 0.709
LeuArg: 3.972 ± 1.375
LeuSer: 6.62 ± 2.132
LeuThr: 5.737 ± 1.479
LeuVal: 5.296 ± 0.623
LeuTrp: 0.883 ± 0.39
LeuTyr: 0.883 ± 0.39
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.207 ± 0.447
MetCys: 0.441 ± 0.362
MetAsp: 0.441 ± 0.386
MetGlu: 2.207 ± 0.577
MetPhe: 1.765 ± 0.355
MetGly: 0.883 ± 0.772
MetHis: 0.0 ± 0.0
MetIle: 0.441 ± 0.568
MetLys: 0.441 ± 0.362
MetLeu: 1.765 ± 1.193
MetMet: 0.0 ± 0.0
MetAsn: 0.883 ± 0.481
MetPro: 0.441 ± 0.362
MetGln: 0.883 ± 0.363
MetArg: 1.324 ± 0.645
MetSer: 1.765 ± 0.607
MetThr: 1.765 ± 0.89
MetVal: 1.324 ± 1.085
MetTrp: 0.441 ± 0.362
MetTyr: 0.883 ± 0.504
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.53 ± 2.018
AsnCys: 0.883 ± 0.39
AsnAsp: 3.53 ± 0.669
AsnGlu: 5.296 ± 2.094
AsnPhe: 3.972 ± 1.544
AsnGly: 2.207 ± 0.875
AsnHis: 0.0 ± 0.0
AsnIle: 3.972 ± 1.555
AsnLys: 5.296 ± 1.246
AsnLeu: 7.944 ± 2.283
AsnMet: 1.324 ± 0.864
AsnAsn: 2.648 ± 0.686
AsnPro: 3.53 ± 1.73
AsnGln: 3.089 ± 0.767
AsnArg: 2.648 ± 0.835
AsnSer: 3.972 ± 1.454
AsnThr: 2.207 ± 0.681
AsnVal: 3.972 ± 1.004
AsnTrp: 0.883 ± 1.08
AsnTyr: 1.324 ± 0.63
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.648 ± 0.721
ProCys: 0.441 ± 0.763
ProAsp: 0.441 ± 0.386
ProGlu: 7.061 ± 1.441
ProPhe: 1.324 ± 0.645
ProGly: 3.089 ± 0.842
ProHis: 1.324 ± 0.826
ProIle: 1.765 ± 0.571
ProLys: 5.737 ± 1.098
ProLeu: 1.324 ± 0.453
ProMet: 0.441 ± 0.503
ProAsn: 4.854 ± 2.148
ProPro: 3.972 ± 2.969
ProGln: 3.089 ± 0.981
ProArg: 1.765 ± 1.128
ProSer: 4.854 ± 0.85
ProThr: 3.089 ± 0.996
ProVal: 3.089 ± 0.732
ProTrp: 0.883 ± 0.609
ProTyr: 0.883 ± 0.77
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.765 ± 0.517
GlnCys: 0.883 ± 0.363
GlnAsp: 2.207 ± 0.577
GlnGlu: 3.53 ± 0.59
GlnPhe: 3.089 ± 0.928
GlnGly: 1.324 ± 0.623
GlnHis: 0.441 ± 0.414
GlnIle: 2.207 ± 0.728
GlnLys: 3.089 ± 0.732
GlnLeu: 6.178 ± 2.0
GlnMet: 1.324 ± 0.686
GlnAsn: 1.324 ± 0.781
GlnPro: 1.765 ± 0.644
GlnGln: 1.765 ± 0.95
GlnArg: 1.765 ± 0.607
GlnSer: 2.648 ± 0.804
GlnThr: 2.207 ± 1.013
GlnVal: 2.648 ± 0.903
GlnTrp: 0.441 ± 0.362
GlnTyr: 0.441 ± 0.414
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.441 ± 0.386
ArgCys: 0.441 ± 0.362
ArgAsp: 1.324 ± 0.63
ArgGlu: 2.207 ± 1.292
ArgPhe: 0.883 ± 0.363
ArgGly: 1.324 ± 0.686
ArgHis: 1.765 ± 0.934
ArgIle: 2.648 ± 0.639
ArgLys: 6.178 ± 2.256
ArgLeu: 3.53 ± 1.122
ArgMet: 0.883 ± 0.724
ArgAsn: 2.648 ± 0.874
ArgPro: 1.324 ± 1.157
ArgGln: 1.324 ± 0.418
ArgArg: 1.765 ± 0.743
ArgSer: 2.648 ± 0.575
ArgThr: 1.765 ± 0.734
ArgVal: 1.324 ± 0.645
ArgTrp: 0.441 ± 0.362
ArgTyr: 2.207 ± 1.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.648 ± 0.909
SerCys: 1.324 ± 0.67
SerAsp: 2.207 ± 1.099
SerGlu: 4.413 ± 1.34
SerPhe: 3.972 ± 0.663
SerGly: 5.737 ± 1.018
SerHis: 1.765 ± 0.949
SerIle: 6.178 ± 2.192
SerLys: 9.267 ± 1.589
SerLeu: 4.854 ± 0.946
SerMet: 2.207 ± 0.519
SerAsn: 3.53 ± 0.992
SerPro: 3.53 ± 0.56
SerGln: 3.53 ± 1.323
SerArg: 3.089 ± 1.303
SerSer: 10.15 ± 2.792
SerThr: 2.648 ± 1.052
SerVal: 1.765 ± 0.599
SerTrp: 2.207 ± 0.922
SerTyr: 2.207 ± 1.128
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.972 ± 0.75
ThrCys: 0.441 ± 0.362
ThrAsp: 3.972 ± 1.261
ThrGlu: 2.207 ± 0.669
ThrPhe: 2.648 ± 0.844
ThrGly: 1.765 ± 0.779
ThrHis: 0.441 ± 0.362
ThrIle: 3.972 ± 1.054
ThrLys: 7.061 ± 0.676
ThrLeu: 1.765 ± 0.854
ThrMet: 0.883 ± 0.481
ThrAsn: 5.296 ± 0.919
ThrPro: 1.324 ± 0.931
ThrGln: 1.765 ± 0.984
ThrArg: 0.883 ± 0.609
ThrSer: 5.737 ± 1.404
ThrThr: 2.207 ± 0.748
ThrVal: 3.972 ± 1.359
ThrTrp: 0.883 ± 0.363
ThrTyr: 1.765 ± 0.571
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.765 ± 0.355
ValCys: 0.441 ± 0.362
ValAsp: 2.207 ± 0.691
ValGlu: 4.413 ± 0.818
ValPhe: 2.648 ± 0.727
ValGly: 1.765 ± 1.018
ValHis: 1.324 ± 0.821
ValIle: 3.53 ± 0.902
ValLys: 5.296 ± 0.819
ValLeu: 4.413 ± 1.44
ValMet: 0.883 ± 0.481
ValAsn: 3.972 ± 0.824
ValPro: 1.765 ± 0.726
ValGln: 0.883 ± 0.504
ValArg: 2.648 ± 0.84
ValSer: 1.765 ± 0.726
ValThr: 3.53 ± 0.924
ValVal: 2.207 ± 0.925
ValTrp: 0.0 ± 0.0
ValTyr: 2.648 ± 0.678
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.765 ± 0.84
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.441 ± 0.362
TrpGly: 0.441 ± 0.362
TrpHis: 0.0 ± 0.0
TrpIle: 0.441 ± 0.386
TrpLys: 1.324 ± 0.686
TrpLeu: 0.883 ± 0.609
TrpMet: 0.0 ± 0.0
TrpAsn: 0.883 ± 0.752
TrpPro: 0.441 ± 0.763
TrpGln: 0.883 ± 0.724
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 1.324 ± 0.645
TrpVal: 0.883 ± 0.492
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.089 ± 0.848
TyrCys: 0.441 ± 0.414
TyrAsp: 0.883 ± 0.772
TyrGlu: 1.324 ± 0.648
TyrPhe: 0.883 ± 0.772
TyrGly: 2.207 ± 0.823
TyrHis: 0.883 ± 0.363
TyrIle: 3.089 ± 1.303
TyrLys: 1.765 ± 0.704
TyrLeu: 3.53 ± 0.647
TyrMet: 0.441 ± 0.386
TyrAsn: 1.765 ± 0.779
TyrPro: 2.648 ± 1.082
TyrGln: 2.648 ± 0.859
TyrArg: 1.324 ± 0.781
TyrSer: 2.207 ± 0.815
TyrThr: 1.765 ± 1.049
TyrVal: 0.883 ± 0.716
TyrTrp: 0.441 ± 0.362
TyrTyr: 0.441 ± 0.358
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2267 amino acids)

Note: The error has been estimated by bootstrapping (100 rounds) at the protein level.
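
A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an assumption-laden illustration, not the database's own procedure: the helper names, the fixed random seed, resampling as many proteins as the proteome contains, and reporting the population standard deviation of the resampled frequencies as the error are all choices made for the example.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Spread of the frequency over protein-level resamples.

    Assumption: the error is the standard deviation of the resampled values.
    Requires a non-empty list of proteins.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKKLLE", "MKSSA", "MAAKK"]  # made-up sequences
    print(bootstrap_error(toy_proteome, "KK"))
```

Because whole proteins are resampled, a dipeptide that never occurs in the proteome gets an error of exactly 0.0, which matches the 0.0 ± 0.0 entries in the table.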

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski