Amino acid dipeptide frequency for Elm carlavirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
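The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_frequencies`, the toy sequences, and the choice to normalise by the total number of overlapping dipeptides are all hypothetical, and the database may normalise differently.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_frequencies(proteins):
    """Per-mille frequency of each overlapping dipeptide, pooled over proteins.

    Assumption: counts are normalised by the total number of dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000 * n / total for dp, n in counts.items()}


# All ordered pairs of the 20 standard residues, and the uniform expectation.
all_pairs = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
uniform_per_mille = 1000 / len(all_pairs)  # 400 pairs -> 2.5 per mille (0.25%)

# Hypothetical toy sequences, not taken from the Elm carlavirus proteome.
toy = ["MKVLAAGLLK", "MAVLKK"]
freqs = dipeptide_frequencies(toy)
print(len(all_pairs), uniform_per_mille)
print(sorted(freqs.items(), key=lambda kv: -kv[1])[:3])
```

Adding `X` to the alphabet gives 21 × 21 = 441 pairs, which matches the larger count mentioned above.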

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
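As a small worked example of the unit conversion, the sketch below turns a few table values into fractions and compares them with the uniform expectation of 1/400. The helper names are hypothetical, and comparing the mean against exactly 2.5‰ is an assumption; the colouring rule used on the page may differ (for example, it may take the error into account).

```python
UNIFORM_PER_MILLE = 1000 / 400  # 2.5 per mille, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PER_MILLE):
    """Label a frequency relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


# Mean values copied from the table (in per mille).
examples = {"ValLeu": 10.239, "AlaAla": 2.275, "HisHis": 0.379}
for name, value in examples.items():
    fraction = per_mille_to_fraction(value)
    print(name, f"{fraction:.6f}", f"{fraction * 100:.4f}%", classify(value))
```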

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue. Second residues follow the order Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa.
Ala
AlaAla: 2.275 ± 0.648
AlaCys: 1.138 ± 0.486
AlaAsp: 3.413 ± 1.372
AlaGlu: 3.792 ± 1.448
AlaPhe: 4.551 ± 1.277
AlaGly: 4.551 ± 1.277
AlaHis: 1.517 ± 0.469
AlaIle: 5.309 ± 1.361
AlaLys: 6.068 ± 1.253
AlaLeu: 3.792 ± 1.24
AlaMet: 1.517 ± 0.754
AlaAsn: 4.171 ± 1.168
AlaPro: 1.138 ± 0.949
AlaGln: 2.275 ± 0.785
AlaArg: 3.034 ± 0.964
AlaSer: 5.688 ± 2.165
AlaThr: 3.034 ± 0.938
AlaVal: 3.792 ± 1.036
AlaTrp: 0.379 ± 0.706
AlaTyr: 1.138 ± 0.949
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.517 ± 0.469
CysCys: 0.0 ± 0.0
CysAsp: 2.275 ± 1.222
CysGlu: 1.517 ± 3.074
CysPhe: 2.655 ± 0.797
CysGly: 1.138 ± 0.915
CysHis: 0.379 ± 0.196
CysIle: 1.517 ± 0.469
CysLys: 1.896 ± 0.676
CysLeu: 2.655 ± 0.924
CysMet: 0.758 ± 0.797
CysAsn: 1.517 ± 2.147
CysPro: 0.758 ± 0.393
CysGln: 0.758 ± 0.665
CysArg: 2.275 ± 2.029
CysSer: 0.758 ± 0.393
CysThr: 2.655 ± 1.715
CysVal: 1.896 ± 0.676
CysTrp: 0.379 ± 0.196
CysTyr: 1.517 ± 0.785
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.896 ± 0.676
AspCys: 2.275 ± 0.648
AspAsp: 1.896 ± 0.982
AspGlu: 4.551 ± 1.074
AspPhe: 6.068 ± 1.52
AspGly: 3.413 ± 1.367
AspHis: 1.517 ± 0.469
AspIle: 1.896 ± 0.646
AspLys: 4.171 ± 1.679
AspLeu: 5.309 ± 2.075
AspMet: 1.896 ± 0.888
AspAsn: 1.896 ± 0.937
AspPro: 3.034 ± 1.116
AspGln: 2.275 ± 0.648
AspArg: 1.517 ± 1.147
AspSer: 2.655 ± 2.413
AspThr: 1.138 ± 0.608
AspVal: 6.068 ± 2.16
AspTrp: 0.758 ± 0.393
AspTyr: 1.896 ± 0.937
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.688 ± 2.945
GluCys: 0.379 ± 0.196
GluAsp: 3.413 ± 1.767
GluGlu: 5.309 ± 1.869
GluPhe: 5.309 ± 1.187
GluGly: 6.826 ± 3.165
GluHis: 2.275 ± 0.54
GluIle: 4.93 ± 3.067
GluLys: 3.413 ± 1.275
GluLeu: 6.068 ± 0.775
GluMet: 1.138 ± 0.589
GluAsn: 2.275 ± 0.54
GluPro: 2.655 ± 1.204
GluGln: 2.275 ± 1.178
GluArg: 2.655 ± 0.924
GluSer: 6.447 ± 1.752
GluThr: 1.896 ± 0.676
GluVal: 3.792 ± 0.747
GluTrp: 0.758 ± 0.665
GluTyr: 0.758 ± 1.122
GluXaa: 0.0 ± 0.0
Phe
PheAla: 6.068 ± 1.982
PheCys: 1.517 ± 0.785
PheAsp: 4.171 ± 1.506
PheGlu: 4.93 ± 1.882
PhePhe: 1.896 ± 0.982
PheGly: 4.171 ± 2.726
PheHis: 0.379 ± 0.196
PheIle: 2.275 ± 1.586
PheLys: 3.792 ± 0.747
PheLeu: 7.205 ± 2.381
PheMet: 0.758 ± 0.393
PheAsn: 2.275 ± 0.648
PhePro: 1.138 ± 0.589
PheGln: 2.275 ± 1.178
PheArg: 1.517 ± 0.785
PheSer: 4.551 ± 0.778
PheThr: 5.309 ± 1.262
PheVal: 3.034 ± 1.08
PheTrp: 0.758 ± 0.393
PheTyr: 1.517 ± 0.785
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.413 ± 1.248
GlyCys: 2.275 ± 1.222
GlyAsp: 7.584 ± 2.04
GlyGlu: 4.551 ± 1.661
GlyPhe: 3.792 ± 0.747
GlyGly: 4.551 ± 1.025
GlyHis: 0.379 ± 0.196
GlyIle: 4.551 ± 1.857
GlyLys: 6.447 ± 2.027
GlyLeu: 4.93 ± 2.221
GlyMet: 1.138 ± 0.486
GlyAsn: 1.517 ± 0.785
GlyPro: 1.138 ± 1.635
GlyGln: 1.517 ± 0.785
GlyArg: 5.309 ± 1.262
GlySer: 4.551 ± 1.212
GlyThr: 3.034 ± 0.786
GlyVal: 5.309 ± 1.624
GlyTrp: 0.758 ± 0.393
GlyTyr: 2.275 ± 0.785
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.517 ± 0.469
HisCys: 1.138 ± 1.261
HisAsp: 1.138 ± 0.589
HisGlu: 1.138 ± 1.423
HisPhe: 0.758 ± 0.665
HisGly: 1.517 ± 1.532
HisHis: 0.379 ± 0.196
HisIle: 0.0 ± 0.0
HisLys: 2.275 ± 0.648
HisLeu: 3.034 ± 0.52
HisMet: 0.758 ± 0.393
HisAsn: 0.758 ± 0.938
HisPro: 0.758 ± 0.393
HisGln: 0.379 ± 1.051
HisArg: 1.517 ± 0.785
HisSer: 4.171 ± 0.547
HisThr: 0.379 ± 0.196
HisVal: 1.896 ± 0.966
HisTrp: 0.379 ± 0.196
HisTyr: 1.896 ± 0.676
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.171 ± 2.107
IleCys: 1.896 ± 1.259
IleAsp: 3.034 ± 1.158
IleGlu: 3.034 ± 1.08
IlePhe: 2.655 ± 0.492
IleGly: 3.792 ± 1.036
IleHis: 1.517 ± 0.905
IleIle: 1.896 ± 2.45
IleLys: 5.688 ± 1.758
IleLeu: 3.413 ± 1.042
IleMet: 1.896 ± 0.982
IleAsn: 1.896 ± 0.646
IlePro: 2.275 ± 0.648
IleGln: 1.896 ± 0.531
IleArg: 1.896 ± 2.085
IleSer: 3.792 ± 1.331
IleThr: 3.792 ± 2.792
IleVal: 3.034 ± 2.049
IleTrp: 0.758 ± 0.966
IleTyr: 1.138 ± 0.589
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.171 ± 1.506
LysCys: 1.517 ± 0.469
LysAsp: 2.655 ± 0.797
LysGlu: 6.068 ± 1.257
LysPhe: 4.171 ± 1.011
LysGly: 7.205 ± 1.767
LysHis: 3.792 ± 1.423
LysIle: 3.034 ± 0.52
LysLys: 5.688 ± 2.266
LysLeu: 5.309 ± 0.887
LysMet: 1.896 ± 0.531
LysAsn: 4.171 ± 1.336
LysPro: 4.171 ± 2.583
LysGln: 1.517 ± 2.825
LysArg: 4.93 ± 1.652
LysSer: 5.309 ± 1.485
LysThr: 3.792 ± 1.239
LysVal: 4.93 ± 1.884
LysTrp: 0.758 ± 0.393
LysTyr: 2.655 ± 0.924
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.171 ± 1.603
LeuCys: 2.275 ± 0.785
LeuAsp: 4.93 ± 1.334
LeuGlu: 5.309 ± 0.984
LeuPhe: 4.551 ± 2.013
LeuGly: 7.205 ± 1.686
LeuHis: 0.379 ± 0.196
LeuIle: 4.93 ± 1.312
LeuLys: 9.48 ± 1.591
LeuLeu: 8.343 ± 2.667
LeuMet: 2.275 ± 0.922
LeuAsn: 5.309 ± 2.075
LeuPro: 3.413 ± 1.831
LeuGln: 1.138 ± 2.118
LeuArg: 6.068 ± 1.502
LeuSer: 5.688 ± 2.534
LeuThr: 5.309 ± 1.415
LeuVal: 7.584 ± 1.733
LeuTrp: 0.758 ± 0.393
LeuTyr: 1.896 ± 0.982
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.034 ± 0.964
MetCys: 0.758 ± 0.393
MetAsp: 1.517 ± 0.469
MetGlu: 2.275 ± 0.971
MetPhe: 0.379 ± 0.196
MetGly: 2.275 ± 1.178
MetHis: 1.138 ± 0.589
MetIle: 2.655 ± 0.935
MetLys: 0.0 ± 0.0
MetLeu: 1.896 ± 0.874
MetMet: 0.379 ± 0.196
MetAsn: 0.758 ± 0.665
MetPro: 0.758 ± 0.966
MetGln: 1.138 ± 0.874
MetArg: 2.275 ± 1.178
MetSer: 0.379 ± 0.196
MetThr: 1.138 ± 0.486
MetVal: 0.379 ± 0.196
MetTrp: 0.758 ± 0.573
MetTyr: 0.758 ± 0.573
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.655 ± 0.797
AsnCys: 1.517 ± 0.905
AsnAsp: 1.138 ± 0.486
AsnGlu: 3.792 ± 1.321
AsnPhe: 3.413 ± 1.767
AsnGly: 1.896 ± 0.531
AsnHis: 1.896 ± 0.995
AsnIle: 1.517 ± 1.077
AsnLys: 4.551 ± 2.657
AsnLeu: 4.551 ± 1.218
AsnMet: 0.379 ± 0.196
AsnAsn: 1.896 ± 0.966
AsnPro: 0.379 ± 0.706
AsnGln: 0.758 ± 0.573
AsnArg: 1.896 ± 0.966
AsnSer: 3.034 ± 3.259
AsnThr: 1.138 ± 0.589
AsnVal: 1.896 ± 0.982
AsnTrp: 1.138 ± 0.486
AsnTyr: 1.517 ± 0.785
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.034 ± 2.397
ProCys: 1.517 ± 0.612
ProAsp: 1.896 ± 0.646
ProGlu: 2.275 ± 0.648
ProPhe: 1.517 ± 0.785
ProGly: 2.655 ± 0.915
ProHis: 1.517 ± 2.892
ProIle: 1.896 ± 0.646
ProLys: 3.034 ± 0.964
ProLeu: 1.517 ± 1.067
ProMet: 1.138 ± 1.25
ProAsn: 1.896 ± 0.676
ProPro: 3.413 ± 1.732
ProGln: 0.758 ± 0.573
ProArg: 1.517 ± 0.785
ProSer: 2.275 ± 0.885
ProThr: 3.034 ± 0.93
ProVal: 1.896 ± 1.259
ProTrp: 0.379 ± 0.196
ProTyr: 0.758 ± 0.393
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.517 ± 0.788
GlnCys: 1.138 ± 1.192
GlnAsp: 0.758 ± 0.573
GlnGlu: 3.034 ± 0.964
GlnPhe: 1.138 ± 0.486
GlnGly: 1.896 ± 0.531
GlnHis: 1.138 ± 0.589
GlnIle: 0.758 ± 0.665
GlnLys: 1.138 ± 1.271
GlnLeu: 2.275 ± 0.648
GlnMet: 0.379 ± 0.196
GlnAsn: 0.379 ± 0.196
GlnPro: 1.138 ± 1.271
GlnGln: 0.379 ± 0.706
GlnArg: 0.758 ± 0.393
GlnSer: 2.275 ± 0.937
GlnThr: 0.379 ± 0.706
GlnVal: 1.517 ± 0.469
GlnTrp: 0.379 ± 0.706
GlnTyr: 1.138 ± 0.589
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.034 ± 0.964
ArgCys: 1.138 ± 0.589
ArgAsp: 3.034 ± 1.185
ArgGlu: 4.171 ± 1.062
ArgPhe: 3.413 ± 0.613
ArgGly: 1.896 ± 1.719
ArgHis: 1.517 ± 0.612
ArgIle: 1.138 ± 1.423
ArgLys: 2.655 ± 0.797
ArgLeu: 5.688 ± 1.758
ArgMet: 1.517 ± 0.469
ArgAsn: 2.275 ± 1.72
ArgPro: 1.896 ± 2.194
ArgGln: 0.0 ± 0.0
ArgArg: 3.034 ± 1.571
ArgSer: 3.792 ± 3.524
ArgThr: 3.034 ± 1.571
ArgVal: 2.655 ± 0.797
ArgTrp: 0.758 ± 0.573
ArgTyr: 3.034 ± 1.809
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.792 ± 1.156
SerCys: 1.138 ± 0.608
SerAsp: 5.688 ± 1.852
SerGlu: 3.034 ± 1.964
SerPhe: 2.275 ± 0.785
SerGly: 2.655 ± 0.937
SerHis: 1.896 ± 1.448
SerIle: 5.309 ± 1.409
SerLys: 7.205 ± 1.102
SerLeu: 6.447 ± 2.102
SerMet: 2.655 ± 0.489
SerAsn: 2.275 ± 1.006
SerPro: 3.792 ± 0.573
SerGln: 1.896 ± 0.531
SerArg: 2.275 ± 0.785
SerSer: 7.584 ± 2.972
SerThr: 5.688 ± 4.3
SerVal: 5.309 ± 2.21
SerTrp: 0.0 ± 0.0
SerTyr: 1.517 ± 0.469
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.171 ± 0.88
ThrCys: 2.275 ± 0.785
ThrAsp: 1.517 ± 0.785
ThrGlu: 3.413 ± 1.428
ThrPhe: 6.068 ± 2.125
ThrGly: 6.068 ± 1.831
ThrHis: 1.138 ± 0.486
ThrIle: 3.792 ± 1.681
ThrLys: 4.93 ± 0.614
ThrLeu: 4.551 ± 2.506
ThrMet: 1.517 ± 0.469
ThrAsn: 1.517 ± 1.147
ThrPro: 0.758 ± 0.393
ThrGln: 0.0 ± 0.0
ThrArg: 1.517 ± 2.144
ThrSer: 4.551 ± 1.783
ThrThr: 2.275 ± 0.937
ThrVal: 3.034 ± 1.535
ThrTrp: 0.379 ± 0.706
ThrTyr: 0.379 ± 0.196
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.93 ± 1.997
ValCys: 3.034 ± 2.205
ValAsp: 3.792 ± 0.573
ValGlu: 3.792 ± 1.321
ValPhe: 2.655 ± 0.924
ValGly: 3.034 ± 1.809
ValHis: 1.896 ± 1.259
ValIle: 3.034 ± 1.224
ValLys: 3.792 ± 1.352
ValLeu: 10.239 ± 2.229
ValMet: 0.758 ± 0.573
ValAsn: 1.517 ± 0.469
ValPro: 3.413 ± 0.613
ValGln: 1.138 ± 0.589
ValArg: 3.413 ± 1.139
ValSer: 3.413 ± 0.613
ValThr: 4.93 ± 1.354
ValVal: 4.551 ± 1.833
ValTrp: 0.0 ± 0.0
ValTyr: 1.896 ± 0.937
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.379 ± 0.196
TrpAsp: 1.517 ± 1.147
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.758 ± 0.393
TrpGly: 0.379 ± 0.196
TrpHis: 0.758 ± 0.665
TrpIle: 0.758 ± 0.393
TrpLys: 0.379 ± 0.196
TrpLeu: 1.517 ± 0.905
TrpMet: 0.0 ± 0.0
TrpAsn: 0.758 ± 0.573
TrpPro: 0.379 ± 0.196
TrpGln: 0.379 ± 0.706
TrpArg: 0.758 ± 1.412
TrpSer: 0.379 ± 0.196
TrpThr: 0.0 ± 0.0
TrpVal: 1.138 ± 0.589
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.379 ± 0.196
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.517 ± 0.469
TyrCys: 1.517 ± 1.457
TyrAsp: 1.138 ± 0.589
TyrGlu: 2.275 ± 0.54
TyrPhe: 1.517 ± 0.905
TyrGly: 1.896 ± 0.982
TyrHis: 0.379 ± 0.196
TyrIle: 2.275 ± 1.178
TyrLys: 1.138 ± 0.589
TyrLeu: 2.275 ± 1.178
TyrMet: 1.517 ± 0.785
TyrAsn: 1.896 ± 0.531
TyrPro: 1.517 ± 0.905
TyrGln: 0.758 ± 0.665
TyrArg: 1.517 ± 0.905
TyrSer: 1.138 ± 0.589
TyrThr: 1.896 ± 1.259
TyrVal: 1.517 ± 0.785
TyrTrp: 0.379 ± 0.196
TyrTyr: 0.758 ± 0.665
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2638 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level
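A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: the function names and toy sequences are hypothetical, and it assumes that whole proteins are resampled with replacement 100 times and that the reported error is the standard deviation of the resampled frequencies.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Per-mille frequency of one dipeptide, pooled over the given proteins."""
    hits = total = 0
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        hits += pairs.count(dipeptide)
        total += len(pairs)
    return 1000 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement; return (mean, std dev)."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences, not the five Elm carlavirus proteins.
toy_proteins = ["MKVLAAGLLK", "MAVLKK", "MSSTVLLE", "MKKGDLV", "MAEELLK"]
mean_vl, err_vl = bootstrap_error(toy_proteins, "VL")
print(f"VL: {mean_vl:.3f} ± {err_vl:.3f}")
```

Because whole proteins are resampled, a dipeptide that is absent from every protein always yields 0.0 ± 0.0, as in the Xaa rows of the table.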

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski