Amino acid dipepetide frequency for Alstroemeria virus X

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.608AlaAla: 5.608 ± 2.022
0.863AlaCys: 0.863 ± 0.63
4.314AlaAsp: 4.314 ± 1.719
3.02AlaGlu: 3.02 ± 0.675
2.157AlaPhe: 2.157 ± 0.958
5.608AlaGly: 5.608 ± 3.192
1.726AlaHis: 1.726 ± 0.814
3.451AlaIle: 3.451 ± 1.146
5.177AlaLys: 5.177 ± 2.167
9.922AlaLeu: 9.922 ± 3.056
3.451AlaMet: 3.451 ± 2.401
6.471AlaAsn: 6.471 ± 2.121
4.314AlaPro: 4.314 ± 3.085
3.451AlaGln: 3.451 ± 1.829
3.02AlaArg: 3.02 ± 1.661
4.745AlaSer: 4.745 ± 1.115
5.608AlaThr: 5.608 ± 0.706
3.883AlaVal: 3.883 ± 1.135
0.431AlaTrp: 0.431 ± 0.229
3.883AlaTyr: 3.883 ± 1.557
0.0AlaXaa: 0.0 ± 0.0
Cys
0.431CysAla: 0.431 ± 0.229
0.0CysCys: 0.0 ± 0.0
0.431CysAsp: 0.431 ± 0.781
2.588CysGlu: 2.588 ± 1.074
0.0CysPhe: 0.0 ± 0.0
1.294CysGly: 1.294 ± 0.537
0.431CysHis: 0.431 ± 0.229
0.431CysIle: 0.431 ± 1.131
1.726CysLys: 1.726 ± 0.814
0.0CysLeu: 0.0 ± 0.0
1.294CysMet: 1.294 ± 0.556
0.863CysAsn: 0.863 ± 0.457
1.726CysPro: 1.726 ± 1.169
0.431CysGln: 0.431 ± 0.229
0.431CysArg: 0.431 ± 0.229
2.588CysSer: 2.588 ± 1.861
0.431CysThr: 0.431 ± 1.192
0.863CysVal: 0.863 ± 1.575
0.0CysTrp: 0.0 ± 0.0
0.863CysTyr: 0.863 ± 0.457
0.0CysXaa: 0.0 ± 0.0
Asp
3.451AspAla: 3.451 ± 1.829
0.0AspCys: 0.0 ± 0.0
3.02AspAsp: 3.02 ± 1.6
2.157AspGlu: 2.157 ± 0.621
4.314AspPhe: 4.314 ± 1.043
2.588AspGly: 2.588 ± 1.403
1.294AspHis: 1.294 ± 0.686
3.451AspIle: 3.451 ± 1.146
1.294AspLys: 1.294 ± 0.686
4.314AspLeu: 4.314 ± 1.621
2.157AspMet: 2.157 ± 1.143
2.157AspAsn: 2.157 ± 0.968
3.883AspPro: 3.883 ± 2.841
1.726AspGln: 1.726 ± 1.099
1.726AspArg: 1.726 ± 0.864
4.745AspSer: 4.745 ± 2.514
4.314AspThr: 4.314 ± 1.07
2.588AspVal: 2.588 ± 0.648
0.431AspTrp: 0.431 ± 0.229
2.157AspTyr: 2.157 ± 1.143
0.0AspXaa: 0.0 ± 0.0
Glu
8.197GluAla: 8.197 ± 2.572
1.294GluCys: 1.294 ± 0.556
1.294GluAsp: 1.294 ± 0.686
2.588GluGlu: 2.588 ± 1.112
3.02GluPhe: 3.02 ± 0.949
1.294GluGly: 1.294 ± 0.686
0.863GluHis: 0.863 ± 0.63
3.451GluIle: 3.451 ± 1.096
3.883GluLys: 3.883 ± 1.505
3.883GluLeu: 3.883 ± 1.612
1.294GluMet: 1.294 ± 0.818
2.588GluAsn: 2.588 ± 0.856
3.883GluPro: 3.883 ± 1.354
1.294GluGln: 1.294 ± 0.537
1.726GluArg: 1.726 ± 0.914
1.294GluSer: 1.294 ± 0.948
4.745GluThr: 4.745 ± 1.378
3.883GluVal: 3.883 ± 1.053
0.431GluTrp: 0.431 ± 0.229
0.863GluTyr: 0.863 ± 0.457
0.0GluXaa: 0.0 ± 0.0
Phe
2.157PheAla: 2.157 ± 1.308
2.588PheCys: 2.588 ± 1.341
3.451PheAsp: 3.451 ± 1.096
3.451PheGlu: 3.451 ± 1.829
1.726PhePhe: 1.726 ± 0.633
2.588PheGly: 2.588 ± 2.127
2.157PheHis: 2.157 ± 0.621
4.745PheIle: 4.745 ± 0.902
1.726PheLys: 1.726 ± 0.534
4.745PheLeu: 4.745 ± 1.115
1.294PheMet: 1.294 ± 0.686
2.588PheAsn: 2.588 ± 1.055
3.451PhePro: 3.451 ± 1.296
1.726PheGln: 1.726 ± 0.914
2.157PheArg: 2.157 ± 0.855
1.726PheSer: 1.726 ± 2.493
2.588PheThr: 2.588 ± 0.648
2.588PheVal: 2.588 ± 0.947
0.0PheTrp: 0.0 ± 0.0
1.294PheTyr: 1.294 ± 0.537
0.0PheXaa: 0.0 ± 0.0
Gly
2.588GlyAla: 2.588 ± 0.947
2.157GlyCys: 2.157 ± 1.034
3.451GlyAsp: 3.451 ± 1.727
1.726GlyGlu: 1.726 ± 0.93
2.588GlyPhe: 2.588 ± 0.905
2.588GlyGly: 2.588 ± 1.657
1.726GlyHis: 1.726 ± 0.914
3.02GlyIle: 3.02 ± 1.781
2.588GlyLys: 2.588 ± 1.074
2.157GlyLeu: 2.157 ± 2.341
0.0GlyMet: 0.0 ± 0.0
1.294GlyAsn: 1.294 ± 0.556
2.157GlyPro: 2.157 ± 1.148
0.863GlyGln: 0.863 ± 0.63
1.294GlyArg: 1.294 ± 0.556
2.588GlySer: 2.588 ± 1.112
3.451GlyThr: 3.451 ± 0.696
3.02GlyVal: 3.02 ± 2.458
1.726GlyTrp: 1.726 ± 0.93
2.157GlyTyr: 2.157 ± 0.813
0.0GlyXaa: 0.0 ± 0.0
His
1.294HisAla: 1.294 ± 0.686
0.863HisCys: 0.863 ± 1.044
0.863HisAsp: 0.863 ± 0.457
0.863HisGlu: 0.863 ± 0.457
3.02HisPhe: 3.02 ± 1.046
2.157HisGly: 2.157 ± 0.7
1.726HisHis: 1.726 ± 0.864
0.431HisIle: 0.431 ± 0.229
1.726HisLys: 1.726 ± 0.914
5.608HisLeu: 5.608 ± 1.661
0.431HisMet: 0.431 ± 1.018
0.863HisAsn: 0.863 ± 1.044
3.02HisPro: 3.02 ± 1.046
2.588HisGln: 2.588 ± 1.055
3.02HisArg: 3.02 ± 1.6
3.451HisSer: 3.451 ± 2.735
2.588HisThr: 2.588 ± 0.905
0.431HisVal: 0.431 ± 0.229
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.883IleAla: 3.883 ± 1.612
1.294IleCys: 1.294 ± 1.374
1.726IleAsp: 1.726 ± 0.93
3.451IleGlu: 3.451 ± 0.772
2.588IlePhe: 2.588 ± 1.074
1.294IleGly: 1.294 ± 0.948
0.863IleHis: 0.863 ± 0.457
2.588IleIle: 2.588 ± 2.747
2.157IleLys: 2.157 ± 1.143
3.883IleLeu: 3.883 ± 0.648
2.157IleMet: 2.157 ± 0.968
2.157IleAsn: 2.157 ± 0.968
3.883IlePro: 3.883 ± 1.354
3.02IleGln: 3.02 ± 1.141
4.314IleArg: 4.314 ± 1.043
3.883IleSer: 3.883 ± 1.284
5.177IleThr: 5.177 ± 2.177
1.726IleVal: 1.726 ± 1.169
0.431IleTrp: 0.431 ± 1.192
1.726IleTyr: 1.726 ± 0.864
0.0IleXaa: 0.0 ± 0.0
Lys
4.745LysAla: 4.745 ± 1.886
0.431LysCys: 0.431 ± 0.229
2.588LysAsp: 2.588 ± 1.371
2.157LysGlu: 2.157 ± 0.968
4.314LysPhe: 4.314 ± 2.88
0.863LysGly: 0.863 ± 0.457
1.294LysHis: 1.294 ± 0.686
3.883LysIle: 3.883 ± 2.057
2.157LysLys: 2.157 ± 0.621
7.765LysLeu: 7.765 ± 1.401
1.726LysMet: 1.726 ± 0.914
1.294LysAsn: 1.294 ± 0.686
5.608LysPro: 5.608 ± 2.971
2.588LysGln: 2.588 ± 0.769
3.883LysArg: 3.883 ± 1.354
3.883LysSer: 3.883 ± 0.648
3.883LysThr: 3.883 ± 1.92
4.314LysVal: 4.314 ± 1.686
1.294LysTrp: 1.294 ± 0.931
0.431LysTyr: 0.431 ± 0.229
0.0LysXaa: 0.0 ± 0.0
Leu
6.903LeuAla: 6.903 ± 6.369
0.863LeuCys: 0.863 ± 1.044
6.04LeuAsp: 6.04 ± 1.609
2.588LeuGlu: 2.588 ± 0.769
4.745LeuPhe: 4.745 ± 2.407
4.314LeuGly: 4.314 ± 1.084
4.314LeuHis: 4.314 ± 1.242
3.451LeuIle: 3.451 ± 3.289
8.197LeuLys: 8.197 ± 2.92
8.628LeuLeu: 8.628 ± 3.689
0.0LeuMet: 0.0 ± 0.0
6.04LeuAsn: 6.04 ± 2.267
4.745LeuPro: 4.745 ± 1.854
3.451LeuGln: 3.451 ± 1.067
3.02LeuArg: 3.02 ± 1.6
6.04LeuSer: 6.04 ± 1.39
8.197LeuThr: 8.197 ± 3.283
3.883LeuVal: 3.883 ± 1.272
0.431LeuTrp: 0.431 ± 0.229
3.883LeuTyr: 3.883 ± 1.885
0.0LeuXaa: 0.0 ± 0.0
Met
2.588MetAla: 2.588 ± 1.067
0.0MetCys: 0.0 ± 0.0
0.863MetAsp: 0.863 ± 0.457
0.431MetGlu: 0.431 ± 1.131
0.863MetPhe: 0.863 ± 0.457
0.863MetGly: 0.863 ± 0.457
1.294MetHis: 1.294 ± 0.686
0.431MetIle: 0.431 ± 0.229
1.294MetLys: 1.294 ± 0.686
2.157MetLeu: 2.157 ± 0.772
0.0MetMet: 0.0 ± 0.0
0.431MetAsn: 0.431 ± 0.229
0.863MetPro: 0.863 ± 1.044
1.726MetGln: 1.726 ± 0.914
1.294MetArg: 1.294 ± 0.686
1.294MetSer: 1.294 ± 0.686
1.726MetThr: 1.726 ± 0.914
1.294MetVal: 1.294 ± 0.686
0.863MetTrp: 0.863 ± 0.568
1.294MetTyr: 1.294 ± 1.214
0.0MetXaa: 0.0 ± 0.0
Asn
5.177AsnAla: 5.177 ± 1.483
2.157AsnCys: 2.157 ± 1.034
3.02AsnAsp: 3.02 ± 0.949
1.294AsnGlu: 1.294 ± 0.537
2.157AsnPhe: 2.157 ± 0.813
2.157AsnGly: 2.157 ± 1.994
3.451AsnHis: 3.451 ± 1.727
2.157AsnIle: 2.157 ± 1.385
3.883AsnLys: 3.883 ± 0.896
4.314AsnLeu: 4.314 ± 1.07
0.863AsnMet: 0.863 ± 0.457
1.726AsnAsn: 1.726 ± 0.914
3.451AsnPro: 3.451 ± 1.111
0.431AsnGln: 0.431 ± 0.229
0.863AsnArg: 0.863 ± 0.568
2.588AsnSer: 2.588 ± 0.648
4.314AsnThr: 4.314 ± 1.719
2.157AsnVal: 2.157 ± 0.855
0.0AsnTrp: 0.0 ± 0.0
2.588AsnTyr: 2.588 ± 0.856
0.0AsnXaa: 0.0 ± 0.0
Pro
6.04ProAla: 6.04 ± 2.339
0.431ProCys: 0.431 ± 0.229
3.451ProAsp: 3.451 ± 1.296
7.765ProGlu: 7.765 ± 2.462
3.02ProPhe: 3.02 ± 1.006
2.157ProGly: 2.157 ± 1.965
1.726ProHis: 1.726 ± 1.26
3.883ProIle: 3.883 ± 1.084
4.745ProLys: 4.745 ± 2.011
3.451ProLeu: 3.451 ± 1.582
0.431ProMet: 0.431 ± 0.229
5.608ProAsn: 5.608 ± 2.679
6.903ProPro: 6.903 ± 3.344
0.863ProGln: 0.863 ± 0.457
1.726ProArg: 1.726 ± 0.814
6.471ProSer: 6.471 ± 1.964
6.04ProThr: 6.04 ± 1.707
4.314ProVal: 4.314 ± 0.809
1.726ProTrp: 1.726 ± 0.93
2.157ProTyr: 2.157 ± 0.621
0.0ProXaa: 0.0 ± 0.0
Gln
1.726GlnAla: 1.726 ± 0.633
0.0GlnCys: 0.0 ± 0.0
3.02GlnAsp: 3.02 ± 1.6
1.726GlnGlu: 1.726 ± 0.534
1.726GlnPhe: 1.726 ± 0.814
0.863GlnGly: 0.863 ± 0.457
3.451GlnHis: 3.451 ± 0.696
1.726GlnIle: 1.726 ± 0.914
1.726GlnLys: 1.726 ± 0.914
3.02GlnLeu: 3.02 ± 1.6
1.726GlnMet: 1.726 ± 1.103
0.863GlnAsn: 0.863 ± 0.457
3.451GlnPro: 3.451 ± 1.346
2.588GlnGln: 2.588 ± 0.905
2.157GlnArg: 2.157 ± 0.772
3.883GlnSer: 3.883 ± 0.919
4.314GlnThr: 4.314 ± 1.043
2.157GlnVal: 2.157 ± 0.772
1.294GlnTrp: 1.294 ± 0.686
0.863GlnTyr: 0.863 ± 0.63
0.0GlnXaa: 0.0 ± 0.0
Arg
2.588ArgAla: 2.588 ± 1.341
0.863ArgCys: 0.863 ± 0.457
3.02ArgAsp: 3.02 ± 1.006
2.157ArgGlu: 2.157 ± 0.772
1.726ArgPhe: 1.726 ± 0.534
1.294ArgGly: 1.294 ± 0.686
0.431ArgHis: 0.431 ± 0.781
1.294ArgIle: 1.294 ± 0.97
2.157ArgLys: 2.157 ± 0.7
1.294ArgLeu: 1.294 ± 0.686
1.294ArgMet: 1.294 ± 0.686
2.588ArgAsn: 2.588 ± 1.371
1.294ArgPro: 1.294 ± 2.229
5.177ArgGln: 5.177 ± 2.225
1.294ArgArg: 1.294 ± 0.931
2.157ArgSer: 2.157 ± 1.101
5.177ArgThr: 5.177 ± 1.537
2.588ArgVal: 2.588 ± 0.75
0.0ArgTrp: 0.0 ± 0.0
1.726ArgTyr: 1.726 ± 0.914
0.0ArgXaa: 0.0 ± 0.0
Ser
5.608SerAla: 5.608 ± 1.203
0.431SerCys: 0.431 ± 0.229
2.157SerAsp: 2.157 ± 1.419
3.02SerGlu: 3.02 ± 0.949
2.588SerPhe: 2.588 ± 0.936
3.451SerGly: 3.451 ± 1.829
2.588SerHis: 2.588 ± 2.719
4.745SerIle: 4.745 ± 1.193
5.608SerLys: 5.608 ± 0.934
4.314SerLeu: 4.314 ± 2.292
0.863SerMet: 0.863 ± 0.457
3.02SerAsn: 3.02 ± 2.958
7.334SerPro: 7.334 ± 2.047
3.451SerGln: 3.451 ± 1.146
1.294SerArg: 1.294 ± 0.556
4.745SerSer: 4.745 ± 2.681
6.04SerThr: 6.04 ± 2.339
4.745SerVal: 4.745 ± 1.56
0.0SerTrp: 0.0 ± 0.0
0.863SerTyr: 0.863 ± 1.044
0.0SerXaa: 0.0 ± 0.0
Thr
7.765ThrAla: 7.765 ± 2.802
0.863ThrCys: 0.863 ± 0.63
2.588ThrAsp: 2.588 ± 0.769
3.451ThrGlu: 3.451 ± 1.829
4.314ThrPhe: 4.314 ± 1.691
3.451ThrGly: 3.451 ± 1.522
3.02ThrHis: 3.02 ± 0.807
3.883ThrIle: 3.883 ± 1.397
4.745ThrLys: 4.745 ± 2.231
9.06ThrLeu: 9.06 ± 2.948
1.294ThrMet: 1.294 ± 0.686
1.294ThrAsn: 1.294 ± 0.686
6.903ThrPro: 6.903 ± 2.614
4.314ThrGln: 4.314 ± 2.286
3.451ThrArg: 3.451 ± 1.629
4.745ThrSer: 4.745 ± 0.964
6.903ThrThr: 6.903 ± 2.999
6.04ThrVal: 6.04 ± 0.883
0.0ThrTrp: 0.0 ± 0.0
2.588ThrTyr: 2.588 ± 0.769
0.0ThrXaa: 0.0 ± 0.0
Val
3.451ValAla: 3.451 ± 1.582
1.294ValCys: 1.294 ± 0.931
3.883ValAsp: 3.883 ± 1.053
3.451ValGlu: 3.451 ± 1.829
1.726ValPhe: 1.726 ± 0.914
3.02ValGly: 3.02 ± 1.669
1.726ValHis: 1.726 ± 0.534
2.157ValIle: 2.157 ± 2.011
2.588ValLys: 2.588 ± 1.055
6.471ValLeu: 6.471 ± 1.561
0.431ValMet: 0.431 ± 0.229
3.451ValAsn: 3.451 ± 1.266
4.314ValPro: 4.314 ± 1.773
2.157ValGln: 2.157 ± 0.772
2.157ValArg: 2.157 ± 1.034
3.02ValSer: 3.02 ± 1.669
3.883ValThr: 3.883 ± 1.616
3.02ValVal: 3.02 ± 1.141
0.431ValTrp: 0.431 ± 0.663
2.588ValTyr: 2.588 ± 0.855
0.0ValXaa: 0.0 ± 0.0
Trp
0.863TrpAla: 0.863 ± 0.568
0.0TrpCys: 0.0 ± 0.0
0.863TrpAsp: 0.863 ± 0.568
1.294TrpGlu: 1.294 ± 0.686
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.431TrpHis: 0.431 ± 0.229
0.431TrpIle: 0.431 ± 0.229
0.863TrpLys: 0.863 ± 0.457
2.157TrpLeu: 2.157 ± 1.965
0.0TrpMet: 0.0 ± 0.0
1.294TrpAsn: 1.294 ± 1.083
0.431TrpPro: 0.431 ± 1.131
0.431TrpGln: 0.431 ± 0.229
0.0TrpArg: 0.0 ± 0.0
0.863TrpSer: 0.863 ± 0.457
0.0TrpThr: 0.0 ± 0.0
0.431TrpVal: 0.431 ± 0.229
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
6.04TyrAla: 6.04 ± 1.075
0.431TyrCys: 0.431 ± 0.229
1.726TyrAsp: 1.726 ± 0.914
2.588TyrGlu: 2.588 ± 0.905
2.157TyrPhe: 2.157 ± 0.958
1.294TyrGly: 1.294 ± 0.686
0.431TyrHis: 0.431 ± 0.229
2.588TyrIle: 2.588 ± 1.371
0.863TyrLys: 0.863 ± 1.044
2.588TyrLeu: 2.588 ± 0.936
0.431TyrMet: 0.431 ± 0.229
2.157TyrAsn: 2.157 ± 0.621
1.294TyrPro: 1.294 ± 0.556
0.431TyrGln: 0.431 ± 0.663
1.294TyrArg: 1.294 ± 1.374
2.157TyrSer: 2.157 ± 0.772
1.294TyrThr: 1.294 ± 0.537
1.294TyrVal: 1.294 ± 0.948
0.863TyrTrp: 0.863 ± 0.457
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2319 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski