Amino acid dipeptide frequency for Allium virus X

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
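As an illustration, a minimal Python sketch of how such dipeptide frequencies could be counted is given below. It is not the code used by the database: the function name dipeptide_per_mille, the use of one-letter sequence strings, the choice not to count pairs across protein boundaries, and the normalization by the total number of pairs are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: sequences are one-letter amino acid strings, and pairs
    are counted only within a protein, never across two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs: positions (0,1), (1,2), ...
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (made-up sequences, not Allium virus X data).
freqs = dipeptide_per_mille(["MALA", "ALK"])
```

In this toy input there are five pairs in total (MA, AL, LA, AL, LK), so "AL" gets 2/5 of the total, i.e. 400‰.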

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
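The unit conversion and the comparison with the random expectation can be sketched in a few lines of Python. The names per_mille_to_fraction and classify are hypothetical, and the simple greater/smaller comparison against 1/400 is an assumption; the page does not state the exact rule used for the red and blue marking.

```python
# Uniform expectation over the 20 x 20 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # = 2.5, i.e. 0.25%


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation.

    Assumption: a plain comparison, ignoring the bootstrap error.
    """
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"


ala_ala = per_mille_to_fraction(5.937)  # AlaAla value from the table
label_high = classify(5.937)            # AlaAla
label_low = classify(0.848)             # AlaCys
```

Under these assumptions AlaAla (5.937‰, a fraction of about 0.0059) would count as overrepresented and AlaCys (0.848‰) as underrepresented.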

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.937 ± 1.813
AlaCys: 0.848 ± 0.937
AlaAsp: 4.665 ± 1.136
AlaGlu: 4.241 ± 1.19
AlaPhe: 3.393 ± 1.282
AlaGly: 3.817 ± 3.062
AlaHis: 1.696 ± 2.711
AlaIle: 3.393 ± 1.552
AlaLys: 5.513 ± 2.995
AlaLeu: 11.026 ± 1.776
AlaMet: 2.12 ± 1.138
AlaAsn: 2.545 ± 0.859
AlaPro: 4.241 ± 1.126
AlaGln: 3.817 ± 1.413
AlaArg: 4.665 ± 1.975
AlaSer: 5.937 ± 1.515
AlaThr: 7.209 ± 2.216
AlaVal: 5.089 ± 3.565
AlaTrp: 0.424 ± 0.23
AlaTyr: 2.545 ± 1.204
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.696 ± 0.898
CysCys: 0.0 ± 0.0
CysAsp: 1.272 ± 1.233
CysGlu: 0.424 ± 0.23
CysPhe: 1.272 ± 0.691
CysGly: 0.848 ± 0.937
CysHis: 0.0 ± 0.0
CysIle: 0.848 ± 0.461
CysLys: 0.848 ± 0.461
CysLeu: 0.848 ± 0.461
CysMet: 0.424 ± 0.23
CysAsn: 0.424 ± 0.23
CysPro: 2.12 ± 2.714
CysGln: 0.848 ± 0.461
CysArg: 0.848 ± 0.692
CysSer: 1.272 ± 1.563
CysThr: 0.424 ± 0.23
CysVal: 1.272 ± 0.888
CysTrp: 0.424 ± 0.23
CysTyr: 1.696 ± 1.381
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.817 ± 1.289
AspCys: 1.272 ± 1.188
AspAsp: 2.12 ± 0.689
AspGlu: 2.12 ± 0.815
AspPhe: 2.969 ± 1.034
AspGly: 2.12 ± 1.658
AspHis: 2.545 ± 0.852
AspIle: 3.817 ± 2.074
AspLys: 2.969 ± 0.67
AspLeu: 2.969 ± 1.613
AspMet: 0.0 ± 0.0
AspAsn: 1.272 ± 1.233
AspPro: 2.969 ± 1.226
AspGln: 2.12 ± 1.152
AspArg: 1.272 ± 1.188
AspSer: 3.817 ± 0.986
AspThr: 4.241 ± 1.59
AspVal: 2.545 ± 0.821
AspTrp: 0.848 ± 0.461
AspTyr: 0.424 ± 0.23
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.665 ± 1.137
GluCys: 2.12 ± 1.152
GluAsp: 2.12 ± 1.152
GluGlu: 1.696 ± 0.629
GluPhe: 2.545 ± 0.821
GluGly: 2.545 ± 0.821
GluHis: 2.12 ± 1.152
GluIle: 2.969 ± 1.034
GluLys: 3.393 ± 1.227
GluLeu: 6.361 ± 1.587
GluMet: 0.0 ± 0.0
GluAsn: 2.545 ± 1.382
GluPro: 4.665 ± 1.136
GluGln: 2.12 ± 0.717
GluArg: 2.12 ± 1.152
GluSer: 2.12 ± 1.152
GluThr: 2.545 ± 0.821
GluVal: 2.545 ± 1.15
GluTrp: 0.424 ± 0.23
GluTyr: 1.272 ± 0.629
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.969 ± 0.67
PheCys: 2.12 ± 1.293
PheAsp: 2.969 ± 1.228
PheGlu: 1.696 ± 0.618
PhePhe: 1.696 ± 1.383
PheGly: 2.969 ± 1.613
PheHis: 1.696 ± 0.922
PheIle: 3.393 ± 1.303
PheLys: 2.545 ± 1.382
PheLeu: 6.361 ± 2.611
PheMet: 1.696 ± 0.922
PheAsn: 2.12 ± 0.689
PhePro: 1.696 ± 1.886
PheGln: 2.969 ± 0.989
PheArg: 1.272 ± 0.629
PheSer: 2.969 ± 1.034
PheThr: 4.665 ± 2.293
PheVal: 1.696 ± 0.922
PheTrp: 0.848 ± 0.461
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.817 ± 1.381
GlyCys: 0.848 ± 0.461
GlyAsp: 3.393 ± 0.886
GlyGlu: 3.817 ± 1.174
GlyPhe: 3.817 ± 1.381
GlyGly: 1.696 ± 1.044
GlyHis: 2.12 ± 1.668
GlyIle: 2.545 ± 0.821
GlyLys: 1.696 ± 1.437
GlyLeu: 3.817 ± 1.594
GlyMet: 0.0 ± 0.0
GlyAsn: 1.696 ± 0.618
GlyPro: 4.241 ± 2.197
GlyGln: 2.12 ± 1.331
GlyArg: 1.272 ± 0.62
GlySer: 2.12 ± 1.293
GlyThr: 3.393 ± 1.687
GlyVal: 3.393 ± 2.489
GlyTrp: 0.424 ± 0.23
GlyTyr: 2.12 ± 1.152
GlyXaa: 0.0 ± 0.0
His
HisAla: 5.513 ± 2.04
HisCys: 0.424 ± 0.23
HisAsp: 2.12 ± 1.152
HisGlu: 2.12 ± 0.689
HisPhe: 2.545 ± 0.821
HisGly: 2.969 ± 0.829
HisHis: 2.12 ± 1.812
HisIle: 0.848 ± 0.718
HisLys: 1.272 ± 0.62
HisLeu: 4.241 ± 2.885
HisMet: 0.0 ± 0.0
HisAsn: 1.272 ± 0.691
HisPro: 2.12 ± 1.331
HisGln: 2.12 ± 1.153
HisArg: 2.545 ± 1.259
HisSer: 4.665 ± 3.396
HisThr: 3.393 ± 2.462
HisVal: 1.696 ± 1.074
HisTrp: 0.848 ± 0.461
HisTyr: 1.696 ± 0.922
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.361 ± 1.963
IleCys: 0.424 ± 0.23
IleAsp: 0.424 ± 0.23
IleGlu: 2.969 ± 1.294
IlePhe: 3.817 ± 0.986
IleGly: 0.848 ± 0.718
IleHis: 1.696 ± 0.898
IleIle: 1.696 ± 0.618
IleLys: 2.969 ± 0.989
IleLeu: 4.241 ± 1.59
IleMet: 1.272 ± 0.691
IleAsn: 2.545 ± 1.077
IlePro: 3.393 ± 1.257
IleGln: 2.12 ± 0.689
IleArg: 2.12 ± 1.331
IleSer: 3.817 ± 2.048
IleThr: 4.241 ± 1.495
IleVal: 0.848 ± 0.461
IleTrp: 0.0 ± 0.0
IleTyr: 0.848 ± 0.461
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.393 ± 1.227
LysCys: 0.0 ± 0.0
LysAsp: 2.545 ± 1.382
LysGlu: 2.12 ± 0.717
LysPhe: 2.545 ± 1.149
LysGly: 2.12 ± 0.815
LysHis: 1.272 ± 0.691
LysIle: 2.545 ± 0.709
LysLys: 3.393 ± 1.843
LysLeu: 6.361 ± 3.456
LysMet: 0.848 ± 0.461
LysAsn: 2.545 ± 1.077
LysPro: 3.817 ± 1.11
LysGln: 3.817 ± 1.11
LysArg: 2.969 ± 0.989
LysSer: 2.545 ± 0.858
LysThr: 4.665 ± 1.566
LysVal: 3.817 ± 0.628
LysTrp: 0.0 ± 0.0
LysTyr: 1.272 ± 0.62
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.634 ± 1.599
LeuCys: 1.696 ± 1.074
LeuAsp: 6.361 ± 1.453
LeuGlu: 4.241 ± 1.59
LeuPhe: 5.089 ± 1.641
LeuGly: 5.089 ± 1.17
LeuHis: 6.361 ± 0.839
LeuIle: 4.241 ± 2.155
LeuLys: 5.513 ± 2.488
LeuLeu: 9.754 ± 4.49
LeuMet: 0.424 ± 1.14
LeuAsn: 4.241 ± 1.608
LeuPro: 8.482 ± 2.257
LeuGln: 5.513 ± 1.054
LeuArg: 4.665 ± 1.186
LeuSer: 12.723 ± 2.002
LeuThr: 5.937 ± 1.311
LeuVal: 5.937 ± 1.744
LeuTrp: 1.272 ± 0.888
LeuTyr: 2.12 ± 0.689
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.696 ± 0.922
MetCys: 0.0 ± 0.0
MetAsp: 0.0 ± 0.0
MetGlu: 1.696 ± 0.922
MetPhe: 0.424 ± 0.824
MetGly: 0.424 ± 0.23
MetHis: 0.424 ± 0.23
MetIle: 0.848 ± 0.461
MetLys: 1.272 ± 0.691
MetLeu: 2.545 ± 0.859
MetMet: 0.0 ± 0.0
MetAsn: 0.848 ± 0.718
MetPro: 0.848 ± 0.461
MetGln: 1.696 ± 0.629
MetArg: 1.696 ± 1.148
MetSer: 1.272 ± 0.888
MetThr: 2.12 ± 1.293
MetVal: 0.848 ± 0.461
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.393 ± 0.886
AsnCys: 1.272 ± 1.188
AsnAsp: 1.696 ± 1.148
AsnGlu: 1.272 ± 0.691
AsnPhe: 2.545 ± 0.709
AsnGly: 1.696 ± 1.381
AsnHis: 1.272 ± 1.57
AsnIle: 1.696 ± 0.965
AsnLys: 2.969 ± 1.223
AsnLeu: 2.12 ± 0.965
AsnMet: 0.424 ± 0.23
AsnAsn: 1.696 ± 1.383
AsnPro: 4.665 ± 1.816
AsnGln: 1.696 ± 0.629
AsnArg: 0.848 ± 0.692
AsnSer: 4.241 ± 1.19
AsnThr: 4.665 ± 1.136
AsnVal: 0.848 ± 0.692
AsnTrp: 0.848 ± 0.692
AsnTyr: 1.696 ± 1.874
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.665 ± 1.137
ProCys: 1.272 ± 0.691
ProAsp: 5.089 ± 0.926
ProGlu: 4.665 ± 1.566
ProPhe: 3.393 ± 1.227
ProGly: 3.393 ± 1.906
ProHis: 4.241 ± 3.927
ProIle: 4.241 ± 1.166
ProLys: 4.665 ± 1.85
ProLeu: 6.361 ± 4.543
ProMet: 0.424 ± 0.961
ProAsn: 2.12 ± 2.497
ProPro: 8.482 ± 3.036
ProGln: 1.696 ± 0.922
ProArg: 2.545 ± 0.821
ProSer: 3.817 ± 1.921
ProThr: 6.361 ± 2.602
ProVal: 1.696 ± 0.922
ProTrp: 1.272 ± 0.629
ProTyr: 1.272 ± 0.629
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.393 ± 0.707
GlnCys: 0.848 ± 1.269
GlnAsp: 2.545 ± 1.204
GlnGlu: 2.969 ± 0.989
GlnPhe: 1.272 ± 0.691
GlnGly: 2.545 ± 0.821
GlnHis: 1.272 ± 1.142
GlnIle: 1.272 ± 0.691
GlnLys: 1.272 ± 0.691
GlnLeu: 7.634 ± 2.578
GlnMet: 0.848 ± 0.461
GlnAsn: 3.393 ± 1.075
GlnPro: 3.817 ± 0.977
GlnGln: 2.969 ± 1.613
GlnArg: 1.272 ± 0.691
GlnSer: 2.545 ± 0.859
GlnThr: 2.545 ± 0.859
GlnVal: 1.272 ± 2.647
GlnTrp: 1.696 ± 0.629
GlnTyr: 0.848 ± 0.692
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.969 ± 1.673
ArgCys: 0.848 ± 1.413
ArgAsp: 1.696 ± 0.922
ArgGlu: 2.969 ± 1.613
ArgPhe: 1.696 ± 1.383
ArgGly: 2.969 ± 0.829
ArgHis: 0.848 ± 0.718
ArgIle: 2.969 ± 1.613
ArgLys: 0.848 ± 0.692
ArgLeu: 4.241 ± 1.378
ArgMet: 1.696 ± 0.922
ArgAsn: 1.272 ± 0.691
ArgPro: 2.969 ± 0.989
ArgGln: 2.545 ± 1.149
ArgArg: 3.817 ± 1.402
ArgSer: 1.696 ± 0.629
ArgThr: 4.241 ± 1.084
ArgVal: 1.696 ± 0.922
ArgTrp: 0.0 ± 0.0
ArgTyr: 2.12 ± 1.152
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.241 ± 3.588
SerCys: 0.848 ± 1.269
SerAsp: 2.545 ± 1.382
SerGlu: 3.817 ± 1.381
SerPhe: 2.12 ± 1.152
SerGly: 4.665 ± 1.292
SerHis: 5.513 ± 3.443
SerIle: 2.969 ± 1.772
SerLys: 3.393 ± 0.886
SerLeu: 5.937 ± 3.532
SerMet: 2.12 ± 0.7
SerAsn: 4.241 ± 1.137
SerPro: 2.545 ± 1.149
SerGln: 4.241 ± 1.643
SerArg: 2.545 ± 0.821
SerSer: 3.393 ± 1.852
SerThr: 7.209 ± 2.604
SerVal: 2.969 ± 0.829
SerTrp: 0.424 ± 0.824
SerTyr: 2.545 ± 1.382
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.785 ± 2.757
ThrCys: 1.272 ± 0.691
ThrAsp: 3.393 ± 0.707
ThrGlu: 4.241 ± 1.434
ThrPhe: 3.393 ± 0.707
ThrGly: 4.665 ± 1.325
ThrHis: 4.241 ± 1.132
ThrIle: 2.12 ± 1.152
ThrLys: 2.545 ± 1.503
ThrLeu: 12.723 ± 3.015
ThrMet: 3.817 ± 0.933
ThrAsn: 3.393 ± 2.823
ThrPro: 6.785 ± 3.288
ThrGln: 1.272 ± 1.188
ThrArg: 2.12 ± 0.689
ThrSer: 6.785 ± 1.952
ThrThr: 4.665 ± 2.564
ThrVal: 1.696 ± 0.618
ThrTrp: 0.424 ± 0.23
ThrTyr: 2.545 ± 1.077
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.241 ± 1.148
ValCys: 0.848 ± 0.718
ValAsp: 0.848 ± 0.718
ValGlu: 2.969 ± 0.989
ValPhe: 2.545 ± 1.883
ValGly: 1.696 ± 2.161
ValHis: 3.393 ± 1.179
ValIle: 1.696 ± 1.074
ValLys: 2.545 ± 1.15
ValLeu: 5.937 ± 0.974
ValMet: 1.272 ± 0.62
ValAsn: 2.545 ± 1.149
ValPro: 2.12 ± 0.815
ValGln: 1.272 ± 0.691
ValArg: 2.12 ± 1.152
ValSer: 1.272 ± 0.888
ValThr: 3.817 ± 2.174
ValVal: 3.817 ± 2.622
ValTrp: 0.848 ± 1.709
ValTyr: 0.848 ± 0.461
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.12 ± 1.348
TrpCys: 0.0 ± 0.0
TrpAsp: 0.424 ± 0.23
TrpGlu: 0.848 ± 0.461
TrpPhe: 0.0 ± 0.0
TrpGly: 0.424 ± 0.23
TrpHis: 0.0 ± 0.0
TrpIle: 0.848 ± 0.461
TrpLys: 1.272 ± 0.62
TrpLeu: 0.848 ± 0.461
TrpMet: 0.0 ± 0.0
TrpAsn: 0.848 ± 0.692
TrpPro: 0.0 ± 0.0
TrpGln: 0.848 ± 0.692
TrpArg: 1.272 ± 0.691
TrpSer: 0.0 ± 0.0
TrpThr: 0.424 ± 1.385
TrpVal: 1.272 ± 0.691
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 4.241 ± 0.96
TyrCys: 0.848 ± 0.937
TyrAsp: 0.0 ± 0.0
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.272 ± 0.691
TyrGly: 0.848 ± 0.461
TyrHis: 1.696 ± 0.629
TyrIle: 1.696 ± 1.401
TyrLys: 1.696 ± 0.922
TyrLeu: 2.969 ± 0.838
TyrMet: 0.848 ± 0.461
TyrAsn: 0.0 ± 0.0
TyrPro: 1.696 ± 0.618
TyrGln: 0.424 ± 0.23
TyrArg: 2.12 ± 0.965
TyrSer: 1.272 ± 0.691
TyrThr: 2.12 ± 1.153
TyrVal: 1.696 ± 0.922
TyrTrp: 0.424 ± 0.23
TyrTyr: 0.848 ± 0.692
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2359 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
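A protein-level bootstrap can be sketched as follows. This is only an illustration under stated assumptions, not the database's actual procedure: the function names pair_counts and bootstrap_error are hypothetical, whole proteins are resampled with replacement, and the error is taken to be the sample standard deviation of the resampled per mille frequencies.

```python
import random
import statistics
from collections import Counter


def pair_counts(proteins):
    """Count adjacent residue pairs within each protein sequence."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    return counts


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of one dipeptide's per mille frequency over n_boot resamples.

    Assumption: the reported error is a sample standard deviation.
    """
    rng = random.Random(seed)  # fixed seed for reproducibility
    estimates = []
    for _ in range(n_boot):
        # Draw whole proteins with replacement, as many as in the input.
        sample = [rng.choice(proteins) for _ in proteins]
        counts = pair_counts(sample)
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return statistics.stdev(estimates)


toy = ["MALA", "ALK", "GGAL"]  # made-up sequences, not Allium virus X data
err_al = bootstrap_error(toy, "AL")
err_ww = bootstrap_error(toy, "WW")  # pair absent from every protein
```

With this scheme, a dipeptide that is absent from every protein has a frequency of zero in every resample, which is consistent with the 0.0 ± 0.0 entries in the table.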

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski