Amino acid dipeptide frequency for Bacilli bacterium VT-13-104

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
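As an illustration of what a dipeptide frequency is, here is a minimal sketch that counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name and the toy sequences are hypothetical, sequences are assumed to use one-letter codes (the table uses three-letter codes), and the mapping of non-standard residues to Xaa is not handled. This is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from this proteome).
freqs = dipeptide_frequencies(["MKLLA", "MKV"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

Because pairs are counted within each protein only, a protein of length n contributes n - 1 dipeptides.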

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all over-represented dipeptides are marked in red and all under-represented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
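The arithmetic above can be sketched as follows: the uniform expectation for 400 dipeptides is 1/400 = 0.25% = 2.5‰, and a table value is converted to a fraction by multiplying by 10^-3. The `classify` helper is hypothetical, and it assumes the red/blue marking simply compares each value against that uniform expectation; the exact rule used by the site is not stated here. The three sample values are copied from the table.

```python
N_DIPEPTIDES = 400                          # 20 x 20 standard amino acids
EXPECTED_PER_MILLE = 1000.0 / N_DIPEPTIDES  # 2.5 per mille, i.e. 0.25%


def classify(per_mille):
    """Return (fraction, label) for a table value given in per mille."""
    fraction = per_mille * 1e-3  # per mille -> fraction
    if per_mille > EXPECTED_PER_MILLE:
        label = "over-represented"
    elif per_mille < EXPECTED_PER_MILLE:
        label = "under-represented"
    else:
        label = "as expected"
    return fraction, label


# Values taken from the table for this proteome.
sample = {"LeuLeu": 9.336, "CysCys": 0.071, "ProLys": 2.346}
for name, value in sample.items():
    fraction, label = classify(value)
    print(f"{name}: {value} per mille = {fraction:.6f} ({label})")
```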

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.706 ± 0.086
AlaCys: 0.572 ± 0.024
AlaAsp: 3.06 ± 0.057
AlaGlu: 4.129 ± 0.076
AlaPhe: 3.109 ± 0.055
AlaGly: 4.705 ± 0.08
AlaHis: 1.097 ± 0.04
AlaIle: 6.077 ± 0.1
AlaLys: 4.65 ± 0.083
AlaLeu: 6.454 ± 0.086
AlaMet: 1.957 ± 0.048
AlaAsn: 2.841 ± 0.054
AlaPro: 1.848 ± 0.047
AlaGln: 1.878 ± 0.047
AlaArg: 2.423 ± 0.058
AlaSer: 3.703 ± 0.064
AlaThr: 3.384 ± 0.062
AlaVal: 4.745 ± 0.077
AlaTrp: 0.541 ± 0.024
AlaTyr: 2.216 ± 0.05
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.362 ± 0.022
CysCys: 0.071 ± 0.01
CysAsp: 0.34 ± 0.02
CysGlu: 0.401 ± 0.022
CysPhe: 0.285 ± 0.018
CysGly: 0.601 ± 0.03
CysHis: 0.204 ± 0.017
CysIle: 0.481 ± 0.025
CysLys: 0.363 ± 0.022
CysLeu: 0.543 ± 0.022
CysMet: 0.173 ± 0.014
CysAsn: 0.312 ± 0.02
CysPro: 0.304 ± 0.019
CysGln: 0.204 ± 0.015
CysArg: 0.215 ± 0.016
CysSer: 0.468 ± 0.021
CysThr: 0.373 ± 0.02
CysVal: 0.359 ± 0.018
CysTrp: 0.052 ± 0.006
CysTyr: 0.212 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.027 ± 0.054
AspCys: 0.321 ± 0.02
AspAsp: 2.72 ± 0.057
AspGlu: 4.624 ± 0.076
AspPhe: 2.537 ± 0.058
AspGly: 3.312 ± 0.061
AspHis: 1.134 ± 0.039
AspIle: 4.537 ± 0.079
AspLys: 3.64 ± 0.068
AspLeu: 4.998 ± 0.072
AspMet: 1.515 ± 0.04
AspAsn: 2.15 ± 0.052
AspPro: 1.936 ± 0.05
AspGln: 1.829 ± 0.051
AspArg: 2.12 ± 0.046
AspSer: 2.774 ± 0.051
AspThr: 2.564 ± 0.061
AspVal: 3.866 ± 0.067
AspTrp: 0.594 ± 0.027
AspTyr: 2.264 ± 0.052
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.002 ± 0.076
GluCys: 0.315 ± 0.021
GluAsp: 4.018 ± 0.074
GluGlu: 7.688 ± 0.121
GluPhe: 2.695 ± 0.065
GluGly: 4.22 ± 0.081
GluHis: 1.448 ± 0.038
GluIle: 6.33 ± 0.093
GluLys: 7.381 ± 0.095
GluLeu: 7.218 ± 0.11
GluMet: 2.359 ± 0.05
GluAsn: 4.361 ± 0.074
GluPro: 1.96 ± 0.045
GluGln: 3.137 ± 0.067
GluArg: 3.331 ± 0.066
GluSer: 3.712 ± 0.065
GluThr: 3.946 ± 0.069
GluVal: 5.381 ± 0.086
GluTrp: 0.806 ± 0.03
GluTyr: 2.434 ± 0.054
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.855 ± 0.059
PheCys: 0.319 ± 0.02
PheAsp: 2.42 ± 0.05
PheGlu: 2.765 ± 0.052
PhePhe: 2.34 ± 0.061
PheGly: 3.37 ± 0.072
PheHis: 1.05 ± 0.036
PheIle: 4.397 ± 0.092
PheLys: 2.497 ± 0.051
PheLeu: 4.598 ± 0.094
PheMet: 1.196 ± 0.039
PheAsn: 1.971 ± 0.051
PhePro: 1.684 ± 0.041
PheGln: 1.601 ± 0.044
PheArg: 1.5 ± 0.04
PheSer: 3.227 ± 0.062
PheThr: 2.564 ± 0.062
PheVal: 3.078 ± 0.065
PheTrp: 0.446 ± 0.025
PheTyr: 1.711 ± 0.045
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.641 ± 0.094
GlyCys: 0.541 ± 0.023
GlyAsp: 3.119 ± 0.052
GlyGlu: 4.41 ± 0.082
GlyPhe: 3.35 ± 0.057
GlyGly: 4.737 ± 0.097
GlyHis: 1.276 ± 0.035
GlyIle: 6.343 ± 0.102
GlyLys: 5.281 ± 0.089
GlyLeu: 6.403 ± 0.093
GlyMet: 2.112 ± 0.054
GlyAsn: 3.059 ± 0.056
GlyPro: 1.675 ± 0.047
GlyGln: 1.941 ± 0.045
GlyArg: 2.421 ± 0.053
GlySer: 3.801 ± 0.064
GlyThr: 3.946 ± 0.066
GlyVal: 5.01 ± 0.084
GlyTrp: 0.768 ± 0.029
GlyTyr: 2.785 ± 0.061
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.31 ± 0.039
HisCys: 0.169 ± 0.012
HisAsp: 1.058 ± 0.036
HisGlu: 1.413 ± 0.038
HisPhe: 1.072 ± 0.036
HisGly: 1.394 ± 0.042
HisHis: 0.621 ± 0.029
HisIle: 1.621 ± 0.041
HisLys: 1.004 ± 0.031
HisLeu: 1.976 ± 0.049
HisMet: 0.523 ± 0.024
HisAsn: 0.823 ± 0.03
HisPro: 1.03 ± 0.036
HisGln: 0.781 ± 0.027
HisArg: 0.789 ± 0.03
HisSer: 1.208 ± 0.038
HisThr: 1.013 ± 0.039
HisVal: 1.438 ± 0.043
HisTrp: 0.243 ± 0.015
HisTyr: 0.856 ± 0.034
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.987 ± 0.093
IleCys: 0.605 ± 0.026
IleAsp: 4.493 ± 0.075
IleGlu: 5.871 ± 0.074
IlePhe: 3.731 ± 0.087
IleGly: 6.443 ± 0.093
IleHis: 1.856 ± 0.045
IleIle: 7.142 ± 0.131
IleLys: 5.324 ± 0.09
IleLeu: 7.766 ± 0.108
IleMet: 2.033 ± 0.051
IleAsn: 3.902 ± 0.073
IlePro: 3.745 ± 0.074
IleGln: 3.104 ± 0.053
IleArg: 3.181 ± 0.055
IleSer: 5.606 ± 0.099
IleThr: 4.791 ± 0.075
IleVal: 5.986 ± 0.088
IleTrp: 0.679 ± 0.031
IleTyr: 2.769 ± 0.058
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.493 ± 0.072
LysCys: 0.277 ± 0.018
LysAsp: 4.289 ± 0.071
LysGlu: 7.865 ± 0.108
LysPhe: 2.004 ± 0.049
LysGly: 4.547 ± 0.08
LysHis: 1.405 ± 0.041
LysIle: 5.057 ± 0.073
LysLys: 6.22 ± 0.091
LysLeu: 6.0 ± 0.072
LysMet: 2.295 ± 0.05
LysAsn: 3.91 ± 0.067
LysPro: 2.16 ± 0.048
LysGln: 3.349 ± 0.074
LysArg: 3.462 ± 0.077
LysSer: 3.688 ± 0.066
LysThr: 3.567 ± 0.068
LysVal: 4.922 ± 0.078
LysTrp: 0.786 ± 0.026
LysTyr: 2.424 ± 0.048
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.437 ± 0.099
LeuCys: 0.532 ± 0.023
LeuAsp: 4.985 ± 0.08
LeuGlu: 6.905 ± 0.095
LeuPhe: 4.779 ± 0.099
LeuGly: 6.427 ± 0.098
LeuHis: 1.807 ± 0.041
LeuIle: 7.703 ± 0.12
LeuLys: 6.599 ± 0.093
LeuLeu: 9.336 ± 0.154
LeuMet: 2.408 ± 0.056
LeuAsn: 4.633 ± 0.071
LeuPro: 3.726 ± 0.068
LeuGln: 3.251 ± 0.059
LeuArg: 3.425 ± 0.063
LeuSer: 6.456 ± 0.087
LeuThr: 5.436 ± 0.083
LeuVal: 5.9 ± 0.088
LeuTrp: 0.684 ± 0.031
LeuTyr: 2.956 ± 0.062
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.871 ± 0.047
MetCys: 0.137 ± 0.012
MetAsp: 1.768 ± 0.047
MetGlu: 2.35 ± 0.049
MetPhe: 1.059 ± 0.04
MetGly: 1.924 ± 0.05
MetHis: 0.473 ± 0.023
MetIle: 2.247 ± 0.054
MetLys: 2.572 ± 0.047
MetLeu: 2.333 ± 0.055
MetMet: 0.858 ± 0.034
MetAsn: 1.705 ± 0.044
MetPro: 0.881 ± 0.031
MetGln: 0.862 ± 0.03
MetArg: 0.997 ± 0.033
MetSer: 1.682 ± 0.047
MetThr: 1.454 ± 0.037
MetVal: 1.921 ± 0.042
MetTrp: 0.208 ± 0.013
MetTyr: 0.791 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.756 ± 0.055
AsnCys: 0.328 ± 0.019
AsnAsp: 2.565 ± 0.059
AsnGlu: 4.067 ± 0.07
AsnPhe: 1.884 ± 0.049
AsnGly: 3.423 ± 0.066
AsnHis: 1.207 ± 0.038
AsnIle: 3.926 ± 0.068
AsnLys: 3.705 ± 0.078
AsnLeu: 4.005 ± 0.08
AsnMet: 1.364 ± 0.042
AsnAsn: 2.466 ± 0.066
AsnPro: 2.274 ± 0.053
AsnGln: 2.302 ± 0.054
AsnArg: 2.179 ± 0.052
AsnSer: 2.449 ± 0.05
AsnThr: 2.372 ± 0.055
AsnVal: 3.19 ± 0.059
AsnTrp: 0.569 ± 0.026
AsnTyr: 1.843 ± 0.048
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.945 ± 0.054
ProCys: 0.186 ± 0.015
ProAsp: 1.957 ± 0.058
ProGlu: 2.989 ± 0.065
ProPhe: 1.975 ± 0.046
ProGly: 2.175 ± 0.057
ProHis: 0.718 ± 0.03
ProIle: 3.196 ± 0.065
ProLys: 2.346 ± 0.058
ProLeu: 3.222 ± 0.06
ProMet: 0.852 ± 0.033
ProAsn: 1.993 ± 0.051
ProPro: 0.969 ± 0.038
ProGln: 0.948 ± 0.034
ProArg: 1.034 ± 0.035
ProSer: 2.129 ± 0.049
ProThr: 2.064 ± 0.048
ProVal: 2.595 ± 0.053
ProTrp: 0.337 ± 0.02
ProTyr: 1.442 ± 0.044
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.479 ± 0.052
GlnCys: 0.162 ± 0.013
GlnAsp: 1.651 ± 0.04
GlnGlu: 2.835 ± 0.06
GlnPhe: 1.558 ± 0.044
GlnGly: 2.054 ± 0.048
GlnHis: 0.684 ± 0.026
GlnIle: 2.823 ± 0.055
GlnLys: 2.6 ± 0.057
GlnLeu: 3.809 ± 0.071
GlnMet: 1.053 ± 0.028
GlnAsn: 1.638 ± 0.043
GlnPro: 1.111 ± 0.037
GlnGln: 1.535 ± 0.055
GlnArg: 1.43 ± 0.045
GlnSer: 2.121 ± 0.06
GlnThr: 1.911 ± 0.049
GlnVal: 2.467 ± 0.052
GlnTrp: 0.348 ± 0.019
GlnTyr: 1.173 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.143 ± 0.044
ArgCys: 0.237 ± 0.016
ArgAsp: 1.975 ± 0.052
ArgGlu: 3.189 ± 0.065
ArgPhe: 1.851 ± 0.046
ArgGly: 2.216 ± 0.044
ArgHis: 0.713 ± 0.025
ArgIle: 3.142 ± 0.055
ArgLys: 3.381 ± 0.067
ArgLeu: 3.671 ± 0.066
ArgMet: 1.302 ± 0.037
ArgAsn: 2.106 ± 0.047
ArgPro: 1.204 ± 0.04
ArgGln: 1.36 ± 0.042
ArgArg: 1.757 ± 0.051
ArgSer: 2.097 ± 0.043
ArgThr: 1.968 ± 0.045
ArgVal: 2.432 ± 0.051
ArgTrp: 0.353 ± 0.022
ArgTyr: 1.566 ± 0.045
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.259 ± 0.064
SerCys: 0.353 ± 0.02
SerAsp: 2.902 ± 0.056
SerGlu: 4.139 ± 0.078
SerPhe: 3.134 ± 0.061
SerGly: 4.283 ± 0.07
SerHis: 1.192 ± 0.04
SerIle: 5.499 ± 0.082
SerLys: 3.997 ± 0.063
SerLeu: 5.855 ± 0.088
SerMet: 1.691 ± 0.043
SerAsn: 2.979 ± 0.061
SerPro: 2.065 ± 0.051
SerGln: 1.847 ± 0.05
SerArg: 2.16 ± 0.045
SerSer: 3.803 ± 0.083
SerThr: 3.19 ± 0.064
SerVal: 3.952 ± 0.058
SerTrp: 0.588 ± 0.03
SerTyr: 2.295 ± 0.051
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.631 ± 0.071
ThrCys: 0.34 ± 0.017
ThrAsp: 2.818 ± 0.048
ThrGlu: 3.632 ± 0.072
ThrPhe: 2.706 ± 0.052
ThrGly: 4.0 ± 0.068
ThrHis: 1.022 ± 0.027
ThrIle: 4.903 ± 0.079
ThrLys: 3.567 ± 0.069
ThrLeu: 5.154 ± 0.064
ThrMet: 1.313 ± 0.036
ThrAsn: 2.674 ± 0.056
ThrPro: 2.234 ± 0.05
ThrGln: 1.386 ± 0.038
ThrArg: 1.679 ± 0.045
ThrSer: 3.237 ± 0.058
ThrThr: 3.024 ± 0.051
ThrVal: 4.071 ± 0.069
ThrTrp: 0.485 ± 0.024
ThrTyr: 1.986 ± 0.054
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.684 ± 0.08
ValCys: 0.514 ± 0.028
ValAsp: 3.802 ± 0.071
ValGlu: 5.114 ± 0.08
ValPhe: 3.151 ± 0.064
ValGly: 4.7 ± 0.091
ValHis: 1.315 ± 0.043
ValIle: 6.053 ± 0.085
ValLys: 4.706 ± 0.064
ValLeu: 6.538 ± 0.102
ValMet: 1.882 ± 0.045
ValAsn: 3.218 ± 0.062
ValPro: 2.565 ± 0.053
ValGln: 2.298 ± 0.049
ValArg: 2.556 ± 0.054
ValSer: 4.357 ± 0.075
ValThr: 3.924 ± 0.076
ValVal: 4.909 ± 0.075
ValTrp: 0.56 ± 0.027
ValTyr: 2.258 ± 0.053
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.52 ± 0.024
TrpCys: 0.08 ± 0.012
TrpAsp: 0.514 ± 0.026
TrpGlu: 0.665 ± 0.03
TrpPhe: 0.556 ± 0.027
TrpGly: 0.624 ± 0.034
TrpHis: 0.173 ± 0.014
TrpIle: 0.875 ± 0.033
TrpLys: 0.736 ± 0.033
TrpLeu: 1.013 ± 0.035
TrpMet: 0.319 ± 0.02
TrpAsn: 0.525 ± 0.024
TrpPro: 0.25 ± 0.018
TrpGln: 0.312 ± 0.02
TrpArg: 0.381 ± 0.022
TrpSer: 0.512 ± 0.023
TrpThr: 0.459 ± 0.02
TrpVal: 0.609 ± 0.023
TrpTrp: 0.132 ± 0.015
TrpTyr: 0.337 ± 0.021
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.983 ± 0.053
TyrCys: 0.263 ± 0.017
TyrAsp: 2.01 ± 0.054
TyrGlu: 2.694 ± 0.057
TyrPhe: 1.81 ± 0.051
TyrGly: 2.463 ± 0.052
TyrHis: 0.949 ± 0.035
TyrIle: 2.661 ± 0.059
TyrLys: 2.152 ± 0.05
TyrLeu: 3.493 ± 0.065
TyrMet: 0.886 ± 0.028
TyrAsn: 1.61 ± 0.048
TyrPro: 1.448 ± 0.041
TyrGln: 1.52 ± 0.046
TyrArg: 1.624 ± 0.042
TyrSer: 2.153 ± 0.049
TyrThr: 1.91 ± 0.041
TyrVal: 2.274 ± 0.047
TyrTrp: 0.413 ± 0.022
TyrTyr: 1.51 ± 0.049
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3231 proteins (926547 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
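A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. The function names and toy sequences are hypothetical, and reporting the standard deviation of the resampled frequencies is an assumption; the exact error measure behind the ± values is not stated here.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (hypothetical sequences, one-letter codes).
mean, err = bootstrap_error(["MKLLA", "MKV", "LLLL", "AKLM"], "LL")
print(f"LL: {mean:.1f} ± {err:.1f} per mille")
```

Resampling proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.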

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski