Amino acid dipepetide frequency for Tolumonas auensis (strain DSM 9187 / TA4)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.914AlaAla: 9.914 ± 0.15
1.118AlaCys: 1.118 ± 0.032
5.315AlaAsp: 5.315 ± 0.076
6.466AlaGlu: 6.466 ± 0.077
3.355AlaPhe: 3.355 ± 0.064
7.37AlaGly: 7.37 ± 0.1
1.926AlaHis: 1.926 ± 0.045
5.936AlaIle: 5.936 ± 0.087
4.429AlaLys: 4.429 ± 0.083
10.824AlaLeu: 10.824 ± 0.128
2.77AlaMet: 2.77 ± 0.046
3.173AlaAsn: 3.173 ± 0.06
3.292AlaPro: 3.292 ± 0.055
4.064AlaGln: 4.064 ± 0.072
4.717AlaArg: 4.717 ± 0.086
4.874AlaSer: 4.874 ± 0.082
4.709AlaThr: 4.709 ± 0.078
6.658AlaVal: 6.658 ± 0.08
1.166AlaTrp: 1.166 ± 0.033
2.436AlaTyr: 2.436 ± 0.053
0.0AlaXaa: 0.0 ± 0.0
Cys
0.918CysAla: 0.918 ± 0.026
0.208CysCys: 0.208 ± 0.016
0.634CysAsp: 0.634 ± 0.028
0.62CysGlu: 0.62 ± 0.025
0.514CysPhe: 0.514 ± 0.022
1.063CysGly: 1.063 ± 0.034
0.367CysHis: 0.367 ± 0.022
0.613CysIle: 0.613 ± 0.028
0.408CysLys: 0.408 ± 0.022
1.086CysLeu: 1.086 ± 0.035
0.238CysMet: 0.238 ± 0.014
0.376CysAsn: 0.376 ± 0.018
0.55CysPro: 0.55 ± 0.025
0.564CysGln: 0.564 ± 0.023
0.674CysArg: 0.674 ± 0.027
0.712CysSer: 0.712 ± 0.026
0.51CysThr: 0.51 ± 0.022
0.694CysVal: 0.694 ± 0.028
0.162CysTrp: 0.162 ± 0.013
0.37CysTyr: 0.37 ± 0.019
0.0CysXaa: 0.0 ± 0.0
Asp
4.877AspAla: 4.877 ± 0.074
0.593AspCys: 0.593 ± 0.024
2.922AspAsp: 2.922 ± 0.077
3.781AspGlu: 3.781 ± 0.064
2.152AspPhe: 2.152 ± 0.042
3.767AspGly: 3.767 ± 0.069
1.138AspHis: 1.138 ± 0.043
3.529AspIle: 3.529 ± 0.063
2.596AspLys: 2.596 ± 0.051
5.423AspLeu: 5.423 ± 0.074
1.36AspMet: 1.36 ± 0.033
2.03AspAsn: 2.03 ± 0.048
2.386AspPro: 2.386 ± 0.056
1.925AspGln: 1.925 ± 0.045
2.571AspArg: 2.571 ± 0.046
2.86AspSer: 2.86 ± 0.048
2.627AspThr: 2.627 ± 0.057
3.916AspVal: 3.916 ± 0.069
0.976AspTrp: 0.976 ± 0.033
1.867AspTyr: 1.867 ± 0.048
0.0AspXaa: 0.0 ± 0.0
Glu
5.102GluAla: 5.102 ± 0.078
0.572GluCys: 0.572 ± 0.026
2.431GluAsp: 2.431 ± 0.053
3.359GluGlu: 3.359 ± 0.075
2.114GluPhe: 2.114 ± 0.043
3.293GluGly: 3.293 ± 0.058
1.552GluHis: 1.552 ± 0.043
3.685GluIle: 3.685 ± 0.068
3.281GluLys: 3.281 ± 0.065
7.334GluLeu: 7.334 ± 0.085
1.688GluMet: 1.688 ± 0.044
2.33GluAsn: 2.33 ± 0.047
2.231GluPro: 2.231 ± 0.052
4.125GluGln: 4.125 ± 0.068
3.571GluArg: 3.571 ± 0.058
2.985GluSer: 2.985 ± 0.062
2.91GluThr: 2.91 ± 0.058
3.612GluVal: 3.612 ± 0.068
0.83GluTrp: 0.83 ± 0.029
1.699GluTyr: 1.699 ± 0.046
0.0GluXaa: 0.0 ± 0.0
Phe
3.567PheAla: 3.567 ± 0.066
0.539PheCys: 0.539 ± 0.023
2.387PheAsp: 2.387 ± 0.052
1.938PheGlu: 1.938 ± 0.042
1.655PhePhe: 1.655 ± 0.048
3.173PheGly: 3.173 ± 0.072
0.887PheHis: 0.887 ± 0.03
2.635PheIle: 2.635 ± 0.064
1.497PheLys: 1.497 ± 0.04
3.592PheLeu: 3.592 ± 0.066
1.027PheMet: 1.027 ± 0.035
1.717PheAsn: 1.717 ± 0.042
1.545PhePro: 1.545 ± 0.041
1.243PheGln: 1.243 ± 0.036
1.914PheArg: 1.914 ± 0.039
3.01PheSer: 3.01 ± 0.056
2.182PheThr: 2.182 ± 0.044
2.536PheVal: 2.536 ± 0.053
0.551PheTrp: 0.551 ± 0.022
1.241PheTyr: 1.241 ± 0.038
0.0PheXaa: 0.0 ± 0.0
Gly
5.631GlyAla: 5.631 ± 0.086
1.026GlyCys: 1.026 ± 0.032
3.563GlyAsp: 3.563 ± 0.067
4.168GlyGlu: 4.168 ± 0.067
3.183GlyPhe: 3.183 ± 0.057
5.043GlyGly: 5.043 ± 0.085
1.75GlyHis: 1.75 ± 0.042
5.155GlyIle: 5.155 ± 0.076
4.014GlyLys: 4.014 ± 0.059
7.454GlyLeu: 7.454 ± 0.087
2.217GlyMet: 2.217 ± 0.044
2.48GlyAsn: 2.48 ± 0.061
1.879GlyPro: 1.879 ± 0.042
2.936GlyGln: 2.936 ± 0.056
3.496GlyArg: 3.496 ± 0.061
4.208GlySer: 4.208 ± 0.066
3.536GlyThr: 3.536 ± 0.059
5.16GlyVal: 5.16 ± 0.079
1.12GlyTrp: 1.12 ± 0.035
2.56GlyTyr: 2.56 ± 0.049
0.0GlyXaa: 0.0 ± 0.0
His
1.993HisAla: 1.993 ± 0.042
0.328HisCys: 0.328 ± 0.021
1.277HisAsp: 1.277 ± 0.038
1.16HisGlu: 1.16 ± 0.036
1.099HisPhe: 1.099 ± 0.039
1.833HisGly: 1.833 ± 0.043
0.846HisHis: 0.846 ± 0.036
1.44HisIle: 1.44 ± 0.04
0.958HisLys: 0.958 ± 0.031
2.557HisLeu: 2.557 ± 0.055
0.542HisMet: 0.542 ± 0.023
0.843HisAsn: 0.843 ± 0.027
1.432HisPro: 1.432 ± 0.036
1.254HisGln: 1.254 ± 0.035
1.279HisArg: 1.279 ± 0.04
1.338HisSer: 1.338 ± 0.04
1.087HisThr: 1.087 ± 0.033
1.338HisVal: 1.338 ± 0.038
0.423HisTrp: 0.423 ± 0.026
0.882HisTyr: 0.882 ± 0.032
0.0HisXaa: 0.0 ± 0.0
Ile
6.52IleAla: 6.52 ± 0.08
0.82IleCys: 0.82 ± 0.033
3.694IleAsp: 3.694 ± 0.061
3.647IleGlu: 3.647 ± 0.07
2.045IlePhe: 2.045 ± 0.053
4.757IleGly: 4.757 ± 0.075
1.344IleHis: 1.344 ± 0.038
3.465IleIle: 3.465 ± 0.063
2.781IleLys: 2.781 ± 0.053
5.36IleLeu: 5.36 ± 0.083
1.398IleMet: 1.398 ± 0.038
2.563IleAsn: 2.563 ± 0.048
3.036IlePro: 3.036 ± 0.049
2.31IleGln: 2.31 ± 0.055
3.401IleArg: 3.401 ± 0.063
4.114IleSer: 4.114 ± 0.071
3.601IleThr: 3.601 ± 0.061
3.92IleVal: 3.92 ± 0.071
0.675IleTrp: 0.675 ± 0.029
1.747IleTyr: 1.747 ± 0.042
0.0IleXaa: 0.0 ± 0.0
Lys
4.579LysAla: 4.579 ± 0.074
0.306LysCys: 0.306 ± 0.017
2.291LysAsp: 2.291 ± 0.053
2.927LysGlu: 2.927 ± 0.059
1.364LysPhe: 1.364 ± 0.038
2.864LysGly: 2.864 ± 0.059
0.984LysHis: 0.984 ± 0.03
2.773LysIle: 2.773 ± 0.052
2.52LysLys: 2.52 ± 0.062
4.872LysLeu: 4.872 ± 0.078
1.278LysMet: 1.278 ± 0.032
1.876LysAsn: 1.876 ± 0.049
2.251LysPro: 2.251 ± 0.05
2.496LysGln: 2.496 ± 0.052
2.401LysArg: 2.401 ± 0.046
2.559LysSer: 2.559 ± 0.056
2.727LysThr: 2.727 ± 0.057
3.24LysVal: 3.24 ± 0.059
0.482LysTrp: 0.482 ± 0.024
1.234LysTyr: 1.234 ± 0.038
0.0LysXaa: 0.0 ± 0.0
Leu
11.125LeuAla: 11.125 ± 0.11
1.234LeuCys: 1.234 ± 0.036
5.858LeuAsp: 5.858 ± 0.077
5.754LeuGlu: 5.754 ± 0.084
4.474LeuPhe: 4.474 ± 0.086
7.106LeuGly: 7.106 ± 0.087
2.538LeuHis: 2.538 ± 0.058
6.394LeuIle: 6.394 ± 0.08
5.053LeuLys: 5.053 ± 0.069
12.855LeuLeu: 12.855 ± 0.175
2.863LeuMet: 2.863 ± 0.051
4.224LeuAsn: 4.224 ± 0.063
5.708LeuPro: 5.708 ± 0.088
5.314LeuGln: 5.314 ± 0.103
5.727LeuArg: 5.727 ± 0.08
7.565LeuSer: 7.565 ± 0.106
6.224LeuThr: 6.224 ± 0.089
6.936LeuVal: 6.936 ± 0.085
1.32LeuTrp: 1.32 ± 0.039
2.621LeuTyr: 2.621 ± 0.05
0.0LeuXaa: 0.0 ± 0.0
Met
2.71MetAla: 2.71 ± 0.059
0.216MetCys: 0.216 ± 0.015
1.238MetAsp: 1.238 ± 0.033
1.286MetGlu: 1.286 ± 0.043
0.831MetPhe: 0.831 ± 0.033
1.775MetGly: 1.775 ± 0.042
0.526MetHis: 0.526 ± 0.023
1.396MetIle: 1.396 ± 0.038
1.367MetLys: 1.367 ± 0.035
3.083MetLeu: 3.083 ± 0.063
0.772MetMet: 0.772 ± 0.029
1.101MetAsn: 1.101 ± 0.029
1.269MetPro: 1.269 ± 0.032
1.365MetGln: 1.365 ± 0.034
1.267MetArg: 1.267 ± 0.04
1.911MetSer: 1.911 ± 0.047
1.735MetThr: 1.735 ± 0.041
1.852MetVal: 1.852 ± 0.04
0.234MetTrp: 0.234 ± 0.016
0.5MetTyr: 0.5 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
3.316AsnAla: 3.316 ± 0.053
0.382AsnCys: 0.382 ± 0.021
2.043AsnAsp: 2.043 ± 0.052
2.092AsnGlu: 2.092 ± 0.05
1.248AsnPhe: 1.248 ± 0.038
2.874AsnGly: 2.874 ± 0.059
0.81AsnHis: 0.81 ± 0.027
2.43AsnIle: 2.43 ± 0.051
1.846AsnLys: 1.846 ± 0.048
3.499AsnLeu: 3.499 ± 0.051
0.927AsnMet: 0.927 ± 0.033
1.595AsnAsn: 1.595 ± 0.044
2.109AsnPro: 2.109 ± 0.049
1.797AsnGln: 1.797 ± 0.04
1.897AsnArg: 1.897 ± 0.049
2.028AsnSer: 2.028 ± 0.047
1.885AsnThr: 1.885 ± 0.05
2.286AsnVal: 2.286 ± 0.042
0.616AsnTrp: 0.616 ± 0.026
1.167AsnTyr: 1.167 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
4.647ProAla: 4.647 ± 0.075
0.406ProCys: 0.406 ± 0.021
2.891ProAsp: 2.891 ± 0.064
3.534ProGlu: 3.534 ± 0.06
1.824ProPhe: 1.824 ± 0.044
2.96ProGly: 2.96 ± 0.069
1.061ProHis: 1.061 ± 0.031
2.108ProIle: 2.108 ± 0.051
1.71ProLys: 1.71 ± 0.042
4.823ProLeu: 4.823 ± 0.094
1.086ProMet: 1.086 ± 0.035
1.39ProAsn: 1.39 ± 0.035
1.379ProPro: 1.379 ± 0.042
1.95ProGln: 1.95 ± 0.045
1.822ProArg: 1.822 ± 0.042
2.192ProSer: 2.192 ± 0.052
2.051ProThr: 2.051 ± 0.047
3.997ProVal: 3.997 ± 0.068
0.573ProTrp: 0.573 ± 0.024
1.302ProTyr: 1.302 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
4.789GlnAla: 4.789 ± 0.08
0.407GlnCys: 0.407 ± 0.023
2.022GlnAsp: 2.022 ± 0.049
2.463GlnGlu: 2.463 ± 0.052
1.803GlnPhe: 1.803 ± 0.041
2.959GlnGly: 2.959 ± 0.055
1.478GlnHis: 1.478 ± 0.042
2.883GlnIle: 2.883 ± 0.055
2.08GlnLys: 2.08 ± 0.052
5.978GlnLeu: 5.978 ± 0.114
1.137GlnMet: 1.137 ± 0.033
1.568GlnAsn: 1.568 ± 0.042
2.212GlnPro: 2.212 ± 0.049
3.973GlnGln: 3.973 ± 0.105
2.967GlnArg: 2.967 ± 0.055
2.571GlnSer: 2.571 ± 0.052
2.321GlnThr: 2.321 ± 0.047
3.048GlnVal: 3.048 ± 0.065
0.718GlnTrp: 0.718 ± 0.029
1.309GlnTyr: 1.309 ± 0.035
0.0GlnXaa: 0.0 ± 0.0
Arg
4.022ArgAla: 4.022 ± 0.072
0.593ArgCys: 0.593 ± 0.025
2.731ArgAsp: 2.731 ± 0.05
3.337ArgGlu: 3.337 ± 0.062
2.409ArgPhe: 2.409 ± 0.048
2.9ArgGly: 2.9 ± 0.052
1.508ArgHis: 1.508 ± 0.043
3.423ArgIle: 3.423 ± 0.057
2.408ArgLys: 2.408 ± 0.053
6.161ArgLeu: 6.161 ± 0.083
1.464ArgMet: 1.464 ± 0.044
1.914ArgAsn: 1.914 ± 0.046
2.036ArgPro: 2.036 ± 0.046
3.117ArgGln: 3.117 ± 0.067
3.032ArgArg: 3.032 ± 0.064
2.749ArgSer: 2.749 ± 0.048
2.425ArgThr: 2.425 ± 0.059
3.325ArgVal: 3.325 ± 0.068
0.855ArgTrp: 0.855 ± 0.033
2.06ArgTyr: 2.06 ± 0.042
0.0ArgXaa: 0.0 ± 0.0
Ser
5.737SerAla: 5.737 ± 0.079
0.679SerCys: 0.679 ± 0.028
3.278SerAsp: 3.278 ± 0.057
3.345SerGlu: 3.345 ± 0.059
2.458SerPhe: 2.458 ± 0.052
5.074SerGly: 5.074 ± 0.072
1.477SerHis: 1.477 ± 0.036
3.318SerIle: 3.318 ± 0.059
2.408SerLys: 2.408 ± 0.051
6.677SerLeu: 6.677 ± 0.093
1.55SerMet: 1.55 ± 0.039
1.995SerAsn: 1.995 ± 0.045
2.465SerPro: 2.465 ± 0.055
2.55SerGln: 2.55 ± 0.055
3.115SerArg: 3.115 ± 0.058
3.765SerSer: 3.765 ± 0.072
2.698SerThr: 2.698 ± 0.057
4.39SerVal: 4.39 ± 0.071
0.848SerTrp: 0.848 ± 0.03
1.69SerTyr: 1.69 ± 0.044
0.0SerXaa: 0.0 ± 0.0
Thr
4.996ThrAla: 4.996 ± 0.075
0.474ThrCys: 0.474 ± 0.02
2.867ThrAsp: 2.867 ± 0.067
3.097ThrGlu: 3.097 ± 0.061
1.863ThrPhe: 1.863 ± 0.041
4.34ThrGly: 4.34 ± 0.067
1.195ThrHis: 1.195 ± 0.033
3.052ThrIle: 3.052 ± 0.058
1.872ThrLys: 1.872 ± 0.052
6.241ThrLeu: 6.241 ± 0.079
1.077ThrMet: 1.077 ± 0.03
1.706ThrAsn: 1.706 ± 0.042
3.007ThrPro: 3.007 ± 0.056
2.13ThrGln: 2.13 ± 0.045
2.629ThrArg: 2.629 ± 0.062
2.993ThrSer: 2.993 ± 0.055
2.811ThrThr: 2.811 ± 0.063
3.74ThrVal: 3.74 ± 0.08
0.581ThrTrp: 0.581 ± 0.026
1.307ThrTyr: 1.307 ± 0.036
0.0ThrXaa: 0.0 ± 0.0
Val
6.763ValAla: 6.763 ± 0.089
0.802ValCys: 0.802 ± 0.028
3.672ValAsp: 3.672 ± 0.075
3.776ValGlu: 3.776 ± 0.071
2.615ValPhe: 2.615 ± 0.057
4.518ValGly: 4.518 ± 0.078
1.318ValHis: 1.318 ± 0.036
4.705ValIle: 4.705 ± 0.071
3.216ValLys: 3.216 ± 0.063
7.289ValLeu: 7.289 ± 0.099
2.021ValMet: 2.021 ± 0.044
2.533ValAsn: 2.533 ± 0.06
2.913ValPro: 2.913 ± 0.051
2.497ValGln: 2.497 ± 0.051
3.357ValArg: 3.357 ± 0.062
4.583ValSer: 4.583 ± 0.065
4.033ValThr: 4.033 ± 0.067
5.141ValVal: 5.141 ± 0.088
0.799ValTrp: 0.799 ± 0.028
1.824ValTyr: 1.824 ± 0.045
0.0ValXaa: 0.0 ± 0.0
Trp
0.86TrpAla: 0.86 ± 0.027
0.18TrpCys: 0.18 ± 0.013
0.61TrpAsp: 0.61 ± 0.025
0.538TrpGlu: 0.538 ± 0.024
0.625TrpPhe: 0.625 ± 0.024
0.796TrpGly: 0.796 ± 0.028
0.413TrpHis: 0.413 ± 0.023
0.752TrpIle: 0.752 ± 0.027
0.483TrpLys: 0.483 ± 0.023
2.332TrpLeu: 2.332 ± 0.06
0.379TrpMet: 0.379 ± 0.019
0.452TrpAsn: 0.452 ± 0.019
0.557TrpPro: 0.557 ± 0.025
1.188TrpGln: 1.188 ± 0.043
0.788TrpArg: 0.788 ± 0.028
0.757TrpSer: 0.757 ± 0.029
0.535TrpThr: 0.535 ± 0.021
0.803TrpVal: 0.803 ± 0.029
0.211TrpTrp: 0.211 ± 0.015
0.366TrpTyr: 0.366 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.449TyrAla: 2.449 ± 0.056
0.399TyrCys: 0.399 ± 0.02
1.57TyrAsp: 1.57 ± 0.042
1.416TyrGlu: 1.416 ± 0.043
1.207TyrPhe: 1.207 ± 0.034
2.114TyrGly: 2.114 ± 0.051
0.79TyrHis: 0.79 ± 0.032
1.511TyrIle: 1.511 ± 0.036
1.097TyrLys: 1.097 ± 0.033
3.456TyrLeu: 3.456 ± 0.066
0.636TyrMet: 0.636 ± 0.023
0.981TyrAsn: 0.981 ± 0.033
1.447TyrPro: 1.447 ± 0.039
1.884TyrGln: 1.884 ± 0.048
1.845TyrArg: 1.845 ± 0.041
1.739TyrSer: 1.739 ± 0.045
1.371TyrThr: 1.371 ± 0.04
1.785TyrVal: 1.785 ± 0.043
0.484TyrTrp: 0.484 ± 0.02
0.944TyrTyr: 0.944 ± 0.034
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3126 proteins (1023253 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski