Amino acid dipepetide frequency for Geovibrio thiophilus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.902AlaAla: 8.902 ± 0.133
1.063AlaCys: 1.063 ± 0.045
5.589AlaAsp: 5.589 ± 0.098
7.461AlaGlu: 7.461 ± 0.123
3.665AlaPhe: 3.665 ± 0.072
7.218AlaGly: 7.218 ± 0.094
1.325AlaHis: 1.325 ± 0.04
3.951AlaIle: 3.951 ± 0.071
4.861AlaLys: 4.861 ± 0.076
7.641AlaLeu: 7.641 ± 0.103
2.154AlaMet: 2.154 ± 0.043
2.322AlaAsn: 2.322 ± 0.055
2.405AlaPro: 2.405 ± 0.061
1.928AlaGln: 1.928 ± 0.046
3.207AlaArg: 3.207 ± 0.07
4.858AlaSer: 4.858 ± 0.092
2.895AlaThr: 2.895 ± 0.075
7.964AlaVal: 7.964 ± 0.117
0.545AlaTrp: 0.545 ± 0.022
2.581AlaTyr: 2.581 ± 0.056
0.0AlaXaa: 0.0 ± 0.0
Cys
0.996CysAla: 0.996 ± 0.037
0.182CysCys: 0.182 ± 0.015
0.543CysAsp: 0.543 ± 0.026
0.698CysGlu: 0.698 ± 0.028
0.541CysPhe: 0.541 ± 0.029
1.345CysGly: 1.345 ± 0.042
0.461CysHis: 0.461 ± 0.055
0.678CysIle: 0.678 ± 0.032
0.489CysLys: 0.489 ± 0.024
0.886CysLeu: 0.886 ± 0.035
0.301CysMet: 0.301 ± 0.019
0.369CysAsn: 0.369 ± 0.021
0.622CysPro: 0.622 ± 0.034
0.205CysGln: 0.205 ± 0.017
0.677CysArg: 0.677 ± 0.029
0.811CysSer: 0.811 ± 0.032
0.626CysThr: 0.626 ± 0.032
0.857CysVal: 0.857 ± 0.035
0.09CysTrp: 0.09 ± 0.011
0.304CysTyr: 0.304 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
4.195AspAla: 4.195 ± 0.075
0.661AspCys: 0.661 ± 0.032
3.004AspAsp: 3.004 ± 0.076
4.365AspGlu: 4.365 ± 0.079
3.017AspPhe: 3.017 ± 0.054
4.183AspGly: 4.183 ± 0.103
0.806AspHis: 0.806 ± 0.032
4.872AspIle: 4.872 ± 0.077
3.846AspLys: 3.846 ± 0.074
4.256AspLeu: 4.256 ± 0.068
1.678AspMet: 1.678 ± 0.045
2.234AspAsn: 2.234 ± 0.049
1.96AspPro: 1.96 ± 0.049
0.878AspGln: 0.878 ± 0.03
2.737AspArg: 2.737 ± 0.076
3.528AspSer: 3.528 ± 0.074
2.963AspThr: 2.963 ± 0.066
3.79AspVal: 3.79 ± 0.081
0.521AspTrp: 0.521 ± 0.025
2.333AspTyr: 2.333 ± 0.057
0.0AspXaa: 0.0 ± 0.0
Glu
5.564GluAla: 5.564 ± 0.09
0.65GluCys: 0.65 ± 0.029
3.644GluAsp: 3.644 ± 0.078
5.384GluGlu: 5.384 ± 0.1
2.827GluPhe: 2.827 ± 0.053
4.408GluGly: 4.408 ± 0.067
1.368GluHis: 1.368 ± 0.052
5.913GluIle: 5.913 ± 0.094
6.659GluLys: 6.659 ± 0.105
6.796GluLeu: 6.796 ± 0.105
2.133GluMet: 2.133 ± 0.047
3.841GluAsn: 3.841 ± 0.07
2.032GluPro: 2.032 ± 0.055
2.155GluGln: 2.155 ± 0.053
3.539GluArg: 3.539 ± 0.064
3.933GluSer: 3.933 ± 0.079
4.137GluThr: 4.137 ± 0.072
4.278GluVal: 4.278 ± 0.085
0.547GluTrp: 0.547 ± 0.026
2.594GluTyr: 2.594 ± 0.064
0.0GluXaa: 0.0 ± 0.0
Phe
3.786PheAla: 3.786 ± 0.063
0.595PheCys: 0.595 ± 0.028
2.759PheAsp: 2.759 ± 0.059
3.009PheGlu: 3.009 ± 0.056
2.657PhePhe: 2.657 ± 0.071
3.772PheGly: 3.772 ± 0.068
0.771PheHis: 0.771 ± 0.026
3.651PheIle: 3.651 ± 0.074
2.382PheLys: 2.382 ± 0.048
4.449PheLeu: 4.449 ± 0.075
1.497PheMet: 1.497 ± 0.043
1.822PheAsn: 1.822 ± 0.046
1.647PhePro: 1.647 ± 0.048
0.993PheGln: 0.993 ± 0.035
2.587PheArg: 2.587 ± 0.053
3.77PheSer: 3.77 ± 0.076
2.916PheThr: 2.916 ± 0.061
3.011PheVal: 3.011 ± 0.057
0.401PheTrp: 0.401 ± 0.025
1.767PheTyr: 1.767 ± 0.052
0.0PheXaa: 0.0 ± 0.0
Gly
5.568GlyAla: 5.568 ± 0.094
1.153GlyCys: 1.153 ± 0.038
3.8GlyAsp: 3.8 ± 0.08
4.983GlyGlu: 4.983 ± 0.078
3.971GlyPhe: 3.971 ± 0.071
5.933GlyGly: 5.933 ± 0.109
1.288GlyHis: 1.288 ± 0.038
5.93GlyIle: 5.93 ± 0.092
5.149GlyLys: 5.149 ± 0.078
6.562GlyLeu: 6.562 ± 0.092
2.334GlyMet: 2.334 ± 0.052
2.686GlyAsn: 2.686 ± 0.063
1.123GlyPro: 1.123 ± 0.034
1.721GlyGln: 1.721 ± 0.042
4.151GlyArg: 4.151 ± 0.064
4.788GlySer: 4.788 ± 0.077
4.265GlyThr: 4.265 ± 0.116
5.456GlyVal: 5.456 ± 0.084
0.696GlyTrp: 0.696 ± 0.03
2.865GlyTyr: 2.865 ± 0.053
0.0GlyXaa: 0.0 ± 0.0
His
1.284HisAla: 1.284 ± 0.038
0.234HisCys: 0.234 ± 0.017
0.846HisAsp: 0.846 ± 0.035
1.127HisGlu: 1.127 ± 0.038
0.902HisPhe: 0.902 ± 0.039
1.414HisGly: 1.414 ± 0.045
0.363HisHis: 0.363 ± 0.021
1.36HisIle: 1.36 ± 0.039
1.075HisLys: 1.075 ± 0.037
1.441HisLeu: 1.441 ± 0.039
0.512HisMet: 0.512 ± 0.025
0.672HisAsn: 0.672 ± 0.028
0.883HisPro: 0.883 ± 0.031
0.393HisGln: 0.393 ± 0.021
0.895HisArg: 0.895 ± 0.032
1.171HisSer: 1.171 ± 0.043
0.941HisThr: 0.941 ± 0.034
1.015HisVal: 1.015 ± 0.035
0.135HisTrp: 0.135 ± 0.014
0.623HisTyr: 0.623 ± 0.029
0.0HisXaa: 0.0 ± 0.0
Ile
5.733IleAla: 5.733 ± 0.087
0.873IleCys: 0.873 ± 0.038
3.984IleAsp: 3.984 ± 0.06
4.829IleGlu: 4.829 ± 0.089
3.249IlePhe: 3.249 ± 0.074
4.779IleGly: 4.779 ± 0.072
1.164IleHis: 1.164 ± 0.045
5.548IleIle: 5.548 ± 0.098
4.871IleLys: 4.871 ± 0.076
6.193IleLeu: 6.193 ± 0.095
1.824IleMet: 1.824 ± 0.047
3.33IleAsn: 3.33 ± 0.065
2.864IlePro: 2.864 ± 0.061
1.662IleGln: 1.662 ± 0.045
3.458IleArg: 3.458 ± 0.061
5.212IleSer: 5.212 ± 0.088
4.46IleThr: 4.46 ± 0.083
4.412IleVal: 4.412 ± 0.07
0.482IleTrp: 0.482 ± 0.024
2.415IleTyr: 2.415 ± 0.059
0.0IleXaa: 0.0 ± 0.0
Lys
5.39LysAla: 5.39 ± 0.085
0.722LysCys: 0.722 ± 0.034
3.869LysAsp: 3.869 ± 0.082
5.167LysGlu: 5.167 ± 0.089
2.42LysPhe: 2.42 ± 0.049
4.721LysGly: 4.721 ± 0.074
1.155LysHis: 1.155 ± 0.04
4.9LysIle: 4.9 ± 0.077
5.49LysLys: 5.49 ± 0.094
5.735LysLeu: 5.735 ± 0.085
1.934LysMet: 1.934 ± 0.047
3.5LysAsn: 3.5 ± 0.075
2.522LysPro: 2.522 ± 0.053
1.782LysGln: 1.782 ± 0.052
3.106LysArg: 3.106 ± 0.056
4.071LysSer: 4.071 ± 0.073
4.156LysThr: 4.156 ± 0.069
4.1LysVal: 4.1 ± 0.077
0.48LysTrp: 0.48 ± 0.024
2.471LysTyr: 2.471 ± 0.056
0.0LysXaa: 0.0 ± 0.0
Leu
7.494LeuAla: 7.494 ± 0.105
1.023LeuCys: 1.023 ± 0.035
4.756LeuAsp: 4.756 ± 0.068
6.296LeuGlu: 6.296 ± 0.105
4.374LeuPhe: 4.374 ± 0.091
6.297LeuGly: 6.297 ± 0.103
1.623LeuHis: 1.623 ± 0.047
6.146LeuIle: 6.146 ± 0.107
6.894LeuLys: 6.894 ± 0.094
8.302LeuLeu: 8.302 ± 0.131
2.582LeuMet: 2.582 ± 0.048
3.972LeuAsn: 3.972 ± 0.067
3.561LeuPro: 3.561 ± 0.068
1.909LeuGln: 1.909 ± 0.048
4.557LeuArg: 4.557 ± 0.085
6.702LeuSer: 6.702 ± 0.11
5.378LeuThr: 5.378 ± 0.083
5.355LeuVal: 5.355 ± 0.08
0.61LeuTrp: 0.61 ± 0.029
2.873LeuTyr: 2.873 ± 0.062
0.0LeuXaa: 0.0 ± 0.0
Met
2.093MetAla: 2.093 ± 0.051
0.262MetCys: 0.262 ± 0.017
1.499MetAsp: 1.499 ± 0.045
1.872MetGlu: 1.872 ± 0.049
1.218MetPhe: 1.218 ± 0.037
1.954MetGly: 1.954 ± 0.043
0.44MetHis: 0.44 ± 0.024
1.849MetIle: 1.849 ± 0.044
2.533MetLys: 2.533 ± 0.042
2.888MetLeu: 2.888 ± 0.065
0.77MetMet: 0.77 ± 0.03
1.496MetAsn: 1.496 ± 0.043
1.301MetPro: 1.301 ± 0.046
0.744MetGln: 0.744 ± 0.031
1.486MetArg: 1.486 ± 0.043
1.809MetSer: 1.809 ± 0.047
1.628MetThr: 1.628 ± 0.043
1.598MetVal: 1.598 ± 0.041
0.164MetTrp: 0.164 ± 0.014
0.77MetTyr: 0.77 ± 0.031
0.0MetXaa: 0.0 ± 0.0
Asn
3.171AsnAla: 3.171 ± 0.064
0.485AsnCys: 0.485 ± 0.023
2.016AsnAsp: 2.016 ± 0.062
2.521AsnGlu: 2.521 ± 0.053
1.913AsnPhe: 1.913 ± 0.05
3.193AsnGly: 3.193 ± 0.092
0.589AsnHis: 0.589 ± 0.026
3.677AsnIle: 3.677 ± 0.079
2.312AsnLys: 2.312 ± 0.057
3.775AsnLeu: 3.775 ± 0.072
1.254AsnMet: 1.254 ± 0.037
1.528AsnAsn: 1.528 ± 0.049
1.969AsnPro: 1.969 ± 0.048
0.796AsnGln: 0.796 ± 0.034
2.064AsnArg: 2.064 ± 0.052
2.577AsnSer: 2.577 ± 0.048
2.291AsnThr: 2.291 ± 0.051
2.768AsnVal: 2.768 ± 0.057
0.306AsnTrp: 0.306 ± 0.02
1.389AsnTyr: 1.389 ± 0.037
0.0AsnXaa: 0.0 ± 0.0
Pro
3.328ProAla: 3.328 ± 0.069
0.36ProCys: 0.36 ± 0.022
2.49ProAsp: 2.49 ± 0.057
3.572ProGlu: 3.572 ± 0.067
1.865ProPhe: 1.865 ± 0.047
2.002ProGly: 2.002 ± 0.057
0.678ProHis: 0.678 ± 0.032
1.672ProIle: 1.672 ± 0.039
1.87ProLys: 1.87 ± 0.05
3.104ProLeu: 3.104 ± 0.063
0.834ProMet: 0.834 ± 0.032
1.067ProAsn: 1.067 ± 0.04
1.049ProPro: 1.049 ± 0.044
1.058ProGln: 1.058 ± 0.056
1.012ProArg: 1.012 ± 0.039
2.295ProSer: 2.295 ± 0.054
1.377ProThr: 1.377 ± 0.038
3.541ProVal: 3.541 ± 0.068
0.314ProTrp: 0.314 ± 0.017
1.325ProTyr: 1.325 ± 0.04
0.0ProXaa: 0.0 ± 0.0
Gln
2.023GlnAla: 2.023 ± 0.055
0.214GlnCys: 0.214 ± 0.015
1.167GlnAsp: 1.167 ± 0.036
1.662GlnGlu: 1.662 ± 0.048
0.976GlnPhe: 0.976 ± 0.036
1.721GlnGly: 1.721 ± 0.059
0.396GlnHis: 0.396 ± 0.021
1.801GlnIle: 1.801 ± 0.052
1.827GlnLys: 1.827 ± 0.046
1.963GlnLeu: 1.963 ± 0.049
0.74GlnMet: 0.74 ± 0.027
1.069GlnAsn: 1.069 ± 0.036
0.818GlnPro: 0.818 ± 0.03
0.76GlnGln: 0.76 ± 0.032
1.149GlnArg: 1.149 ± 0.04
1.42GlnSer: 1.42 ± 0.046
1.44GlnThr: 1.44 ± 0.039
1.521GlnVal: 1.521 ± 0.041
0.222GlnTrp: 0.222 ± 0.017
0.796GlnTyr: 0.796 ± 0.033
0.0GlnXaa: 0.0 ± 0.0
Arg
3.211ArgAla: 3.211 ± 0.059
0.477ArgCys: 0.477 ± 0.026
2.463ArgAsp: 2.463 ± 0.058
3.708ArgGlu: 3.708 ± 0.074
2.476ArgPhe: 2.476 ± 0.06
2.763ArgGly: 2.763 ± 0.057
0.936ArgHis: 0.936 ± 0.037
3.883ArgIle: 3.883 ± 0.073
3.563ArgLys: 3.563 ± 0.068
5.14ArgLeu: 5.14 ± 0.088
1.539ArgMet: 1.539 ± 0.034
2.148ArgAsn: 2.148 ± 0.05
1.447ArgPro: 1.447 ± 0.065
1.426ArgGln: 1.426 ± 0.041
2.646ArgArg: 2.646 ± 0.056
2.618ArgSer: 2.618 ± 0.047
2.507ArgThr: 2.507 ± 0.054
2.959ArgVal: 2.959 ± 0.064
0.378ArgTrp: 0.378 ± 0.02
1.773ArgTyr: 1.773 ± 0.038
0.0ArgXaa: 0.0 ± 0.0
Ser
5.852SerAla: 5.852 ± 0.098
0.794SerCys: 0.794 ± 0.035
3.507SerAsp: 3.507 ± 0.078
4.388SerGlu: 4.388 ± 0.08
3.601SerPhe: 3.601 ± 0.064
6.105SerGly: 6.105 ± 0.084
1.018SerHis: 1.018 ± 0.034
4.078SerIle: 4.078 ± 0.081
3.46SerLys: 3.46 ± 0.067
6.112SerLeu: 6.112 ± 0.086
1.849SerMet: 1.849 ± 0.05
1.964SerAsn: 1.964 ± 0.05
2.139SerPro: 2.139 ± 0.048
1.357SerGln: 1.357 ± 0.043
3.036SerArg: 3.036 ± 0.06
4.472SerSer: 4.472 ± 0.103
2.817SerThr: 2.817 ± 0.074
5.675SerVal: 5.675 ± 0.087
0.556SerTrp: 0.556 ± 0.029
2.37SerTyr: 2.37 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
5.64ThrAla: 5.64 ± 0.101
0.521ThrCys: 0.521 ± 0.029
3.887ThrAsp: 3.887 ± 0.075
4.195ThrGlu: 4.195 ± 0.071
2.345ThrPhe: 2.345 ± 0.05
5.456ThrGly: 5.456 ± 0.093
0.941ThrHis: 0.941 ± 0.032
2.963ThrIle: 2.963 ± 0.062
2.793ThrLys: 2.793 ± 0.052
4.74ThrLeu: 4.74 ± 0.075
1.179ThrMet: 1.179 ± 0.035
1.613ThrAsn: 1.613 ± 0.048
2.306ThrPro: 2.306 ± 0.06
1.088ThrGln: 1.088 ± 0.039
1.865ThrArg: 1.865 ± 0.046
2.917ThrSer: 2.917 ± 0.062
2.353ThrThr: 2.353 ± 0.071
4.947ThrVal: 4.947 ± 0.101
0.361ThrTrp: 0.361 ± 0.021
1.767ThrTyr: 1.767 ± 0.057
0.0ThrXaa: 0.0 ± 0.0
Val
4.904ValAla: 4.904 ± 0.077
0.945ValCys: 0.945 ± 0.033
3.454ValAsp: 3.454 ± 0.067
4.251ValGlu: 4.251 ± 0.077
3.983ValPhe: 3.983 ± 0.071
3.895ValGly: 3.895 ± 0.076
1.181ValHis: 1.181 ± 0.033
5.505ValIle: 5.505 ± 0.074
5.066ValLys: 5.066 ± 0.076
6.756ValLeu: 6.756 ± 0.099
2.108ValMet: 2.108 ± 0.049
3.157ValAsn: 3.157 ± 0.063
2.627ValPro: 2.627 ± 0.059
1.796ValGln: 1.796 ± 0.044
3.59ValArg: 3.59 ± 0.063
5.212ValSer: 5.212 ± 0.084
4.346ValThr: 4.346 ± 0.1
4.555ValVal: 4.555 ± 0.084
0.499ValTrp: 0.499 ± 0.025
2.573ValTyr: 2.573 ± 0.055
0.0ValXaa: 0.0 ± 0.0
Trp
0.596TrpAla: 0.596 ± 0.03
0.081TrpCys: 0.081 ± 0.009
0.425TrpAsp: 0.425 ± 0.021
0.524TrpGlu: 0.524 ± 0.028
0.395TrpPhe: 0.395 ± 0.02
0.536TrpGly: 0.536 ± 0.026
0.2TrpHis: 0.2 ± 0.016
0.493TrpIle: 0.493 ± 0.026
0.512TrpLys: 0.512 ± 0.027
0.822TrpLeu: 0.822 ± 0.034
0.215TrpMet: 0.215 ± 0.016
0.39TrpAsn: 0.39 ± 0.022
0.149TrpPro: 0.149 ± 0.014
0.255TrpGln: 0.255 ± 0.019
0.384TrpArg: 0.384 ± 0.021
0.47TrpSer: 0.47 ± 0.021
0.388TrpThr: 0.388 ± 0.025
0.481TrpVal: 0.481 ± 0.024
0.107TrpTrp: 0.107 ± 0.011
0.273TrpTyr: 0.273 ± 0.02
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.804TyrAla: 2.804 ± 0.059
0.388TyrCys: 0.388 ± 0.024
2.124TyrAsp: 2.124 ± 0.054
2.391TyrGlu: 2.391 ± 0.056
1.847TyrPhe: 1.847 ± 0.045
2.725TyrGly: 2.725 ± 0.061
0.583TyrHis: 0.583 ± 0.026
2.472TyrIle: 2.472 ± 0.047
2.038TyrLys: 2.038 ± 0.048
3.142TyrLeu: 3.142 ± 0.065
0.936TyrMet: 0.936 ± 0.034
1.416TyrAsn: 1.416 ± 0.045
1.39TyrPro: 1.39 ± 0.039
0.781TyrGln: 0.781 ± 0.028
1.915TyrArg: 1.915 ± 0.046
2.515TyrSer: 2.515 ± 0.056
1.994TyrThr: 1.994 ± 0.057
2.118TyrVal: 2.118 ± 0.05
0.282TyrTrp: 0.282 ± 0.02
1.361TyrTyr: 1.361 ± 0.042
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2687 proteins (900581 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski