Amino acid dipeptide frequency for Tenacibaculum sp. MAR_2009_124

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25%. As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions. On this scale the uniform expectation of 0.25% corresponds to 2.5‰.
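
The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the database: it assumes that dipeptides are counted as overlapping pairs within each protein, that any non-standard letter is mapped to Xaa (written X), and that the reference value is the uniform expectation of 2.5‰. The names dipeptide_per_mille and classify are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # one-letter codes of the 20 standard amino acids
ALPHABET = STANDARD + "X"           # X stands for Xaa (non-standard or unknown)
UNIFORM_PER_MILLE = 1000.0 / 400    # 0.25% expressed in per mille, i.e. 2.5


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: every letter outside the 20 standard ones becomes X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Assumption: overlapping pairs, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {a + b: 0.0 for a, b in product(ALPHABET, repeat=2)}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(ALPHABET, repeat=2)}


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value relative to the uniform expectation (red/blue in the table)."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    toy = ["MKKLLE", "MLLKKA"]      # toy sequences, not taken from the proteome
    freqs = dipeptide_per_mille(toy)
    for pair in ("KK", "LL", "WW"):
        print(pair, round(freqs[pair], 3), classify(freqs[pair]))
```

Applied to the table, a value such as LeuLeu (8.549‰) would be labelled as overrepresented and CysCys (0.124‰) as underrepresented under this uniform reference.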

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.118 ± 0.07
AlaCys: 0.503 ± 0.019
AlaAsp: 2.917 ± 0.094
AlaGlu: 3.101 ± 0.055
AlaPhe: 2.766 ± 0.054
AlaGly: 3.548 ± 0.084
AlaHis: 0.912 ± 0.028
AlaIle: 4.807 ± 0.068
AlaLys: 4.143 ± 0.078
AlaLeu: 5.049 ± 0.087
AlaMet: 1.124 ± 0.032
AlaAsn: 3.36 ± 0.072
AlaPro: 1.782 ± 0.058
AlaGln: 1.902 ± 0.039
AlaArg: 1.644 ± 0.042
AlaSer: 4.084 ± 0.096
AlaThr: 3.426 ± 0.11
AlaVal: 3.346 ± 0.063
AlaTrp: 0.501 ± 0.021
AlaTyr: 2.207 ± 0.05
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.398 ± 0.026
CysCys: 0.124 ± 0.01
CysAsp: 0.606 ± 0.063
CysGlu: 0.421 ± 0.019
CysPhe: 0.459 ± 0.019
CysGly: 0.572 ± 0.025
CysHis: 0.17 ± 0.013
CysIle: 0.63 ± 0.027
CysLys: 0.515 ± 0.02
CysLeu: 0.661 ± 0.029
CysMet: 0.151 ± 0.011
CysAsn: 0.508 ± 0.032
CysPro: 0.304 ± 0.018
CysGln: 0.203 ± 0.013
CysArg: 0.2 ± 0.012
CysSer: 0.767 ± 0.054
CysThr: 0.436 ± 0.031
CysVal: 0.475 ± 0.03
CysTrp: 0.079 ± 0.008
CysTyr: 0.323 ± 0.016
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.195 ± 0.089
AspCys: 0.438 ± 0.021
AspAsp: 2.9 ± 0.074
AspGlu: 3.409 ± 0.055
AspPhe: 3.498 ± 0.056
AspGly: 4.403 ± 0.357
AspHis: 0.884 ± 0.029
AspIle: 4.635 ± 0.073
AspLys: 4.09 ± 0.064
AspLeu: 5.35 ± 0.085
AspMet: 0.888 ± 0.023
AspAsn: 3.632 ± 0.1
AspPro: 1.812 ± 0.144
AspGln: 1.483 ± 0.042
AspArg: 1.799 ± 0.046
AspSer: 3.68 ± 0.118
AspThr: 3.2 ± 0.141
AspVal: 3.693 ± 0.076
AspTrp: 0.655 ± 0.022
AspTyr: 2.732 ± 0.05
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.726 ± 0.071
GluCys: 0.353 ± 0.017
GluAsp: 3.391 ± 0.054
GluGlu: 5.27 ± 0.111
GluPhe: 3.049 ± 0.053
GluGly: 3.83 ± 0.062
GluHis: 1.155 ± 0.033
GluIle: 5.619 ± 0.076
GluLys: 6.241 ± 0.105
GluLeu: 6.749 ± 0.152
GluMet: 1.401 ± 0.033
GluAsn: 5.03 ± 0.076
GluPro: 1.472 ± 0.039
GluGln: 2.163 ± 0.048
GluArg: 2.43 ± 0.049
GluSer: 3.703 ± 0.058
GluThr: 3.657 ± 0.076
GluVal: 4.489 ± 0.062
GluTrp: 0.566 ± 0.02
GluTyr: 2.675 ± 0.054
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.478 ± 0.053
PheCys: 0.404 ± 0.018
PheAsp: 3.144 ± 0.053
PheGlu: 3.362 ± 0.044
PhePhe: 2.746 ± 0.057
PheGly: 3.355 ± 0.05
PheHis: 0.88 ± 0.028
PheIle: 4.094 ± 0.067
PheLys: 3.941 ± 0.064
PheLeu: 4.549 ± 0.089
PheMet: 1.015 ± 0.028
PheAsn: 3.648 ± 0.057
PhePro: 1.5 ± 0.034
PheGln: 1.492 ± 0.037
PheArg: 1.595 ± 0.039
PheSer: 4.318 ± 0.058
PheThr: 3.164 ± 0.061
PheVal: 2.988 ± 0.048
PheTrp: 0.561 ± 0.022
PheTyr: 2.236 ± 0.049
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.834 ± 0.077
GlyCys: 0.664 ± 0.054
GlyAsp: 3.7 ± 0.18
GlyGlu: 3.621 ± 0.064
GlyPhe: 3.397 ± 0.057
GlyGly: 4.531 ± 0.095
GlyHis: 1.058 ± 0.034
GlyIle: 5.325 ± 0.093
GlyLys: 4.846 ± 0.076
GlyLeu: 5.203 ± 0.071
GlyMet: 1.367 ± 0.033
GlyAsn: 4.114 ± 0.072
GlyPro: 1.142 ± 0.033
GlyGln: 1.707 ± 0.043
GlyArg: 2.008 ± 0.048
GlySer: 4.21 ± 0.112
GlyThr: 4.134 ± 0.128
GlyVal: 4.804 ± 0.108
GlyTrp: 0.697 ± 0.022
GlyTyr: 2.833 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.823 ± 0.025
HisCys: 0.168 ± 0.012
HisAsp: 0.886 ± 0.039
HisGlu: 0.992 ± 0.031
HisPhe: 1.029 ± 0.028
HisGly: 0.979 ± 0.023
HisHis: 0.474 ± 0.023
HisIle: 1.381 ± 0.035
HisLys: 1.332 ± 0.036
HisLeu: 1.783 ± 0.035
HisMet: 0.298 ± 0.015
HisAsn: 1.04 ± 0.031
HisPro: 0.755 ± 0.025
HisGln: 0.694 ± 0.022
HisArg: 0.604 ± 0.022
HisSer: 1.155 ± 0.034
HisThr: 0.901 ± 0.026
HisVal: 0.914 ± 0.027
HisTrp: 0.189 ± 0.012
HisTyr: 0.849 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.88 ± 0.065
IleCys: 0.73 ± 0.032
IleAsp: 5.337 ± 0.091
IleGlu: 6.05 ± 0.104
IlePhe: 3.457 ± 0.062
IleGly: 5.016 ± 0.069
IleHis: 1.438 ± 0.036
IleIle: 6.031 ± 0.105
IleLys: 6.261 ± 0.104
IleLeu: 6.762 ± 0.084
IleMet: 1.257 ± 0.032
IleAsn: 5.376 ± 0.077
IlePro: 3.261 ± 0.061
IleGln: 2.615 ± 0.046
IleArg: 2.48 ± 0.046
IleSer: 6.137 ± 0.073
IleThr: 5.127 ± 0.122
IleVal: 4.716 ± 0.064
IleTrp: 0.736 ± 0.024
IleTyr: 2.884 ± 0.046
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.39 ± 0.078
LysCys: 0.379 ± 0.016
LysAsp: 4.325 ± 0.073
LysGlu: 7.458 ± 0.131
LysPhe: 3.026 ± 0.059
LysGly: 4.676 ± 0.078
LysHis: 1.467 ± 0.046
LysIle: 6.38 ± 0.109
LysLys: 8.028 ± 0.154
LysLeu: 6.95 ± 0.102
LysMet: 1.797 ± 0.042
LysAsn: 5.89 ± 0.104
LysPro: 2.231 ± 0.04
LysGln: 2.781 ± 0.056
LysArg: 3.013 ± 0.058
LysSer: 5.025 ± 0.075
LysThr: 4.818 ± 0.081
LysVal: 4.831 ± 0.069
LysTrp: 0.806 ± 0.024
LysTyr: 3.178 ± 0.065
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.928 ± 0.069
LeuCys: 0.645 ± 0.023
LeuAsp: 5.021 ± 0.107
LeuGlu: 5.984 ± 0.093
LeuPhe: 4.859 ± 0.093
LeuGly: 5.641 ± 0.104
LeuHis: 1.546 ± 0.037
LeuIle: 6.862 ± 0.104
LeuLys: 8.082 ± 0.126
LeuLeu: 8.549 ± 0.138
LeuMet: 1.746 ± 0.045
LeuAsn: 6.201 ± 0.071
LeuPro: 3.452 ± 0.076
LeuGln: 3.004 ± 0.069
LeuArg: 2.944 ± 0.054
LeuSer: 7.175 ± 0.107
LeuThr: 5.326 ± 0.087
LeuVal: 5.211 ± 0.083
LeuTrp: 0.777 ± 0.028
LeuTyr: 3.115 ± 0.06
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.154 ± 0.035
MetCys: 0.134 ± 0.009
MetAsp: 0.905 ± 0.023
MetGlu: 1.148 ± 0.031
MetPhe: 0.84 ± 0.029
MetGly: 1.063 ± 0.033
MetHis: 0.334 ± 0.016
MetIle: 1.489 ± 0.038
MetLys: 1.986 ± 0.044
MetLeu: 1.722 ± 0.042
MetMet: 0.44 ± 0.019
MetAsn: 1.357 ± 0.034
MetPro: 0.661 ± 0.022
MetGln: 0.598 ± 0.021
MetArg: 0.738 ± 0.025
MetSer: 1.349 ± 0.032
MetThr: 1.017 ± 0.028
MetVal: 1.165 ± 0.031
MetTrp: 0.127 ± 0.01
MetTyr: 0.66 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.54 ± 0.066
AsnCys: 0.502 ± 0.023
AsnAsp: 3.937 ± 0.102
AsnGlu: 4.351 ± 0.057
AsnPhe: 3.415 ± 0.056
AsnGly: 4.544 ± 0.088
AsnHis: 1.252 ± 0.046
AsnIle: 5.396 ± 0.069
AsnLys: 5.166 ± 0.084
AsnLeu: 5.879 ± 0.077
AsnMet: 1.152 ± 0.034
AsnAsn: 4.94 ± 0.084
AsnPro: 2.745 ± 0.068
AsnGln: 2.381 ± 0.048
AsnArg: 2.372 ± 0.042
AsnSer: 4.684 ± 0.074
AsnThr: 4.361 ± 0.098
AsnVal: 3.872 ± 0.066
AsnTrp: 0.822 ± 0.024
AsnTyr: 3.159 ± 0.06
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.531 ± 0.041
ProCys: 0.327 ± 0.052
ProAsp: 2.075 ± 0.105
ProGlu: 2.539 ± 0.053
ProPhe: 1.857 ± 0.043
ProGly: 1.594 ± 0.042
ProHis: 0.563 ± 0.023
ProIle: 2.626 ± 0.043
ProLys: 2.477 ± 0.051
ProLeu: 2.691 ± 0.078
ProMet: 0.612 ± 0.022
ProAsn: 2.458 ± 0.067
ProPro: 0.738 ± 0.025
ProGln: 0.947 ± 0.027
ProArg: 0.854 ± 0.029
ProSer: 2.304 ± 0.049
ProThr: 2.276 ± 0.118
ProVal: 2.034 ± 0.05
ProTrp: 0.325 ± 0.017
ProTyr: 1.303 ± 0.029
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.639 ± 0.051
GlnCys: 0.183 ± 0.011
GlnAsp: 1.443 ± 0.043
GlnGlu: 2.447 ± 0.061
GlnPhe: 1.608 ± 0.036
GlnGly: 1.785 ± 0.04
GlnHis: 0.572 ± 0.022
GlnIle: 2.481 ± 0.044
GlnLys: 2.803 ± 0.056
GlnLeu: 3.26 ± 0.056
GlnMet: 0.655 ± 0.021
GlnAsn: 2.224 ± 0.055
GlnPro: 0.961 ± 0.041
GlnGln: 1.337 ± 0.034
GlnArg: 1.125 ± 0.031
GlnSer: 1.913 ± 0.045
GlnThr: 1.855 ± 0.054
GlnVal: 1.947 ± 0.048
GlnTrp: 0.382 ± 0.016
GlnTyr: 1.37 ± 0.039
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.677 ± 0.039
ArgCys: 0.193 ± 0.011
ArgAsp: 1.664 ± 0.038
ArgGlu: 2.073 ± 0.049
ArgPhe: 1.754 ± 0.038
ArgGly: 1.887 ± 0.037
ArgHis: 0.583 ± 0.027
ArgIle: 2.882 ± 0.05
ArgLys: 3.055 ± 0.058
ArgLeu: 3.043 ± 0.054
ArgMet: 0.756 ± 0.026
ArgAsn: 2.284 ± 0.046
ArgPro: 0.879 ± 0.025
ArgGln: 0.905 ± 0.029
ArgArg: 1.235 ± 0.039
ArgSer: 2.032 ± 0.053
ArgThr: 1.787 ± 0.043
ArgVal: 2.067 ± 0.046
ArgTrp: 0.352 ± 0.019
ArgTyr: 1.49 ± 0.041
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.433 ± 0.078
SerCys: 0.718 ± 0.027
SerAsp: 4.066 ± 0.132
SerGlu: 4.518 ± 0.081
SerPhe: 4.273 ± 0.06
SerGly: 4.833 ± 0.1
SerHis: 1.002 ± 0.028
SerIle: 6.092 ± 0.091
SerLys: 5.597 ± 0.077
SerLeu: 6.724 ± 0.094
SerMet: 1.224 ± 0.031
SerAsn: 4.95 ± 0.072
SerPro: 2.113 ± 0.045
SerGln: 2.126 ± 0.058
SerArg: 1.988 ± 0.046
SerSer: 5.316 ± 0.085
SerThr: 3.967 ± 0.111
SerVal: 4.067 ± 0.071
SerTrp: 0.767 ± 0.033
SerTyr: 3.24 ± 0.056
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.303 ± 0.12
ThrCys: 0.445 ± 0.046
ThrAsp: 3.694 ± 0.199
ThrGlu: 3.353 ± 0.049
ThrPhe: 3.024 ± 0.059
ThrGly: 4.065 ± 0.136
ThrHis: 1.009 ± 0.031
ThrIle: 5.582 ± 0.155
ThrLys: 4.068 ± 0.065
ThrLeu: 5.432 ± 0.067
ThrMet: 0.904 ± 0.031
ThrAsn: 3.889 ± 0.087
ThrPro: 2.59 ± 0.05
ThrGln: 1.815 ± 0.047
ThrArg: 1.646 ± 0.042
ThrSer: 4.572 ± 0.147
ThrThr: 3.924 ± 0.133
ThrVal: 4.07 ± 0.166
ThrTrp: 0.604 ± 0.032
ThrTyr: 2.619 ± 0.077
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.656 ± 0.09
ValCys: 0.638 ± 0.047
ValAsp: 3.542 ± 0.086
ValGlu: 3.797 ± 0.066
ValPhe: 3.409 ± 0.059
ValGly: 3.676 ± 0.066
ValHis: 0.957 ± 0.027
ValIle: 4.644 ± 0.074
ValLys: 4.567 ± 0.079
ValLeu: 5.903 ± 0.097
ValMet: 1.133 ± 0.03
ValAsn: 4.082 ± 0.087
ValPro: 2.071 ± 0.086
ValGln: 1.836 ± 0.052
ValArg: 1.874 ± 0.042
ValSer: 4.865 ± 0.063
ValThr: 3.992 ± 0.143
ValVal: 4.193 ± 0.087
ValTrp: 0.577 ± 0.02
ValTyr: 2.468 ± 0.052
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.501 ± 0.019
TrpCys: 0.1 ± 0.008
TrpAsp: 0.541 ± 0.022
TrpGlu: 0.638 ± 0.025
TrpPhe: 0.578 ± 0.021
TrpGly: 0.641 ± 0.019
TrpHis: 0.2 ± 0.012
TrpIle: 0.731 ± 0.023
TrpLys: 0.881 ± 0.025
TrpLeu: 0.959 ± 0.03
TrpMet: 0.266 ± 0.015
TrpAsn: 0.709 ± 0.023
TrpPro: 0.175 ± 0.012
TrpGln: 0.407 ± 0.019
TrpArg: 0.377 ± 0.018
TrpSer: 0.744 ± 0.036
TrpThr: 0.519 ± 0.02
TrpVal: 0.609 ± 0.021
TrpTrp: 0.14 ± 0.01
TrpTyr: 0.434 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.037 ± 0.038
TyrCys: 0.357 ± 0.017
TyrAsp: 2.281 ± 0.048
TyrGlu: 2.435 ± 0.044
TyrPhe: 2.428 ± 0.053
TyrGly: 2.552 ± 0.043
TyrHis: 0.794 ± 0.025
TyrIle: 2.864 ± 0.051
TyrLys: 3.371 ± 0.059
TyrLeu: 3.847 ± 0.068
TyrMet: 0.704 ± 0.027
TyrAsn: 2.779 ± 0.048
TyrPro: 1.485 ± 0.051
TyrGln: 1.594 ± 0.039
TyrArg: 1.674 ± 0.036
TyrSer: 2.979 ± 0.057
TyrThr: 2.719 ± 0.114
TyrVal: 2.389 ± 0.056
TyrTrp: 0.486 ± 0.019
TyrTyr: 1.93 ± 0.044
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4480 proteins (1587372 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
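
Bootstrapping at the protein level can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's own code: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the reported error is taken to be the sample standard deviation across resamples. The function name bootstrap_error and its parameters are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: the error is the standard deviation of the frequency over
    n_resamples bootstrap samples drawn at the protein level.
    """
    rng = random.Random(seed)
    # Pre-compute per-protein hits and totals so each resample is cheap.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[dipeptide], sum(counts.values())))

    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


if __name__ == "__main__":
    toy = ["MKKLLE", "MLLKKA", "AAAKKK", "LLLLAA"]   # toy data only
    print(bootstrap_error(toy, "LL"))
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, which is presumably why the note specifies the protein level.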

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski