Amino acid dipeptide frequency for Streptococcus suis (strain 05ZYH33)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
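
As an illustration of what such a calculation can look like, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of overlapping adjacent pairs, and the mapping of every non-standard letter to `X` (Xaa) are assumptions made for this example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: dipeptides are counted as overlapping adjacent pairs
    (positions 0-1, 1-2, ...) and pairs never span two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is treated as Xaa ('X').
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real proteome data): pairs are AA, AC, AC.
freqs = dipeptide_permille(["AAC", "AC"])
```

With this toy input, `AC` accounts for two of the three pairs and `AA` for one, so the values sum to 1000‰.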

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
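
The uniform baseline and the over/under comparison can be sketched as follows. This is only an illustration: the helper names are hypothetical, and whether the page colours cells against the 400-pair or the 441-pair baseline, or takes the error margin into account, is not stated in the source. The three example values are taken from the table on this page.

```python
def expected_permille(n_letters=20):
    """Uniform expectation in per mille: 1000 / (alphabet size squared)."""
    return 1000.0 / (n_letters ** 2)


def classify(value_permille, n_letters=20):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a plain comparison with the baseline, ignoring the
    bootstrap error reported next to each value.
    """
    expected = expected_permille(n_letters)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Values (in per mille) copied from the table on this page.
examples = {"LeuLeu": 10.328, "AspAsp": 2.606, "CysCys": 0.062}
labels = {name: classify(value) for name, value in examples.items()}
```

With the 20-letter alphabet the baseline is 2.5‰, so LeuLeu and AspAsp fall above it and CysCys below it.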

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions; for example, 5.814‰ corresponds to 0.005814 (0.5814%).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.814 ± 0.144
AlaCys: 0.543 ± 0.031
AlaAsp: 4.494 ± 0.086
AlaGlu: 5.358 ± 0.113
AlaPhe: 3.312 ± 0.071
AlaGly: 5.906 ± 0.119
AlaHis: 1.302 ± 0.045
AlaIle: 5.913 ± 0.108
AlaLys: 4.799 ± 0.085
AlaLeu: 7.463 ± 0.127
AlaMet: 1.975 ± 0.059
AlaAsn: 3.149 ± 0.087
AlaPro: 2.218 ± 0.077
AlaGln: 3.148 ± 0.078
AlaArg: 2.911 ± 0.069
AlaSer: 4.616 ± 0.092
AlaThr: 4.497 ± 0.094
AlaVal: 5.492 ± 0.095
AlaTrp: 0.65 ± 0.03
AlaTyr: 2.793 ± 0.058
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.315 ± 0.026
CysCys: 0.062 ± 0.009
CysAsp: 0.289 ± 0.024
CysGlu: 0.271 ± 0.02
CysPhe: 0.31 ± 0.022
CysGly: 0.491 ± 0.03
CysHis: 0.217 ± 0.019
CysIle: 0.383 ± 0.023
CysLys: 0.297 ± 0.025
CysLeu: 0.729 ± 0.037
CysMet: 0.151 ± 0.016
CysAsn: 0.209 ± 0.017
CysPro: 0.273 ± 0.021
CysGln: 0.42 ± 0.029
CysArg: 0.274 ± 0.021
CysSer: 0.424 ± 0.037
CysThr: 0.261 ± 0.02
CysVal: 0.312 ± 0.025
CysTrp: 0.053 ± 0.01
CysTyr: 0.227 ± 0.02
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.667 ± 0.075
AspCys: 0.373 ± 0.026
AspAsp: 2.606 ± 0.068
AspGlu: 4.077 ± 0.085
AspPhe: 3.3 ± 0.084
AspGly: 3.873 ± 0.1
AspHis: 0.916 ± 0.042
AspIle: 4.246 ± 0.092
AspLys: 3.997 ± 0.084
AspLeu: 5.817 ± 0.105
AspMet: 1.504 ± 0.054
AspAsn: 2.355 ± 0.066
AspPro: 1.547 ± 0.057
AspGln: 2.089 ± 0.059
AspArg: 2.139 ± 0.063
AspSer: 3.148 ± 0.075
AspThr: 2.896 ± 0.071
AspVal: 3.627 ± 0.078
AspTrp: 0.719 ± 0.036
AspTyr: 2.818 ± 0.067
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.566 ± 0.127
GluCys: 0.3 ± 0.024
GluAsp: 3.734 ± 0.08
GluGlu: 6.499 ± 0.127
GluPhe: 2.798 ± 0.067
GluGly: 3.809 ± 0.076
GluHis: 1.276 ± 0.046
GluIle: 5.217 ± 0.09
GluLys: 5.867 ± 0.106
GluLeu: 7.169 ± 0.119
GluMet: 1.91 ± 0.06
GluAsn: 4.018 ± 0.076
GluPro: 1.696 ± 0.059
GluGln: 3.274 ± 0.085
GluArg: 3.33 ± 0.082
GluSer: 3.374 ± 0.08
GluThr: 3.827 ± 0.093
GluVal: 5.092 ± 0.113
GluTrp: 0.64 ± 0.032
GluTyr: 2.102 ± 0.059
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.376 ± 0.083
PheCys: 0.294 ± 0.023
PheAsp: 2.924 ± 0.074
PheGlu: 3.164 ± 0.074
PhePhe: 2.204 ± 0.081
PheGly: 3.34 ± 0.073
PheHis: 0.88 ± 0.04
PheIle: 3.072 ± 0.083
PheLys: 2.351 ± 0.071
PheLeu: 4.637 ± 0.11
PheMet: 1.151 ± 0.054
PheAsn: 1.823 ± 0.051
PhePro: 1.575 ± 0.057
PheGln: 1.621 ± 0.055
PheArg: 1.63 ± 0.055
PheSer: 3.185 ± 0.08
PheThr: 2.532 ± 0.063
PheVal: 3.174 ± 0.082
PheTrp: 0.498 ± 0.031
PheTyr: 1.887 ± 0.059
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.643 ± 0.101
GlyCys: 0.404 ± 0.027
GlyAsp: 3.376 ± 0.087
GlyGlu: 3.791 ± 0.083
GlyPhe: 3.286 ± 0.074
GlyGly: 4.267 ± 0.106
GlyHis: 1.379 ± 0.049
GlyIle: 5.327 ± 0.098
GlyLys: 4.583 ± 0.103
GlyLeu: 7.056 ± 0.109
GlyMet: 1.901 ± 0.062
GlyAsn: 2.786 ± 0.071
GlyPro: 1.433 ± 0.049
GlyGln: 3.359 ± 0.08
GlyArg: 2.906 ± 0.081
GlySer: 3.668 ± 0.084
GlyThr: 3.716 ± 0.097
GlyVal: 4.737 ± 0.092
GlyTrp: 0.637 ± 0.03
GlyTyr: 2.75 ± 0.073
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.227 ± 0.051
HisCys: 0.138 ± 0.015
HisAsp: 0.887 ± 0.035
HisGlu: 1.095 ± 0.044
HisPhe: 1.117 ± 0.049
HisGly: 1.259 ± 0.049
HisHis: 0.54 ± 0.034
HisIle: 1.314 ± 0.053
HisLys: 0.892 ± 0.038
HisLeu: 2.159 ± 0.067
HisMet: 0.415 ± 0.025
HisAsn: 0.69 ± 0.033
HisPro: 0.869 ± 0.049
HisGln: 0.928 ± 0.043
HisArg: 0.949 ± 0.044
HisSer: 1.12 ± 0.045
HisThr: 0.988 ± 0.037
HisVal: 1.095 ± 0.046
HisTrp: 0.146 ± 0.015
HisTyr: 0.923 ± 0.04
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.254 ± 0.106
IleCys: 0.573 ± 0.032
IleAsp: 4.315 ± 0.086
IleGlu: 5.128 ± 0.083
IlePhe: 3.351 ± 0.092
IleGly: 5.006 ± 0.099
IleHis: 1.374 ± 0.046
IleIle: 4.811 ± 0.115
IleLys: 3.978 ± 0.088
IleLeu: 7.325 ± 0.138
IleMet: 1.801 ± 0.058
IleAsn: 2.816 ± 0.067
IlePro: 2.824 ± 0.072
IleGln: 2.819 ± 0.072
IleArg: 3.016 ± 0.074
IleSer: 4.916 ± 0.098
IleThr: 3.642 ± 0.082
IleVal: 4.74 ± 0.093
IleTrp: 0.631 ± 0.034
IleTyr: 2.565 ± 0.059
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.698 ± 0.106
LysCys: 0.246 ± 0.021
LysAsp: 3.535 ± 0.089
LysGlu: 5.704 ± 0.103
LysPhe: 1.872 ± 0.057
LysGly: 3.768 ± 0.079
LysHis: 1.146 ± 0.045
LysIle: 4.519 ± 0.093
LysLys: 4.852 ± 0.106
LysLeu: 5.678 ± 0.099
LysMet: 1.901 ± 0.056
LysAsn: 3.309 ± 0.079
LysPro: 2.087 ± 0.059
LysGln: 2.86 ± 0.068
LysArg: 3.031 ± 0.074
LysSer: 3.471 ± 0.076
LysThr: 3.868 ± 0.088
LysVal: 4.448 ± 0.094
LysTrp: 0.585 ± 0.036
LysTyr: 2.08 ± 0.062
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.009 ± 0.13
LeuCys: 0.575 ± 0.038
LeuAsp: 6.1 ± 0.107
LeuGlu: 7.305 ± 0.132
LeuPhe: 4.379 ± 0.12
LeuGly: 6.553 ± 0.127
LeuHis: 1.686 ± 0.054
LeuIle: 6.591 ± 0.147
LeuLys: 5.867 ± 0.093
LeuLeu: 10.328 ± 0.197
LeuMet: 2.397 ± 0.074
LeuAsn: 3.964 ± 0.09
LeuPro: 4.323 ± 0.089
LeuGln: 3.663 ± 0.078
LeuArg: 3.872 ± 0.073
LeuSer: 7.059 ± 0.121
LeuThr: 6.414 ± 0.116
LeuVal: 7.466 ± 0.12
LeuTrp: 0.706 ± 0.034
LeuTyr: 3.356 ± 0.077
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.057 ± 0.069
MetCys: 0.161 ± 0.017
MetAsp: 1.448 ± 0.047
MetGlu: 1.77 ± 0.061
MetPhe: 0.801 ± 0.036
MetGly: 1.65 ± 0.064
MetHis: 0.373 ± 0.025
MetIle: 1.869 ± 0.06
MetLys: 2.075 ± 0.053
MetLeu: 2.254 ± 0.066
MetMet: 0.788 ± 0.043
MetAsn: 1.296 ± 0.05
MetPro: 0.857 ± 0.037
MetGln: 0.862 ± 0.037
MetArg: 1.095 ± 0.042
MetSer: 1.543 ± 0.053
MetThr: 1.91 ± 0.059
MetVal: 1.829 ± 0.051
MetTrp: 0.156 ± 0.017
MetTyr: 0.616 ± 0.03
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.939 ± 0.072
AsnCys: 0.251 ± 0.024
AsnAsp: 2.222 ± 0.062
AsnGlu: 2.415 ± 0.07
AsnPhe: 2.018 ± 0.063
AsnGly: 3.245 ± 0.081
AsnHis: 1.01 ± 0.04
AsnIle: 3.248 ± 0.074
AsnLys: 2.588 ± 0.078
AsnLeu: 4.519 ± 0.086
AsnMet: 1.138 ± 0.039
AsnAsn: 1.852 ± 0.062
AsnPro: 2.159 ± 0.062
AsnGln: 2.272 ± 0.07
AsnArg: 2.126 ± 0.065
AsnSer: 2.478 ± 0.075
AsnThr: 2.115 ± 0.066
AsnVal: 2.647 ± 0.059
AsnTrp: 0.54 ± 0.036
AsnTyr: 1.808 ± 0.058
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.662 ± 0.067
ProCys: 0.194 ± 0.019
ProAsp: 2.008 ± 0.059
ProGlu: 2.829 ± 0.082
ProPhe: 1.634 ± 0.038
ProGly: 1.801 ± 0.066
ProHis: 0.695 ± 0.033
ProIle: 2.524 ± 0.055
ProLys: 2.059 ± 0.095
ProLeu: 3.046 ± 0.07
ProMet: 0.79 ± 0.037
ProAsn: 1.593 ± 0.054
ProPro: 0.634 ± 0.039
ProGln: 1.318 ± 0.049
ProArg: 1.102 ± 0.048
ProSer: 2.228 ± 0.068
ProThr: 2.327 ± 0.069
ProVal: 2.658 ± 0.072
ProTrp: 0.281 ± 0.024
ProTyr: 1.337 ± 0.048
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.034 ± 0.095
GlnCys: 0.136 ± 0.015
GlnAsp: 2.006 ± 0.056
GlnGlu: 3.553 ± 0.068
GlnPhe: 1.737 ± 0.051
GlnGly: 2.379 ± 0.068
GlnHis: 0.68 ± 0.037
GlnIle: 3.066 ± 0.074
GlnLys: 2.635 ± 0.059
GlnLeu: 4.612 ± 0.101
GlnMet: 1.056 ± 0.047
GlnAsn: 1.588 ± 0.054
GlnPro: 1.317 ± 0.059
GlnGln: 1.649 ± 0.059
GlnArg: 1.603 ± 0.054
GlnSer: 2.438 ± 0.067
GlnThr: 2.629 ± 0.079
GlnVal: 3.586 ± 0.078
GlnTrp: 0.337 ± 0.027
GlnTyr: 1.397 ± 0.051
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.652 ± 0.078
ArgCys: 0.204 ± 0.019
ArgAsp: 2.184 ± 0.063
ArgGlu: 3.102 ± 0.085
ArgPhe: 2.054 ± 0.059
ArgGly: 2.384 ± 0.067
ArgHis: 0.842 ± 0.043
ArgIle: 3.174 ± 0.074
ArgLys: 3.011 ± 0.08
ArgLeu: 4.31 ± 0.096
ArgMet: 1.177 ± 0.044
ArgAsn: 1.873 ± 0.06
ArgPro: 1.299 ± 0.046
ArgGln: 2.189 ± 0.06
ArgArg: 2.066 ± 0.069
ArgSer: 2.164 ± 0.058
ArgThr: 2.215 ± 0.069
ArgVal: 2.737 ± 0.068
ArgTrp: 0.332 ± 0.027
ArgTyr: 1.688 ± 0.058
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.8 ± 0.089
SerCys: 0.394 ± 0.025
SerAsp: 3.243 ± 0.074
SerGlu: 3.737 ± 0.088
SerPhe: 2.992 ± 0.081
SerGly: 4.331 ± 0.088
SerHis: 1.238 ± 0.047
SerIle: 4.282 ± 0.093
SerLys: 3.648 ± 0.072
SerLeu: 6.796 ± 0.13
SerMet: 1.399 ± 0.046
SerAsn: 2.532 ± 0.067
SerPro: 2.105 ± 0.068
SerGln: 3.11 ± 0.076
SerArg: 2.534 ± 0.061
SerSer: 4.302 ± 0.115
SerThr: 3.494 ± 0.07
SerVal: 4.097 ± 0.081
SerTrp: 0.644 ± 0.028
SerTyr: 2.584 ± 0.057
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.376 ± 0.096
ThrCys: 0.34 ± 0.026
ThrAsp: 3.519 ± 0.087
ThrGlu: 3.716 ± 0.09
ThrPhe: 2.599 ± 0.071
ThrGly: 4.404 ± 0.103
ThrHis: 0.957 ± 0.043
ThrIle: 4.663 ± 0.102
ThrLys: 3.277 ± 0.085
ThrLeu: 5.443 ± 0.101
ThrMet: 1.266 ± 0.051
ThrAsn: 2.478 ± 0.076
ThrPro: 2.404 ± 0.084
ThrGln: 1.816 ± 0.056
ThrArg: 1.941 ± 0.064
ThrSer: 3.875 ± 0.093
ThrThr: 3.356 ± 0.094
ThrVal: 4.62 ± 0.107
ThrTrp: 0.532 ± 0.031
ThrTyr: 2.31 ± 0.062
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.129 ± 0.108
ValCys: 0.473 ± 0.029
ValAsp: 4.064 ± 0.089
ValGlu: 5.212 ± 0.108
ValPhe: 3.156 ± 0.081
ValGly: 4.589 ± 0.092
ValHis: 1.223 ± 0.049
ValIle: 4.619 ± 0.088
ValLys: 4.098 ± 0.092
ValLeu: 6.909 ± 0.097
ValMet: 1.635 ± 0.058
ValAsn: 3.115 ± 0.075
ValPro: 2.484 ± 0.073
ValGln: 2.353 ± 0.063
ValArg: 2.885 ± 0.079
ValSer: 4.359 ± 0.096
ValThr: 4.673 ± 0.113
ValVal: 5.097 ± 0.112
ValTrp: 0.566 ± 0.031
ValTyr: 2.509 ± 0.062
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.525 ± 0.028
TrpCys: 0.051 ± 0.009
TrpAsp: 0.486 ± 0.029
TrpGlu: 0.608 ± 0.033
TrpPhe: 0.422 ± 0.029
TrpGly: 0.563 ± 0.034
TrpHis: 0.161 ± 0.017
TrpIle: 0.598 ± 0.031
TrpLys: 0.591 ± 0.031
TrpLeu: 1.113 ± 0.048
TrpMet: 0.235 ± 0.019
TrpAsn: 0.525 ± 0.031
TrpPro: 0.207 ± 0.018
TrpGln: 0.443 ± 0.031
TrpArg: 0.397 ± 0.024
TrpSer: 0.627 ± 0.034
TrpThr: 0.585 ± 0.035
TrpVal: 0.511 ± 0.032
TrpTrp: 0.126 ± 0.014
TrpTyr: 0.414 ± 0.032
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.611 ± 0.073
TyrCys: 0.255 ± 0.021
TyrAsp: 2.274 ± 0.068
TyrGlu: 2.164 ± 0.069
TyrPhe: 1.975 ± 0.056
TyrGly: 2.386 ± 0.061
TyrHis: 0.854 ± 0.039
TyrIle: 2.455 ± 0.074
TyrLys: 2.192 ± 0.065
TyrLeu: 4.249 ± 0.093
TyrMet: 0.818 ± 0.037
TyrAsn: 1.621 ± 0.057
TyrPro: 1.407 ± 0.055
TyrGln: 2.245 ± 0.066
TyrArg: 1.849 ± 0.059
TyrSer: 2.268 ± 0.065
TyrThr: 1.947 ± 0.064
TyrVal: 2.184 ± 0.062
TyrTrp: 0.41 ± 0.033
TyrTyr: 1.768 ± 0.066
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2178 proteins (609028 amino acids)
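
To get a feel for the scale behind the per mille values, the totals above can be turned into an approximate number of dipeptides. The sketch below rests on an assumption not stated on this page: that every protein of length n contributes n − 1 overlapping dipeptides, so the total is the residue count minus the protein count. The function names are hypothetical.

```python
N_PROTEINS = 2178     # from the statistics line on this page
N_RESIDUES = 609028   # from the statistics line on this page


def n_dipeptides(n_residues, n_proteins):
    """Total overlapping pairs, assuming n - 1 pairs per protein of length n."""
    return n_residues - n_proteins


def approx_count(permille, total_pairs):
    """Convert a per mille frequency back into an approximate raw count."""
    return permille / 1000.0 * total_pairs


total_pairs = n_dipeptides(N_RESIDUES, N_PROTEINS)
# LeuLeu is listed at 10.328 per mille in the table.
leuleu_count = approx_count(10.328, total_pairs)
```

Under this assumption the proteome contains 606850 dipeptides, of which roughly 6.3 thousand would be LeuLeu.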

Note: The error was estimated by bootstrapping (×100) at the protein level.
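
A protein-level bootstrap can be sketched as follows. This is an illustrative reading of the note, not the database's actual code: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the spread is summarised as a standard deviation. That the reported ± is a standard deviation, as well as the function names and the fixed seed, are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round draws len(proteins) proteins with replacement, so the
    resampling unit is the whole protein rather than a single residue pair.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(counts.values()) for counts in sample)
        hits = sum(counts[dipeptide] for counts in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (not real proteome data).
mean_aa, err_aa = bootstrap_error(["AAAA", "ACAC", "CCCA"], "AA")
```

Resampling whole proteins keeps the pairs within a protein together, so the estimate reflects variation between proteins rather than treating every pair as independent.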

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski