Amino acid dipepetide frequency for Gavia stellata (Red-throated diver) (Colymbus stellatus)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.476AlaAla: 5.476 ± 0.053
1.309AlaCys: 1.309 ± 0.023
2.988AlaAsp: 2.988 ± 0.029
4.541AlaGlu: 4.541 ± 0.048
2.65AlaPhe: 2.65 ± 0.032
3.731AlaGly: 3.731 ± 0.047
1.334AlaHis: 1.334 ± 0.019
3.194AlaIle: 3.194 ± 0.038
3.78AlaLys: 3.78 ± 0.044
6.359AlaLeu: 6.359 ± 0.054
1.526AlaMet: 1.526 ± 0.024
2.31AlaAsn: 2.31 ± 0.028
2.772AlaPro: 2.772 ± 0.036
2.668AlaGln: 2.668 ± 0.035
2.878AlaArg: 2.878 ± 0.031
5.124AlaSer: 5.124 ± 0.049
3.343AlaThr: 3.343 ± 0.032
4.862AlaVal: 4.862 ± 0.047
0.679AlaTrp: 0.679 ± 0.016
1.658AlaTyr: 1.658 ± 0.025
0.002AlaXaa: 0.002 ± 0.001
Cys
1.169CysAla: 1.169 ± 0.024
0.643CysCys: 0.643 ± 0.021
1.032CysAsp: 1.032 ± 0.026
1.306CysGlu: 1.306 ± 0.032
0.961CysPhe: 0.961 ± 0.019
1.418CysGly: 1.418 ± 0.025
0.611CysHis: 0.611 ± 0.019
1.173CysIle: 1.173 ± 0.02
1.363CysLys: 1.363 ± 0.025
2.157CysLeu: 2.157 ± 0.027
0.459CysMet: 0.459 ± 0.013
0.912CysAsn: 0.912 ± 0.021
1.178CysPro: 1.178 ± 0.031
1.07CysGln: 1.07 ± 0.027
1.226CysArg: 1.226 ± 0.021
1.993CysSer: 1.993 ± 0.034
1.187CysThr: 1.187 ± 0.025
1.328CysVal: 1.328 ± 0.025
0.297CysTrp: 0.297 ± 0.009
0.696CysTyr: 0.696 ± 0.017
0.0CysXaa: 0.0 ± 0.0
Asp
2.873AspAla: 2.873 ± 0.03
1.119AspCys: 1.119 ± 0.021
2.864AspAsp: 2.864 ± 0.036
3.691AspGlu: 3.691 ± 0.043
2.309AspPhe: 2.309 ± 0.031
3.175AspGly: 3.175 ± 0.038
1.171AspHis: 1.171 ± 0.021
3.038AspIle: 3.038 ± 0.032
2.896AspLys: 2.896 ± 0.033
5.132AspLeu: 5.132 ± 0.048
1.171AspMet: 1.171 ± 0.019
1.989AspAsn: 1.989 ± 0.027
2.615AspPro: 2.615 ± 0.034
1.883AspGln: 1.883 ± 0.024
2.329AspArg: 2.329 ± 0.031
4.068AspSer: 4.068 ± 0.049
2.508AspThr: 2.508 ± 0.03
3.204AspVal: 3.204 ± 0.03
0.667AspTrp: 0.667 ± 0.015
1.664AspTyr: 1.664 ± 0.025
0.0AspXaa: 0.0 ± 0.0
Glu
4.645GluAla: 4.645 ± 0.045
1.302GluCys: 1.302 ± 0.036
4.535GluAsp: 4.535 ± 0.046
7.897GluGlu: 7.897 ± 0.097
2.272GluPhe: 2.272 ± 0.026
3.818GluGly: 3.818 ± 0.039
1.582GluHis: 1.582 ± 0.024
3.658GluIle: 3.658 ± 0.042
5.937GluLys: 5.937 ± 0.076
6.392GluLeu: 6.392 ± 0.069
1.816GluMet: 1.816 ± 0.024
3.556GluAsn: 3.556 ± 0.036
2.519GluPro: 2.519 ± 0.033
3.209GluGln: 3.209 ± 0.045
3.929GluArg: 3.929 ± 0.052
4.476GluSer: 4.476 ± 0.047
3.626GluThr: 3.626 ± 0.047
4.373GluVal: 4.373 ± 0.04
0.722GluTrp: 0.722 ± 0.016
1.866GluTyr: 1.866 ± 0.028
0.001GluXaa: 0.001 ± 0.0
Phe
2.193PheAla: 2.193 ± 0.028
1.019PheCys: 1.019 ± 0.02
1.853PheAsp: 1.853 ± 0.026
2.196PheGlu: 2.196 ± 0.029
1.918PhePhe: 1.918 ± 0.029
2.341PheGly: 2.341 ± 0.035
1.112PheHis: 1.112 ± 0.019
2.198PheIle: 2.198 ± 0.03
2.137PheLys: 2.137 ± 0.027
4.304PheLeu: 4.304 ± 0.047
0.808PheMet: 0.808 ± 0.019
1.591PheAsn: 1.591 ± 0.024
1.984PhePro: 1.984 ± 0.03
1.855PheGln: 1.855 ± 0.025
1.918PheArg: 1.918 ± 0.03
3.522PheSer: 3.522 ± 0.038
2.28PheThr: 2.28 ± 0.032
2.435PheVal: 2.435 ± 0.03
0.533PheTrp: 0.533 ± 0.015
1.367PheTyr: 1.367 ± 0.026
0.0PheXaa: 0.0 ± 0.0
Gly
3.3GlyAla: 3.3 ± 0.041
1.198GlyCys: 1.198 ± 0.024
2.895GlyAsp: 2.895 ± 0.035
3.703GlyGlu: 3.703 ± 0.05
2.498GlyPhe: 2.498 ± 0.039
3.713GlyGly: 3.713 ± 0.054
1.446GlyHis: 1.446 ± 0.021
3.16GlyIle: 3.16 ± 0.038
4.027GlyLys: 4.027 ± 0.044
5.029GlyLeu: 5.029 ± 0.05
1.367GlyMet: 1.367 ± 0.025
2.634GlyAsn: 2.634 ± 0.033
2.633GlyPro: 2.633 ± 0.075
2.451GlyGln: 2.451 ± 0.032
3.019GlyArg: 3.019 ± 0.041
4.896GlySer: 4.896 ± 0.058
3.43GlyThr: 3.43 ± 0.038
3.38GlyVal: 3.38 ± 0.037
0.755GlyTrp: 0.755 ± 0.018
1.867GlyTyr: 1.867 ± 0.03
0.002GlyXaa: 0.002 ± 0.001
His
1.344HisAla: 1.344 ± 0.024
0.709HisCys: 0.709 ± 0.015
0.932HisAsp: 0.932 ± 0.018
1.389HisGlu: 1.389 ± 0.022
1.1HisPhe: 1.1 ± 0.02
1.461HisGly: 1.461 ± 0.022
0.854HisHis: 0.854 ± 0.021
1.391HisIle: 1.391 ± 0.024
1.401HisLys: 1.401 ± 0.026
2.824HisLeu: 2.824 ± 0.032
0.597HisMet: 0.597 ± 0.015
0.995HisAsn: 0.995 ± 0.018
1.465HisPro: 1.465 ± 0.026
1.199HisGln: 1.199 ± 0.022
1.411HisArg: 1.411 ± 0.022
2.167HisSer: 2.167 ± 0.03
1.304HisThr: 1.304 ± 0.027
1.535HisVal: 1.535 ± 0.021
0.35HisTrp: 0.35 ± 0.011
0.866HisTyr: 0.866 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
3.13IleAla: 3.13 ± 0.035
1.224IleCys: 1.224 ± 0.025
2.398IleAsp: 2.398 ± 0.032
2.999IleGlu: 2.999 ± 0.037
2.266IlePhe: 2.266 ± 0.033
2.524IleGly: 2.524 ± 0.032
1.429IleHis: 1.429 ± 0.022
2.89IleIle: 2.89 ± 0.036
3.158IleLys: 3.158 ± 0.034
5.136IleLeu: 5.136 ± 0.053
1.117IleMet: 1.117 ± 0.022
2.263IleAsn: 2.263 ± 0.03
2.897IlePro: 2.897 ± 0.033
2.534IleGln: 2.534 ± 0.03
2.576IleArg: 2.576 ± 0.029
4.281IleSer: 4.281 ± 0.04
2.95IleThr: 2.95 ± 0.034
3.058IleVal: 3.058 ± 0.034
0.592IleTrp: 0.592 ± 0.014
1.692IleTyr: 1.692 ± 0.025
0.0IleXaa: 0.0 ± 0.0
Lys
4.182LysAla: 4.182 ± 0.047
1.225LysCys: 1.225 ± 0.023
3.566LysAsp: 3.566 ± 0.043
5.843LysGlu: 5.843 ± 0.073
1.977LysPhe: 1.977 ± 0.027
3.484LysGly: 3.484 ± 0.053
1.643LysHis: 1.643 ± 0.025
3.372LysIle: 3.372 ± 0.041
5.64LysLys: 5.64 ± 0.073
5.851LysLeu: 5.851 ± 0.055
1.626LysMet: 1.626 ± 0.023
2.953LysAsn: 2.953 ± 0.037
3.128LysPro: 3.128 ± 0.039
3.113LysGln: 3.113 ± 0.037
3.633LysArg: 3.633 ± 0.04
4.348LysSer: 4.348 ± 0.05
3.548LysThr: 3.548 ± 0.033
3.85LysVal: 3.85 ± 0.045
0.695LysTrp: 0.695 ± 0.018
1.948LysTyr: 1.948 ± 0.027
0.0LysXaa: 0.0 ± 0.0
Leu
6.128LeuAla: 6.128 ± 0.053
2.201LeuCys: 2.201 ± 0.033
4.793LeuAsp: 4.793 ± 0.041
7.034LeuGlu: 7.034 ± 0.072
3.667LeuPhe: 3.667 ± 0.051
5.113LeuGly: 5.113 ± 0.045
2.714LeuHis: 2.714 ± 0.033
4.392LeuIle: 4.392 ± 0.042
6.618LeuLys: 6.618 ± 0.054
10.088LeuLeu: 10.088 ± 0.099
2.086LeuMet: 2.086 ± 0.027
3.98LeuAsn: 3.98 ± 0.041
5.278LeuPro: 5.278 ± 0.049
5.646LeuGln: 5.646 ± 0.061
5.077LeuArg: 5.077 ± 0.047
7.848LeuSer: 7.848 ± 0.057
5.026LeuThr: 5.026 ± 0.044
5.454LeuVal: 5.454 ± 0.052
1.069LeuTrp: 1.069 ± 0.021
2.842LeuTyr: 2.842 ± 0.034
0.001LeuXaa: 0.001 ± 0.0
Met
1.617MetAla: 1.617 ± 0.024
0.455MetCys: 0.455 ± 0.012
1.293MetAsp: 1.293 ± 0.021
1.903MetGlu: 1.903 ± 0.03
0.867MetPhe: 0.867 ± 0.017
1.247MetGly: 1.247 ± 0.022
0.524MetHis: 0.524 ± 0.014
1.004MetIle: 1.004 ± 0.02
1.679MetLys: 1.679 ± 0.023
2.114MetLeu: 2.114 ± 0.032
0.623MetMet: 0.623 ± 0.018
1.038MetAsn: 1.038 ± 0.02
1.077MetPro: 1.077 ± 0.021
1.069MetGln: 1.069 ± 0.023
1.058MetArg: 1.058 ± 0.018
1.552MetSer: 1.552 ± 0.022
1.212MetThr: 1.212 ± 0.021
1.477MetVal: 1.477 ± 0.024
0.264MetTrp: 0.264 ± 0.009
0.684MetTyr: 0.684 ± 0.016
0.0MetXaa: 0.0 ± 0.0
Asn
2.391AsnAla: 2.391 ± 0.034
0.999AsnCys: 0.999 ± 0.022
1.763AsnAsp: 1.763 ± 0.026
2.687AsnGlu: 2.687 ± 0.036
1.702AsnPhe: 1.702 ± 0.024
2.791AsnGly: 2.791 ± 0.04
1.043AsnHis: 1.043 ± 0.02
2.628AsnIle: 2.628 ± 0.03
2.71AsnLys: 2.71 ± 0.032
4.165AsnLeu: 4.165 ± 0.047
1.057AsnMet: 1.057 ± 0.019
1.938AsnAsn: 1.938 ± 0.031
2.318AsnPro: 2.318 ± 0.033
1.747AsnGln: 1.747 ± 0.029
2.028AsnArg: 2.028 ± 0.028
3.562AsnSer: 3.562 ± 0.044
2.344AsnThr: 2.344 ± 0.031
2.555AsnVal: 2.555 ± 0.032
0.499AsnTrp: 0.499 ± 0.013
1.356AsnTyr: 1.356 ± 0.022
0.0AsnXaa: 0.0 ± 0.0
Pro
3.518ProAla: 3.518 ± 0.046
1.023ProCys: 1.023 ± 0.025
2.612ProAsp: 2.612 ± 0.033
3.857ProGlu: 3.857 ± 0.046
1.943ProPhe: 1.943 ± 0.023
3.405ProGly: 3.405 ± 0.09
1.204ProHis: 1.204 ± 0.021
2.004ProIle: 2.004 ± 0.026
2.771ProLys: 2.771 ± 0.045
4.6ProLeu: 4.6 ± 0.046
0.952ProMet: 0.952 ± 0.022
1.966ProAsn: 1.966 ± 0.033
4.131ProPro: 4.131 ± 0.083
2.289ProGln: 2.289 ± 0.031
2.402ProArg: 2.402 ± 0.034
4.839ProSer: 4.839 ± 0.066
2.682ProThr: 2.682 ± 0.035
3.724ProVal: 3.724 ± 0.037
0.543ProTrp: 0.543 ± 0.015
1.484ProTyr: 1.484 ± 0.024
0.001ProXaa: 0.001 ± 0.001
Gln
3.026GlnAla: 3.026 ± 0.035
0.956GlnCys: 0.956 ± 0.023
2.241GlnAsp: 2.241 ± 0.025
3.729GlnGlu: 3.729 ± 0.044
1.481GlnPhe: 1.481 ± 0.022
2.42GlnGly: 2.42 ± 0.032
1.29GlnHis: 1.29 ± 0.023
2.322GlnIle: 2.322 ± 0.028
3.288GlnLys: 3.288 ± 0.045
4.606GlnLeu: 4.606 ± 0.058
1.133GlnMet: 1.133 ± 0.018
2.088GlnAsn: 2.088 ± 0.029
2.332GlnPro: 2.332 ± 0.039
3.035GlnGln: 3.035 ± 0.065
2.642GlnArg: 2.642 ± 0.028
3.231GlnSer: 3.231 ± 0.038
2.401GlnThr: 2.401 ± 0.029
2.749GlnVal: 2.749 ± 0.033
0.523GlnTrp: 0.523 ± 0.012
1.326GlnTyr: 1.326 ± 0.024
0.001GlnXaa: 0.001 ± 0.0
Arg
3.007ArgAla: 3.007 ± 0.034
1.089ArgCys: 1.089 ± 0.025
2.531ArgAsp: 2.531 ± 0.036
3.745ArgGlu: 3.745 ± 0.05
1.887ArgPhe: 1.887 ± 0.028
2.747ArgGly: 2.747 ± 0.043
1.414ArgHis: 1.414 ± 0.025
2.631ArgIle: 2.631 ± 0.029
3.926ArgLys: 3.926 ± 0.04
4.724ArgLeu: 4.724 ± 0.045
1.201ArgMet: 1.201 ± 0.018
2.278ArgAsn: 2.278 ± 0.029
2.334ArgPro: 2.334 ± 0.035
2.461ArgGln: 2.461 ± 0.036
3.575ArgArg: 3.575 ± 0.047
3.887ArgSer: 3.887 ± 0.055
2.664ArgThr: 2.664 ± 0.034
2.892ArgVal: 2.892 ± 0.034
0.595ArgTrp: 0.595 ± 0.014
1.623ArgTyr: 1.623 ± 0.023
0.0ArgXaa: 0.0 ± 0.0
Ser
5.059SerAla: 5.059 ± 0.049
1.829SerCys: 1.829 ± 0.026
4.06SerAsp: 4.06 ± 0.042
5.124SerGlu: 5.124 ± 0.051
3.203SerPhe: 3.203 ± 0.038
4.876SerGly: 4.876 ± 0.051
2.022SerHis: 2.022 ± 0.028
3.626SerIle: 3.626 ± 0.037
4.581SerLys: 4.581 ± 0.053
7.949SerLeu: 7.949 ± 0.057
1.632SerMet: 1.632 ± 0.023
3.128SerAsn: 3.128 ± 0.033
5.008SerPro: 5.008 ± 0.072
3.65SerGln: 3.65 ± 0.049
3.963SerArg: 3.963 ± 0.054
8.983SerSer: 8.983 ± 0.12
4.56SerThr: 4.56 ± 0.048
5.151SerVal: 5.151 ± 0.048
0.984SerTrp: 0.984 ± 0.018
2.277SerTyr: 2.277 ± 0.029
0.002SerXaa: 0.002 ± 0.001
Thr
3.813ThrAla: 3.813 ± 0.034
1.315ThrCys: 1.315 ± 0.028
2.727ThrAsp: 2.727 ± 0.033
3.823ThrGlu: 3.823 ± 0.039
2.265ThrPhe: 2.265 ± 0.033
3.404ThrGly: 3.404 ± 0.036
1.195ThrHis: 1.195 ± 0.022
2.695ThrIle: 2.695 ± 0.032
3.013ThrLys: 3.013 ± 0.033
5.189ThrLeu: 5.189 ± 0.047
1.144ThrMet: 1.144 ± 0.021
2.007ThrAsn: 2.007 ± 0.027
3.122ThrPro: 3.122 ± 0.046
2.195ThrGln: 2.195 ± 0.026
2.277ThrArg: 2.277 ± 0.029
4.671ThrSer: 4.671 ± 0.054
3.079ThrThr: 3.079 ± 0.046
4.194ThrVal: 4.194 ± 0.036
0.679ThrTrp: 0.679 ± 0.017
1.595ThrTyr: 1.595 ± 0.024
0.001ThrXaa: 0.001 ± 0.001
Val
4.047ValAla: 4.047 ± 0.04
1.55ValCys: 1.55 ± 0.025
3.104ValAsp: 3.104 ± 0.04
4.02ValGlu: 4.02 ± 0.039
2.693ValPhe: 2.693 ± 0.031
3.317ValGly: 3.317 ± 0.036
1.575ValHis: 1.575 ± 0.023
3.391ValIle: 3.391 ± 0.04
3.98ValLys: 3.98 ± 0.043
6.3ValLeu: 6.3 ± 0.057
1.439ValMet: 1.439 ± 0.024
2.671ValAsn: 2.671 ± 0.03
3.346ValPro: 3.346 ± 0.037
2.846ValGln: 2.846 ± 0.032
2.915ValArg: 2.915 ± 0.029
4.968ValSer: 4.968 ± 0.048
3.91ValThr: 3.91 ± 0.045
4.273ValVal: 4.273 ± 0.043
0.711ValTrp: 0.711 ± 0.014
1.907ValTyr: 1.907 ± 0.028
0.0ValXaa: 0.0 ± 0.0
Trp
0.647TrpAla: 0.647 ± 0.015
0.244TrpCys: 0.244 ± 0.01
0.672TrpAsp: 0.672 ± 0.016
0.776TrpGlu: 0.776 ± 0.016
0.465TrpPhe: 0.465 ± 0.012
0.642TrpGly: 0.642 ± 0.02
0.31TrpHis: 0.31 ± 0.011
0.648TrpIle: 0.648 ± 0.014
0.905TrpLys: 0.905 ± 0.016
1.15TrpLeu: 1.15 ± 0.021
0.309TrpMet: 0.309 ± 0.012
0.621TrpAsn: 0.621 ± 0.014
0.444TrpPro: 0.444 ± 0.011
0.544TrpGln: 0.544 ± 0.014
0.646TrpArg: 0.646 ± 0.014
0.843TrpSer: 0.843 ± 0.018
0.668TrpThr: 0.668 ± 0.017
0.66TrpVal: 0.66 ± 0.015
0.185TrpTrp: 0.185 ± 0.008
0.373TrpTyr: 0.373 ± 0.01
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.619TyrAla: 1.619 ± 0.022
0.769TyrCys: 0.769 ± 0.018
1.508TyrAsp: 1.508 ± 0.025
1.867TyrGlu: 1.867 ± 0.026
1.438TyrPhe: 1.438 ± 0.023
1.774TyrGly: 1.774 ± 0.03
0.822TyrHis: 0.822 ± 0.018
1.701TyrIle: 1.701 ± 0.024
1.817TyrLys: 1.817 ± 0.028
2.983TyrLeu: 2.983 ± 0.04
0.694TyrMet: 0.694 ± 0.018
1.359TyrAsn: 1.359 ± 0.021
1.368TyrPro: 1.368 ± 0.025
1.356TyrGln: 1.356 ± 0.022
1.709TyrArg: 1.709 ± 0.027
2.406TyrSer: 2.406 ± 0.031
1.681TyrThr: 1.681 ± 0.027
1.81TyrVal: 1.81 ± 0.026
0.408TyrTrp: 0.408 ± 0.013
1.115TyrTyr: 1.115 ± 0.019
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.001
0.0XaaPhe: 0.0 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.002XaaLeu: 0.002 ± 0.001
0.001XaaMet: 0.001 ± 0.001
0.0XaaAsn: 0.0 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.001
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.001
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.053XaaXaa: 0.053 ± 0.009
Statistics based on 7730 proteins (3192604 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski