Amino acid dipeptide frequency for Trichoplax sp. H2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
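
As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. It is a minimal sketch, not the database's own pipeline: the function name `dipeptide_frequencies`, the use of one-letter codes (the table uses three-letter codes), the overlapping-window counting, and the mapping of every non-standard letter to `X` (Xaa) are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all given sequences.

    Assumption: pairs are counted with an overlapping window of width 2,
    and pairs never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map any non-standard or unknown residue to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["MKLLA", "AKLB"])
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

With real data, the sequences would come from the proteome's FASTA file; the toy input only shows the mechanics.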

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
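
The unit conversion and the comparison against the uniform expectation can be sketched as follows. The names `permille_to_fraction` and `classify` are hypothetical helpers introduced for illustration, and the baseline of 1/400 (2.5‰) is the uniform expectation stated above; whether the page's colouring uses exactly this threshold is an assumption.

```python
# Uniform expectation for the 400 standard dipeptides, in per mille
# (1/400 = 0.25 % = 2.5 per mille). Assumed baseline for this sketch.
EXPECTED_PERMILLE = 1000.0 / 400


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value_permille, expected=EXPECTED_PERMILLE):
    """Label a dipeptide as over- or underrepresented relative to `expected`."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


# Two values taken from the table: LeuLeu and TrpTrp.
for name, value in {"LeuLeu": 9.298, "TrpTrp": 0.169}.items():
    print(name, permille_to_fraction(value), classify(value))
```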

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± error, grouped by first residue. Second-residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.397 ± 0.044
AlaCys: 1.163 ± 0.018
AlaAsp: 3.045 ± 0.025
AlaGlu: 3.264 ± 0.051
AlaPhe: 2.412 ± 0.023
AlaGly: 2.809 ± 0.035
AlaHis: 1.023 ± 0.015
AlaIle: 5.046 ± 0.042
AlaLys: 3.956 ± 0.038
AlaLeu: 5.517 ± 0.041
AlaMet: 1.455 ± 0.018
AlaAsn: 3.44 ± 0.025
AlaPro: 1.747 ± 0.02
AlaGln: 1.927 ± 0.021
AlaArg: 2.567 ± 0.024
AlaSer: 5.063 ± 0.027
AlaThr: 3.969 ± 0.036
AlaVal: 3.903 ± 0.029
AlaTrp: 0.574 ± 0.011
AlaTyr: 1.935 ± 0.021
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.916 ± 0.016
CysCys: 0.563 ± 0.01
CysAsp: 1.078 ± 0.018
CysGlu: 0.892 ± 0.017
CysPhe: 0.811 ± 0.012
CysGly: 1.12 ± 0.019
CysHis: 0.585 ± 0.013
CysIle: 1.528 ± 0.02
CysLys: 1.314 ± 0.022
CysLeu: 1.984 ± 0.024
CysMet: 0.405 ± 0.008
CysAsn: 1.379 ± 0.025
CysPro: 0.886 ± 0.058
CysGln: 1.153 ± 0.033
CysArg: 1.122 ± 0.019
CysSer: 1.79 ± 0.034
CysThr: 1.094 ± 0.026
CysVal: 1.021 ± 0.016
CysTrp: 0.256 ± 0.006
CysTyr: 0.891 ± 0.013
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.942 ± 0.028
AspCys: 1.115 ± 0.017
AspAsp: 4.307 ± 0.043
AspGlu: 4.1 ± 0.038
AspPhe: 2.133 ± 0.017
AspGly: 3.029 ± 0.026
AspHis: 1.375 ± 0.018
AspIle: 4.414 ± 0.03
AspLys: 3.566 ± 0.029
AspLeu: 4.67 ± 0.03
AspMet: 1.128 ± 0.014
AspAsn: 3.465 ± 0.029
AspPro: 1.933 ± 0.018
AspGln: 2.285 ± 0.025
AspArg: 2.718 ± 0.026
AspSer: 4.525 ± 0.036
AspThr: 2.77 ± 0.022
AspVal: 3.254 ± 0.026
AspTrp: 0.622 ± 0.01
AspTyr: 2.117 ± 0.019
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.421 ± 0.052
GluCys: 1.068 ± 0.025
GluAsp: 3.65 ± 0.035
GluGlu: 4.841 ± 0.064
GluPhe: 2.092 ± 0.02
GluGly: 2.298 ± 0.019
GluHis: 1.067 ± 0.015
GluIle: 4.564 ± 0.037
GluLys: 4.643 ± 0.048
GluLeu: 5.372 ± 0.048
GluMet: 1.413 ± 0.018
GluAsn: 3.903 ± 0.035
GluPro: 1.466 ± 0.019
GluGln: 2.169 ± 0.029
GluArg: 2.868 ± 0.033
GluSer: 4.501 ± 0.041
GluThr: 2.899 ± 0.024
GluVal: 3.121 ± 0.029
GluTrp: 0.561 ± 0.011
GluTyr: 1.88 ± 0.019
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.354 ± 0.02
PheCys: 0.9 ± 0.013
PheAsp: 2.293 ± 0.02
PheGlu: 1.954 ± 0.019
PhePhe: 1.638 ± 0.019
PheGly: 2.356 ± 0.026
PheHis: 1.038 ± 0.012
PheIle: 3.027 ± 0.026
PheLys: 2.246 ± 0.021
PheLeu: 3.697 ± 0.032
PheMet: 0.905 ± 0.013
PheAsn: 2.375 ± 0.023
PhePro: 1.467 ± 0.017
PheGln: 1.638 ± 0.017
PheArg: 1.906 ± 0.019
PheSer: 3.21 ± 0.025
PheThr: 2.456 ± 0.027
PheVal: 2.242 ± 0.02
PheTrp: 0.442 ± 0.009
PheTyr: 1.657 ± 0.019
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.465 ± 0.025
GlyCys: 0.964 ± 0.016
GlyAsp: 2.64 ± 0.028
GlyGlu: 2.352 ± 0.025
GlyPhe: 2.174 ± 0.027
GlyGly: 2.697 ± 0.031
GlyHis: 1.274 ± 0.018
GlyIle: 3.809 ± 0.029
GlyLys: 3.54 ± 0.031
GlyLeu: 4.248 ± 0.03
GlyMet: 1.026 ± 0.013
GlyAsn: 3.264 ± 0.029
GlyPro: 1.482 ± 0.025
GlyGln: 1.949 ± 0.025
GlyArg: 2.441 ± 0.025
GlySer: 4.178 ± 0.048
GlyThr: 2.673 ± 0.036
GlyVal: 2.663 ± 0.023
GlyTrp: 0.602 ± 0.014
GlyTyr: 2.239 ± 0.036
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.153 ± 0.014
HisCys: 0.554 ± 0.012
HisAsp: 1.342 ± 0.016
HisGlu: 1.254 ± 0.022
HisPhe: 1.007 ± 0.013
HisGly: 1.292 ± 0.016
HisHis: 0.89 ± 0.016
HisIle: 1.425 ± 0.014
HisLys: 1.256 ± 0.016
HisLeu: 2.301 ± 0.019
HisMet: 0.407 ± 0.009
HisAsn: 1.306 ± 0.017
HisPro: 1.162 ± 0.02
HisGln: 1.26 ± 0.017
HisArg: 1.291 ± 0.017
HisSer: 1.894 ± 0.018
HisThr: 1.115 ± 0.015
HisVal: 1.211 ± 0.013
HisTrp: 0.263 ± 0.007
HisTyr: 1.031 ± 0.016
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.053 ± 0.031
IleCys: 1.566 ± 0.019
IleAsp: 4.226 ± 0.029
IleGlu: 3.999 ± 0.033
IlePhe: 2.987 ± 0.031
IleGly: 3.817 ± 0.03
IleHis: 1.602 ± 0.017
IleIle: 5.446 ± 0.041
IleLys: 4.513 ± 0.027
IleLeu: 6.65 ± 0.041
IleMet: 1.49 ± 0.014
IleAsn: 4.294 ± 0.033
IlePro: 3.016 ± 0.026
IleGln: 2.931 ± 0.024
IleArg: 3.43 ± 0.026
IleSer: 6.283 ± 0.043
IleThr: 4.42 ± 0.036
IleVal: 4.149 ± 0.027
IleTrp: 0.753 ± 0.012
IleTyr: 2.654 ± 0.025
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.915 ± 0.041
LysCys: 1.211 ± 0.021
LysAsp: 3.693 ± 0.03
LysGlu: 4.575 ± 0.049
LysPhe: 2.437 ± 0.019
LysGly: 2.647 ± 0.032
LysHis: 1.43 ± 0.02
LysIle: 4.785 ± 0.034
LysLys: 5.185 ± 0.051
LysLeu: 6.611 ± 0.044
LysMet: 1.524 ± 0.018
LysAsn: 3.662 ± 0.026
LysPro: 2.392 ± 0.023
LysGln: 2.781 ± 0.025
LysArg: 3.615 ± 0.028
LysSer: 5.694 ± 0.039
LysThr: 3.457 ± 0.026
LysVal: 3.594 ± 0.029
LysTrp: 0.701 ± 0.011
LysTyr: 2.451 ± 0.024
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.025 ± 0.039
LeuCys: 1.875 ± 0.022
LeuAsp: 4.834 ± 0.031
LeuGlu: 5.186 ± 0.048
LeuPhe: 3.585 ± 0.031
LeuGly: 4.2 ± 0.035
LeuHis: 2.382 ± 0.022
LeuIle: 6.244 ± 0.041
LeuLys: 6.383 ± 0.039
LeuLeu: 9.298 ± 0.061
LeuMet: 2.007 ± 0.02
LeuAsn: 4.934 ± 0.033
LeuPro: 4.318 ± 0.032
LeuGln: 4.783 ± 0.04
LeuArg: 4.78 ± 0.035
LeuSer: 8.139 ± 0.045
LeuThr: 5.343 ± 0.034
LeuVal: 4.789 ± 0.031
LeuTrp: 0.948 ± 0.014
LeuTyr: 3.157 ± 0.024
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.809 ± 0.022
MetCys: 0.312 ± 0.007
MetAsp: 1.218 ± 0.014
MetGlu: 1.452 ± 0.018
MetPhe: 0.727 ± 0.012
MetGly: 0.88 ± 0.013
MetHis: 0.458 ± 0.01
MetIle: 1.524 ± 0.017
MetLys: 1.627 ± 0.018
MetLeu: 1.98 ± 0.02
MetMet: 0.591 ± 0.01
MetAsn: 1.158 ± 0.014
MetPro: 0.935 ± 0.013
MetGln: 1.027 ± 0.013
MetArg: 0.894 ± 0.012
MetSer: 1.597 ± 0.018
MetThr: 1.266 ± 0.017
MetVal: 1.15 ± 0.013
MetTrp: 0.198 ± 0.006
MetTyr: 0.743 ± 0.018
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.181 ± 0.025
AsnCys: 1.345 ± 0.029
AsnAsp: 3.557 ± 0.029
AsnGlu: 3.505 ± 0.028
AsnPhe: 2.337 ± 0.021
AsnGly: 3.251 ± 0.042
AsnHis: 1.617 ± 0.025
AsnIle: 4.427 ± 0.034
AsnLys: 3.713 ± 0.029
AsnLeu: 5.438 ± 0.036
AsnMet: 1.171 ± 0.013
AsnAsn: 4.033 ± 0.04
AsnPro: 2.464 ± 0.023
AsnGln: 2.822 ± 0.025
AsnArg: 2.939 ± 0.022
AsnSer: 5.155 ± 0.037
AsnThr: 3.191 ± 0.026
AsnVal: 3.389 ± 0.025
AsnTrp: 0.616 ± 0.01
AsnTyr: 2.366 ± 0.022
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.169 ± 0.027
ProCys: 0.792 ± 0.031
ProAsp: 2.078 ± 0.022
ProGlu: 2.144 ± 0.021
ProPhe: 1.65 ± 0.017
ProGly: 1.831 ± 0.031
ProHis: 0.799 ± 0.012
ProIle: 2.738 ± 0.023
ProLys: 2.29 ± 0.021
ProLeu: 3.393 ± 0.027
ProMet: 0.742 ± 0.011
ProAsn: 2.244 ± 0.025
ProPro: 2.182 ± 0.038
ProGln: 1.491 ± 0.015
ProArg: 1.657 ± 0.021
ProSer: 3.809 ± 0.036
ProThr: 2.596 ± 0.032
ProVal: 2.392 ± 0.02
ProTrp: 0.407 ± 0.009
ProTyr: 1.467 ± 0.017
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.496 ± 0.027
GlnCys: 0.91 ± 0.024
GlnAsp: 2.291 ± 0.023
GlnGlu: 2.659 ± 0.038
GlnPhe: 1.806 ± 0.019
GlnGly: 1.895 ± 0.026
GlnHis: 1.073 ± 0.015
GlnIle: 2.853 ± 0.026
GlnLys: 2.551 ± 0.025
GlnLeu: 4.558 ± 0.04
GlnMet: 0.914 ± 0.014
GlnAsn: 2.47 ± 0.025
GlnPro: 1.72 ± 0.02
GlnGln: 2.708 ± 0.042
GlnArg: 2.187 ± 0.025
GlnSer: 3.69 ± 0.029
GlnThr: 2.117 ± 0.019
GlnVal: 2.289 ± 0.021
GlnTrp: 0.455 ± 0.009
GlnTyr: 1.572 ± 0.016
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.355 ± 0.023
ArgCys: 1.041 ± 0.017
ArgAsp: 2.465 ± 0.023
ArgGlu: 2.669 ± 0.028
ArgPhe: 1.935 ± 0.017
ArgGly: 2.067 ± 0.022
ArgHis: 1.29 ± 0.018
ArgIle: 3.407 ± 0.024
ArgLys: 3.961 ± 0.032
ArgLeu: 4.799 ± 0.032
ArgMet: 1.055 ± 0.013
ArgAsn: 3.16 ± 0.027
ArgPro: 1.755 ± 0.02
ArgGln: 2.37 ± 0.023
ArgArg: 3.267 ± 0.037
ArgSer: 4.098 ± 0.037
ArgThr: 2.318 ± 0.022
ArgVal: 2.292 ± 0.021
ArgTrp: 0.574 ± 0.011
ArgTyr: 2.074 ± 0.019
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.549 ± 0.027
SerCys: 1.752 ± 0.028
SerAsp: 4.742 ± 0.03
SerGlu: 4.334 ± 0.035
SerPhe: 3.245 ± 0.028
SerGly: 4.129 ± 0.04
SerHis: 1.935 ± 0.02
SerIle: 6.091 ± 0.039
SerLys: 5.685 ± 0.038
SerLeu: 7.752 ± 0.049
SerMet: 1.847 ± 0.022
SerAsn: 5.791 ± 0.045
SerPro: 3.452 ± 0.036
SerGln: 3.549 ± 0.029
SerArg: 3.936 ± 0.033
SerSer: 9.122 ± 0.085
SerThr: 5.474 ± 0.054
SerVal: 4.728 ± 0.033
SerTrp: 0.864 ± 0.013
SerTyr: 3.219 ± 0.03
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 3.771 ± 0.029
ThrCys: 1.324 ± 0.034
ThrAsp: 3.02 ± 0.027
ThrGlu: 3.126 ± 0.028
ThrPhe: 2.369 ± 0.021
ThrGly: 3.05 ± 0.041
ThrHis: 1.009 ± 0.013
ThrIle: 4.287 ± 0.031
ThrLys: 3.346 ± 0.028
ThrLeu: 5.194 ± 0.031
ThrMet: 1.151 ± 0.014
ThrAsn: 3.34 ± 0.031
ThrPro: 2.474 ± 0.038
ThrGln: 1.939 ± 0.019
ThrArg: 2.261 ± 0.02
ThrSer: 5.375 ± 0.049
ThrThr: 4.142 ± 0.049
ThrVal: 3.682 ± 0.027
ThrTrp: 0.63 ± 0.011
ThrTyr: 1.969 ± 0.032
ThrXaa: 0.0 ± 0.0

Val
ValAla: 3.849 ± 0.03
ValCys: 1.112 ± 0.02
ValAsp: 3.285 ± 0.028
ValGlu: 3.089 ± 0.03
ValPhe: 2.201 ± 0.024
ValGly: 2.781 ± 0.021
ValHis: 1.183 ± 0.015
ValIle: 4.364 ± 0.029
ValLys: 3.634 ± 0.027
ValLeu: 4.952 ± 0.033
ValMet: 1.322 ± 0.015
ValAsn: 3.259 ± 0.027
ValPro: 2.152 ± 0.022
ValGln: 2.14 ± 0.019
ValArg: 2.431 ± 0.022
ValSer: 4.305 ± 0.029
ValThr: 3.609 ± 0.028
ValVal: 3.574 ± 0.026
ValTrp: 0.599 ± 0.011
ValTyr: 2.036 ± 0.024
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.47 ± 0.009
TrpCys: 0.213 ± 0.006
TrpAsp: 0.532 ± 0.011
TrpGlu: 0.468 ± 0.01
TrpPhe: 0.441 ± 0.011
TrpGly: 0.478 ± 0.015
TrpHis: 0.26 ± 0.006
TrpIle: 0.903 ± 0.013
TrpLys: 0.835 ± 0.014
TrpLeu: 1.208 ± 0.015
TrpMet: 0.273 ± 0.006
TrpAsn: 0.702 ± 0.013
TrpPro: 0.381 ± 0.008
TrpGln: 0.484 ± 0.01
TrpArg: 0.552 ± 0.009
TrpSer: 0.84 ± 0.014
TrpThr: 0.613 ± 0.011
TrpVal: 0.446 ± 0.009
TrpTrp: 0.169 ± 0.005
TrpTyr: 0.403 ± 0.011
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.923 ± 0.019
TyrCys: 1.009 ± 0.031
TyrAsp: 2.172 ± 0.021
TyrGlu: 1.932 ± 0.026
TyrPhe: 1.817 ± 0.019
TyrGly: 2.052 ± 0.02
TyrHis: 1.121 ± 0.015
TyrIle: 2.4 ± 0.018
TyrLys: 2.115 ± 0.019
TyrLeu: 3.587 ± 0.028
TyrMet: 0.719 ± 0.011
TyrAsn: 2.341 ± 0.031
TyrPro: 1.486 ± 0.019
TyrGln: 1.807 ± 0.019
TyrArg: 2.09 ± 0.02
TyrSer: 2.928 ± 0.027
TyrThr: 1.939 ± 0.032
TyrVal: 1.987 ± 0.017
TyrTrp: 0.434 ± 0.008
TyrTyr: 1.814 ± 0.026
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.014 ± 0.004

Statistics based on 12150 proteins (6586016 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
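
A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates serves as the error. This is only an illustration under stated assumptions. The function names, the use of the standard deviation as the error measure, the fixed random seed, and the toy sequences are all choices made for the example and are not taken from the database's documentation.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`.

    Assumption: each replicate resamples whole proteins with replacement,
    keeping the number of proteins equal to the original set.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteins (hypothetical, one-letter codes).
toy = ["MKLLA", "AKLLK", "LLLL", "MKAA"]
mean, err = bootstrap_error(toy, "LL")
print(f"LL: {mean:.3f} ± {err:.3f} per mille")
```

Resampling proteins rather than individual residue pairs keeps the pairs within a protein together, so the error reflects variation between proteins.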

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski