Amino acid dipepetide frequency for Amphimedon queenslandica (Sponge)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.035AlaAla: 4.035 ± 0.025
1.166AlaCys: 1.166 ± 0.014
2.609AlaAsp: 2.609 ± 0.016
2.931AlaGlu: 2.931 ± 0.018
1.892AlaPhe: 1.892 ± 0.014
3.227AlaGly: 3.227 ± 0.021
1.078AlaHis: 1.078 ± 0.01
3.596AlaIle: 3.596 ± 0.021
3.039AlaLys: 3.039 ± 0.022
5.169AlaLeu: 5.169 ± 0.022
1.183AlaMet: 1.183 ± 0.01
2.46AlaAsn: 2.46 ± 0.014
2.37AlaPro: 2.37 ± 0.018
1.868AlaGln: 1.868 ± 0.013
2.167AlaArg: 2.167 ± 0.014
4.88AlaSer: 4.88 ± 0.025
4.014AlaThr: 4.014 ± 0.024
4.105AlaVal: 4.105 ± 0.019
0.49AlaTrp: 0.49 ± 0.006
1.616AlaTyr: 1.616 ± 0.011
0.002AlaXaa: 0.002 ± 0.0
Cys
0.862CysAla: 0.862 ± 0.009
0.638CysCys: 0.638 ± 0.009
1.303CysAsp: 1.303 ± 0.016
1.033CysGlu: 1.033 ± 0.011
0.891CysPhe: 0.891 ± 0.009
1.298CysGly: 1.298 ± 0.012
0.723CysHis: 0.723 ± 0.008
1.395CysIle: 1.395 ± 0.012
1.138CysLys: 1.138 ± 0.011
2.223CysLeu: 2.223 ± 0.017
0.413CysMet: 0.413 ± 0.006
1.265CysAsn: 1.265 ± 0.017
1.195CysPro: 1.195 ± 0.015
0.997CysGln: 0.997 ± 0.013
1.031CysArg: 1.031 ± 0.01
2.324CysSer: 2.324 ± 0.022
1.592CysThr: 1.592 ± 0.018
1.205CysVal: 1.205 ± 0.01
0.251CysTrp: 0.251 ± 0.004
0.839CysTyr: 0.839 ± 0.009
0.001CysXaa: 0.001 ± 0.0
Asp
2.645AspAla: 2.645 ± 0.017
1.251AspCys: 1.251 ± 0.011
4.022AspAsp: 4.022 ± 0.027
3.936AspGlu: 3.936 ± 0.025
2.006AspPhe: 2.006 ± 0.014
3.361AspGly: 3.361 ± 0.02
1.313AspHis: 1.313 ± 0.011
4.081AspIle: 4.081 ± 0.021
3.214AspLys: 3.214 ± 0.019
4.424AspLeu: 4.424 ± 0.024
1.086AspMet: 1.086 ± 0.012
2.833AspAsn: 2.833 ± 0.015
2.578AspPro: 2.578 ± 0.017
1.828AspGln: 1.828 ± 0.013
2.062AspArg: 2.062 ± 0.014
4.461AspSer: 4.461 ± 0.019
3.417AspThr: 3.417 ± 0.023
3.549AspVal: 3.549 ± 0.018
0.616AspTrp: 0.616 ± 0.007
1.977AspTyr: 1.977 ± 0.012
0.001AspXaa: 0.001 ± 0.0
Glu
3.763GluAla: 3.763 ± 0.022
1.241GluCys: 1.241 ± 0.016
3.647GluAsp: 3.647 ± 0.023
6.59GluGlu: 6.59 ± 0.048
1.996GluPhe: 1.996 ± 0.014
3.684GluGly: 3.684 ± 0.023
1.224GluHis: 1.224 ± 0.01
3.698GluIle: 3.698 ± 0.017
4.609GluLys: 4.609 ± 0.031
5.98GluLeu: 5.98 ± 0.029
1.451GluMet: 1.451 ± 0.013
2.649GluAsn: 2.649 ± 0.015
1.962GluPro: 1.962 ± 0.013
2.337GluGln: 2.337 ± 0.018
3.424GluArg: 3.424 ± 0.022
4.667GluSer: 4.667 ± 0.023
3.402GluThr: 3.402 ± 0.019
3.905GluVal: 3.905 ± 0.019
0.7GluTrp: 0.7 ± 0.008
2.091GluTyr: 2.091 ± 0.013
0.002GluXaa: 0.002 ± 0.0
Phe
1.751PheAla: 1.751 ± 0.012
0.92PheCys: 0.92 ± 0.008
2.174PheAsp: 2.174 ± 0.014
1.956PheGlu: 1.956 ± 0.013
1.614PhePhe: 1.614 ± 0.013
2.276PheGly: 2.276 ± 0.014
0.944PheHis: 0.944 ± 0.008
2.809PheIle: 2.809 ± 0.016
2.173PheLys: 2.173 ± 0.014
3.669PheLeu: 3.669 ± 0.018
0.847PheMet: 0.847 ± 0.009
2.138PheAsn: 2.138 ± 0.016
1.588PhePro: 1.588 ± 0.011
1.359PheGln: 1.359 ± 0.011
1.456PheArg: 1.456 ± 0.011
3.56PheSer: 3.56 ± 0.02
2.497PheThr: 2.497 ± 0.019
2.269PheVal: 2.269 ± 0.014
0.414PheTrp: 0.414 ± 0.007
1.41PheTyr: 1.41 ± 0.011
0.001PheXaa: 0.001 ± 0.0
Gly
3.111GlyAla: 3.111 ± 0.022
1.074GlyCys: 1.074 ± 0.012
2.866GlyAsp: 2.866 ± 0.019
3.151GlyGlu: 3.151 ± 0.016
2.066GlyPhe: 2.066 ± 0.012
5.065GlyGly: 5.065 ± 0.044
1.55GlyHis: 1.55 ± 0.016
3.498GlyIle: 3.498 ± 0.017
3.05GlyLys: 3.05 ± 0.018
4.99GlyLeu: 4.99 ± 0.023
1.148GlyMet: 1.148 ± 0.01
2.722GlyAsn: 2.722 ± 0.022
2.343GlyPro: 2.343 ± 0.024
2.088GlyGln: 2.088 ± 0.015
2.829GlyArg: 2.829 ± 0.017
5.209GlySer: 5.209 ± 0.031
3.422GlyThr: 3.422 ± 0.023
3.769GlyVal: 3.769 ± 0.022
0.733GlyTrp: 0.733 ± 0.01
2.166GlyTyr: 2.166 ± 0.021
0.002GlyXaa: 0.002 ± 0.0
His
0.981HisAla: 0.981 ± 0.008
0.832HisCys: 0.832 ± 0.01
1.129HisAsp: 1.129 ± 0.011
1.2HisGlu: 1.2 ± 0.01
1.145HisPhe: 1.145 ± 0.009
1.187HisGly: 1.187 ± 0.01
0.969HisHis: 0.969 ± 0.013
1.649HisIle: 1.649 ± 0.013
1.401HisLys: 1.401 ± 0.012
2.535HisLeu: 2.535 ± 0.016
0.517HisMet: 0.517 ± 0.006
1.211HisAsn: 1.211 ± 0.011
1.182HisPro: 1.182 ± 0.011
1.132HisGln: 1.132 ± 0.011
1.117HisArg: 1.117 ± 0.011
2.387HisSer: 2.387 ± 0.015
1.384HisThr: 1.384 ± 0.014
1.45HisVal: 1.45 ± 0.01
0.321HisTrp: 0.321 ± 0.005
1.204HisTyr: 1.204 ± 0.01
0.001HisXaa: 0.001 ± 0.0
Ile
3.71IleAla: 3.71 ± 0.017
1.262IleCys: 1.262 ± 0.014
3.861IleAsp: 3.861 ± 0.021
3.938IleGlu: 3.938 ± 0.021
2.317IlePhe: 2.317 ± 0.016
3.333IleGly: 3.333 ± 0.019
1.481IleHis: 1.481 ± 0.011
4.438IleIle: 4.438 ± 0.027
3.984IleLys: 3.984 ± 0.021
5.667IleLeu: 5.667 ± 0.025
1.282IleMet: 1.282 ± 0.011
3.789IleAsn: 3.789 ± 0.024
3.015IlePro: 3.015 ± 0.017
2.46IleGln: 2.46 ± 0.014
2.545IleArg: 2.545 ± 0.015
5.524IleSer: 5.524 ± 0.024
4.292IleThr: 4.292 ± 0.031
4.117IleVal: 4.117 ± 0.017
0.568IleTrp: 0.568 ± 0.007
1.848IleTyr: 1.848 ± 0.013
0.003IleXaa: 0.003 ± 0.0
Lys
3.271LysAla: 3.271 ± 0.019
1.258LysCys: 1.258 ± 0.011
3.527LysAsp: 3.527 ± 0.022
5.558LysGlu: 5.558 ± 0.039
2.02LysPhe: 2.02 ± 0.012
3.266LysGly: 3.266 ± 0.03
1.395LysHis: 1.395 ± 0.011
3.274LysIle: 3.274 ± 0.019
4.981LysLys: 4.981 ± 0.039
5.654LysLeu: 5.654 ± 0.027
1.415LysMet: 1.415 ± 0.01
2.5LysAsn: 2.5 ± 0.016
2.315LysPro: 2.315 ± 0.016
2.386LysGln: 2.386 ± 0.016
3.468LysArg: 3.468 ± 0.021
4.287LysSer: 4.287 ± 0.02
3.086LysThr: 3.086 ± 0.019
3.453LysVal: 3.453 ± 0.021
0.737LysTrp: 0.737 ± 0.008
2.042LysTyr: 2.042 ± 0.014
0.002LysXaa: 0.002 ± 0.0
Leu
5.089LeuAla: 5.089 ± 0.024
1.991LeuCys: 1.991 ± 0.013
4.478LeuAsp: 4.478 ± 0.022
5.526LeuGlu: 5.526 ± 0.027
3.802LeuPhe: 3.802 ± 0.019
4.107LeuGly: 4.107 ± 0.022
2.807LeuHis: 2.807 ± 0.021
5.821LeuIle: 5.821 ± 0.027
6.23LeuLys: 6.23 ± 0.035
10.779LeuLeu: 10.779 ± 0.047
2.249LeuMet: 2.249 ± 0.014
4.37LeuAsn: 4.37 ± 0.025
4.937LeuPro: 4.937 ± 0.027
4.425LeuGln: 4.425 ± 0.023
4.367LeuArg: 4.367 ± 0.021
9.196LeuSer: 9.196 ± 0.032
5.851LeuThr: 5.851 ± 0.04
5.935LeuVal: 5.935 ± 0.022
0.963LeuTrp: 0.963 ± 0.01
3.213LeuTyr: 3.213 ± 0.018
0.003LeuXaa: 0.003 ± 0.0
Met
1.6MetAla: 1.6 ± 0.012
0.391MetCys: 0.391 ± 0.005
1.076MetAsp: 1.076 ± 0.01
1.61MetGlu: 1.61 ± 0.012
0.718MetPhe: 0.718 ± 0.007
0.908MetGly: 0.908 ± 0.01
0.413MetHis: 0.413 ± 0.005
1.406MetIle: 1.406 ± 0.01
1.723MetLys: 1.723 ± 0.014
1.879MetLeu: 1.879 ± 0.014
0.637MetMet: 0.637 ± 0.007
1.032MetAsn: 1.032 ± 0.01
0.723MetPro: 0.723 ± 0.008
0.64MetGln: 0.64 ± 0.007
0.974MetArg: 0.974 ± 0.008
1.929MetSer: 1.929 ± 0.011
1.32MetThr: 1.32 ± 0.01
1.104MetVal: 1.104 ± 0.009
0.221MetTrp: 0.221 ± 0.003
0.706MetTyr: 0.706 ± 0.007
0.001MetXaa: 0.001 ± 0.0
Asn
2.48AsnAla: 2.48 ± 0.019
1.327AsnCys: 1.327 ± 0.016
3.038AsnAsp: 3.038 ± 0.019
3.535AsnGlu: 3.535 ± 0.022
1.69AsnPhe: 1.69 ± 0.013
3.675AsnGly: 3.675 ± 0.028
1.16AsnHis: 1.16 ± 0.01
3.653AsnIle: 3.653 ± 0.021
2.913AsnLys: 2.913 ± 0.015
3.696AsnLeu: 3.696 ± 0.02
1.0AsnMet: 1.0 ± 0.007
3.198AsnAsn: 3.198 ± 0.023
2.31AsnPro: 2.31 ± 0.016
1.584AsnGln: 1.584 ± 0.011
1.881AsnArg: 1.881 ± 0.012
3.806AsnSer: 3.806 ± 0.022
3.161AsnThr: 3.161 ± 0.022
3.324AsnVal: 3.324 ± 0.02
0.561AsnTrp: 0.561 ± 0.006
1.705AsnTyr: 1.705 ± 0.012
0.001AsnXaa: 0.001 ± 0.0
Pro
2.507ProAla: 2.507 ± 0.017
0.844ProCys: 0.844 ± 0.012
2.504ProAsp: 2.504 ± 0.016
2.326ProGlu: 2.326 ± 0.016
1.731ProPhe: 1.731 ± 0.012
2.467ProGly: 2.467 ± 0.026
1.224ProHis: 1.224 ± 0.012
2.539ProIle: 2.539 ± 0.014
2.038ProLys: 2.038 ± 0.015
4.857ProLeu: 4.857 ± 0.025
0.81ProMet: 0.81 ± 0.008
2.188ProAsn: 2.188 ± 0.016
4.65ProPro: 4.65 ± 0.059
1.848ProGln: 1.848 ± 0.014
1.696ProArg: 1.696 ± 0.013
4.964ProSer: 4.964 ± 0.031
3.21ProThr: 3.21 ± 0.018
3.572ProVal: 3.572 ± 0.022
0.362ProTrp: 0.362 ± 0.006
1.348ProTyr: 1.348 ± 0.012
0.002ProXaa: 0.002 ± 0.0
Gln
1.884GlnAla: 1.884 ± 0.013
1.083GlnCys: 1.083 ± 0.012
1.782GlnAsp: 1.782 ± 0.015
2.763GlnGlu: 2.763 ± 0.02
1.651GlnPhe: 1.651 ± 0.012
1.71GlnGly: 1.71 ± 0.014
0.97GlnHis: 0.97 ± 0.008
1.968GlnIle: 1.968 ± 0.015
2.279GlnLys: 2.279 ± 0.017
4.455GlnLeu: 4.455 ± 0.022
0.834GlnMet: 0.834 ± 0.009
1.581GlnAsn: 1.581 ± 0.013
1.7GlnPro: 1.7 ± 0.014
2.19GlnGln: 2.19 ± 0.02
1.854GlnArg: 1.854 ± 0.015
3.426GlnSer: 3.426 ± 0.02
1.96GlnThr: 1.96 ± 0.014
2.392GlnVal: 2.392 ± 0.015
0.59GlnTrp: 0.59 ± 0.008
1.747GlnTyr: 1.747 ± 0.012
0.002GlnXaa: 0.002 ± 0.0
Arg
2.169ArgAla: 2.169 ± 0.013
0.874ArgCys: 0.874 ± 0.009
2.327ArgAsp: 2.327 ± 0.014
3.144ArgGlu: 3.144 ± 0.023
1.581ArgPhe: 1.581 ± 0.01
2.639ArgGly: 2.639 ± 0.021
1.203ArgHis: 1.203 ± 0.01
2.527ArgIle: 2.527 ± 0.013
3.253ArgLys: 3.253 ± 0.022
4.222ArgLeu: 4.222 ± 0.021
0.985ArgMet: 0.985 ± 0.008
2.161ArgAsn: 2.161 ± 0.014
1.908ArgPro: 1.908 ± 0.014
1.921ArgGln: 1.921 ± 0.014
2.862ArgArg: 2.862 ± 0.021
3.461ArgSer: 3.461 ± 0.019
2.208ArgThr: 2.208 ± 0.015
2.71ArgVal: 2.71 ± 0.015
0.534ArgTrp: 0.534 ± 0.006
1.561ArgTyr: 1.561 ± 0.011
0.002ArgXaa: 0.002 ± 0.0
Ser
4.262SerAla: 4.262 ± 0.024
2.105SerCys: 2.105 ± 0.018
4.828SerAsp: 4.828 ± 0.024
4.252SerGlu: 4.252 ± 0.024
3.876SerPhe: 3.876 ± 0.024
4.834SerGly: 4.834 ± 0.026
2.297SerHis: 2.297 ± 0.015
5.834SerIle: 5.834 ± 0.026
4.547SerLys: 4.547 ± 0.021
9.6SerLeu: 9.6 ± 0.035
1.693SerMet: 1.693 ± 0.011
4.616SerAsn: 4.616 ± 0.024
4.642SerPro: 4.642 ± 0.029
3.302SerGln: 3.302 ± 0.017
3.446SerArg: 3.446 ± 0.018
10.963SerSer: 10.963 ± 0.057
6.245SerThr: 6.245 ± 0.037
6.139SerVal: 6.139 ± 0.03
0.879SerTrp: 0.879 ± 0.009
2.962SerTyr: 2.962 ± 0.02
0.004SerXaa: 0.004 ± 0.001
Thr
3.903ThrAla: 3.903 ± 0.022
1.541ThrCys: 1.541 ± 0.019
3.654ThrAsp: 3.654 ± 0.022
3.547ThrGlu: 3.547 ± 0.02
2.238ThrPhe: 2.238 ± 0.023
4.209ThrGly: 4.209 ± 0.027
1.318ThrHis: 1.318 ± 0.01
4.087ThrIle: 4.087 ± 0.025
2.988ThrLys: 2.988 ± 0.017
5.854ThrLeu: 5.854 ± 0.037
1.097ThrMet: 1.097 ± 0.011
3.337ThrAsn: 3.337 ± 0.02
3.38ThrPro: 3.38 ± 0.021
2.081ThrGln: 2.081 ± 0.021
2.358ThrArg: 2.358 ± 0.012
6.105ThrSer: 6.105 ± 0.033
4.492ThrThr: 4.492 ± 0.031
4.8ThrVal: 4.8 ± 0.03
0.592ThrTrp: 0.592 ± 0.007
1.822ThrTyr: 1.822 ± 0.015
0.003ThrXaa: 0.003 ± 0.0
Val
3.802ValAla: 3.802 ± 0.016
1.488ValCys: 1.488 ± 0.018
3.342ValAsp: 3.342 ± 0.02
3.589ValGlu: 3.589 ± 0.02
2.53ValPhe: 2.53 ± 0.016
3.146ValGly: 3.146 ± 0.016
1.581ValHis: 1.581 ± 0.013
4.217ValIle: 4.217 ± 0.021
3.681ValLys: 3.681 ± 0.018
6.176ValLeu: 6.176 ± 0.026
1.435ValMet: 1.435 ± 0.01
3.178ValAsn: 3.178 ± 0.02
3.089ValPro: 3.089 ± 0.017
2.511ValGln: 2.511 ± 0.016
2.575ValArg: 2.575 ± 0.014
6.051ValSer: 6.051 ± 0.03
4.788ValThr: 4.788 ± 0.029
4.159ValVal: 4.159 ± 0.023
0.698ValTrp: 0.698 ± 0.008
2.302ValTyr: 2.302 ± 0.015
0.003ValXaa: 0.003 ± 0.0
Trp
0.463TrpAla: 0.463 ± 0.008
0.228TrpCys: 0.228 ± 0.004
0.621TrpAsp: 0.621 ± 0.008
0.588TrpGlu: 0.588 ± 0.007
0.498TrpPhe: 0.498 ± 0.007
0.553TrpGly: 0.553 ± 0.01
0.265TrpHis: 0.265 ± 0.004
0.742TrpIle: 0.742 ± 0.008
0.772TrpLys: 0.772 ± 0.008
1.094TrpLeu: 1.094 ± 0.009
0.227TrpMet: 0.227 ± 0.004
0.578TrpAsn: 0.578 ± 0.008
0.383TrpPro: 0.383 ± 0.005
0.34TrpGln: 0.34 ± 0.006
0.62TrpArg: 0.62 ± 0.007
0.99TrpSer: 0.99 ± 0.01
0.763TrpThr: 0.763 ± 0.01
0.496TrpVal: 0.496 ± 0.006
0.194TrpTrp: 0.194 ± 0.004
0.457TrpTyr: 0.457 ± 0.006
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.608TyrAla: 1.608 ± 0.013
1.102TyrCys: 1.102 ± 0.011
1.869TyrAsp: 1.869 ± 0.013
1.846TyrGlu: 1.846 ± 0.014
1.564TyrPhe: 1.564 ± 0.011
1.907TyrGly: 1.907 ± 0.017
1.017TyrHis: 1.017 ± 0.01
2.201TyrIle: 2.201 ± 0.013
1.881TyrLys: 1.881 ± 0.013
3.215TyrLeu: 3.215 ± 0.016
0.696TyrMet: 0.696 ± 0.008
1.889TyrAsn: 1.889 ± 0.014
1.413TyrPro: 1.413 ± 0.013
1.522TyrGln: 1.522 ± 0.012
1.517TyrArg: 1.517 ± 0.011
3.138TyrSer: 3.138 ± 0.02
2.299TyrThr: 2.299 ± 0.019
1.885TyrVal: 1.885 ± 0.012
0.446TyrTrp: 0.446 ± 0.006
1.68TyrTyr: 1.68 ± 0.015
0.002TyrXaa: 0.002 ± 0.0
Xaa
0.002XaaAla: 0.002 ± 0.0
0.001XaaCys: 0.001 ± 0.0
0.001XaaAsp: 0.001 ± 0.0
0.002XaaGlu: 0.002 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.002XaaGly: 0.002 ± 0.0
0.002XaaHis: 0.002 ± 0.0
0.003XaaIle: 0.003 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.004XaaLeu: 0.004 ± 0.001
0.004XaaMet: 0.004 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.001XaaPro: 0.001 ± 0.0
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.002XaaSer: 0.002 ± 0.0
0.002XaaThr: 0.002 ± 0.0
0.002XaaVal: 0.002 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.577XaaXaa: 0.577 ± 0.066
Statistics based on 43437 proteins (14716634 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski