Amino acid dipepetide frequency for Ooceraea biroi (Clonal raider ant) (Cerapachys biroi)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.748AlaAla: 5.748 ± 0.052
1.309AlaCys: 1.309 ± 0.029
3.195AlaAsp: 3.195 ± 0.027
4.156AlaGlu: 4.156 ± 0.037
2.188AlaPhe: 2.188 ± 0.02
3.629AlaGly: 3.629 ± 0.031
1.491AlaHis: 1.491 ± 0.016
3.657AlaIle: 3.657 ± 0.028
3.699AlaLys: 3.699 ± 0.03
5.832AlaLeu: 5.832 ± 0.042
1.619AlaMet: 1.619 ± 0.016
2.879AlaAsn: 2.879 ± 0.024
2.927AlaPro: 2.927 ± 0.027
2.496AlaGln: 2.496 ± 0.026
4.051AlaArg: 4.051 ± 0.029
5.347AlaSer: 5.347 ± 0.037
4.299AlaThr: 4.299 ± 0.033
4.274AlaVal: 4.274 ± 0.029
0.659AlaTrp: 0.659 ± 0.011
1.705AlaTyr: 1.705 ± 0.018
0.001AlaXaa: 0.001 ± 0.0
Cys
1.232CysAla: 1.232 ± 0.018
0.501CysCys: 0.501 ± 0.012
1.176CysAsp: 1.176 ± 0.02
1.188CysGlu: 1.188 ± 0.022
0.749CysPhe: 0.749 ± 0.014
1.349CysGly: 1.349 ± 0.035
0.535CysHis: 0.535 ± 0.011
1.205CysIle: 1.205 ± 0.027
1.162CysLys: 1.162 ± 0.018
1.824CysLeu: 1.824 ± 0.028
0.459CysMet: 0.459 ± 0.013
1.024CysAsn: 1.024 ± 0.019
1.019CysPro: 1.019 ± 0.036
0.772CysGln: 0.772 ± 0.018
1.249CysArg: 1.249 ± 0.035
1.581CysSer: 1.581 ± 0.03
1.215CysThr: 1.215 ± 0.026
1.298CysVal: 1.298 ± 0.03
0.244CysTrp: 0.244 ± 0.007
0.603CysTyr: 0.603 ± 0.011
0.0CysXaa: 0.0 ± 0.0
Asp
3.452AspAla: 3.452 ± 0.027
1.058AspCys: 1.058 ± 0.023
3.797AspAsp: 3.797 ± 0.038
4.19AspGlu: 4.19 ± 0.029
2.038AspPhe: 2.038 ± 0.02
3.13AspGly: 3.13 ± 0.034
1.189AspHis: 1.189 ± 0.014
3.404AspIle: 3.404 ± 0.026
3.154AspLys: 3.154 ± 0.029
4.686AspLeu: 4.686 ± 0.037
1.249AspMet: 1.249 ± 0.013
2.687AspAsn: 2.687 ± 0.025
2.361AspPro: 2.361 ± 0.035
1.647AspGln: 1.647 ± 0.019
2.957AspArg: 2.957 ± 0.032
4.192AspSer: 4.192 ± 0.031
3.045AspThr: 3.045 ± 0.023
3.782AspVal: 3.782 ± 0.026
0.624AspTrp: 0.624 ± 0.011
1.786AspTyr: 1.786 ± 0.016
0.001AspXaa: 0.001 ± 0.0
Glu
4.114GluAla: 4.114 ± 0.036
1.274GluCys: 1.274 ± 0.037
4.107GluAsp: 4.107 ± 0.031
6.34GluGlu: 6.34 ± 0.072
2.065GluPhe: 2.065 ± 0.022
3.006GluGly: 3.006 ± 0.032
1.581GluHis: 1.581 ± 0.019
3.923GluIle: 3.923 ± 0.039
4.996GluLys: 4.996 ± 0.05
5.586GluLeu: 5.586 ± 0.046
1.675GluMet: 1.675 ± 0.017
3.674GluAsn: 3.674 ± 0.028
2.394GluPro: 2.394 ± 0.025
2.747GluGln: 2.747 ± 0.026
4.415GluArg: 4.415 ± 0.037
4.648GluSer: 4.648 ± 0.038
3.846GluThr: 3.846 ± 0.035
3.617GluVal: 3.617 ± 0.03
0.687GluTrp: 0.687 ± 0.011
1.956GluTyr: 1.956 ± 0.019
0.001GluXaa: 0.001 ± 0.0
Phe
2.255PheAla: 2.255 ± 0.02
0.822PheCys: 0.822 ± 0.012
1.952PheAsp: 1.952 ± 0.019
2.05PheGlu: 2.05 ± 0.019
1.523PhePhe: 1.523 ± 0.026
2.088PheGly: 2.088 ± 0.023
1.001PheHis: 1.001 ± 0.012
2.037PheIle: 2.037 ± 0.023
1.902PheLys: 1.902 ± 0.021
3.626PheLeu: 3.626 ± 0.032
0.816PheMet: 0.816 ± 0.011
1.646PheAsn: 1.646 ± 0.018
1.666PhePro: 1.666 ± 0.018
1.395PheGln: 1.395 ± 0.016
1.915PheArg: 1.915 ± 0.019
2.793PheSer: 2.793 ± 0.021
2.103PheThr: 2.103 ± 0.02
2.435PheVal: 2.435 ± 0.02
0.451PheTrp: 0.451 ± 0.009
1.27PheTyr: 1.27 ± 0.015
0.0PheXaa: 0.0 ± 0.0
Gly
3.337GlyAla: 3.337 ± 0.03
1.062GlyCys: 1.062 ± 0.021
2.833GlyAsp: 2.833 ± 0.029
3.189GlyGlu: 3.189 ± 0.033
2.01GlyPhe: 2.01 ± 0.025
4.248GlyGly: 4.248 ± 0.062
1.409GlyHis: 1.409 ± 0.02
3.122GlyIle: 3.122 ± 0.026
3.326GlyLys: 3.326 ± 0.029
4.356GlyLeu: 4.356 ± 0.033
1.238GlyMet: 1.238 ± 0.017
2.643GlyAsn: 2.643 ± 0.026
2.358GlyPro: 2.358 ± 0.042
2.006GlyGln: 2.006 ± 0.023
3.344GlyArg: 3.344 ± 0.03
4.47GlySer: 4.47 ± 0.043
3.275GlyThr: 3.275 ± 0.029
3.254GlyVal: 3.254 ± 0.026
0.66GlyTrp: 0.66 ± 0.012
1.912GlyTyr: 1.912 ± 0.027
0.001GlyXaa: 0.001 ± 0.0
His
1.53HisAla: 1.53 ± 0.017
0.611HisCys: 0.611 ± 0.013
1.198HisAsp: 1.198 ± 0.016
1.462HisGlu: 1.462 ± 0.016
1.03HisPhe: 1.03 ± 0.013
1.428HisGly: 1.428 ± 0.02
1.133HisHis: 1.133 ± 0.028
1.385HisIle: 1.385 ± 0.015
1.304HisLys: 1.304 ± 0.015
2.411HisLeu: 2.411 ± 0.023
0.627HisMet: 0.627 ± 0.01
1.148HisAsn: 1.148 ± 0.015
1.387HisPro: 1.387 ± 0.017
1.132HisGln: 1.132 ± 0.016
1.721HisArg: 1.721 ± 0.018
1.967HisSer: 1.967 ± 0.02
1.47HisThr: 1.47 ± 0.092
1.71HisVal: 1.71 ± 0.018
0.294HisTrp: 0.294 ± 0.007
0.899HisTyr: 0.899 ± 0.012
0.0HisXaa: 0.0 ± 0.0
Ile
3.851IleAla: 3.851 ± 0.024
1.276IleCys: 1.276 ± 0.024
3.172IleAsp: 3.172 ± 0.023
3.56IleGlu: 3.56 ± 0.034
2.262IlePhe: 2.262 ± 0.025
2.994IleGly: 2.994 ± 0.026
1.369IleHis: 1.369 ± 0.016
3.334IleIle: 3.334 ± 0.032
3.349IleLys: 3.349 ± 0.028
5.486IleLeu: 5.486 ± 0.04
1.309IleMet: 1.309 ± 0.017
2.713IleAsn: 2.713 ± 0.024
2.74IlePro: 2.74 ± 0.023
2.139IleGln: 2.139 ± 0.019
3.047IleArg: 3.047 ± 0.021
4.402IleSer: 4.402 ± 0.032
3.352IleThr: 3.352 ± 0.028
3.696IleVal: 3.696 ± 0.026
0.581IleTrp: 0.581 ± 0.01
1.772IleTyr: 1.772 ± 0.017
0.001IleXaa: 0.001 ± 0.0
Lys
3.364LysAla: 3.364 ± 0.029
1.21LysCys: 1.21 ± 0.023
3.446LysAsp: 3.446 ± 0.03
4.673LysGlu: 4.673 ± 0.044
1.97LysPhe: 1.97 ± 0.018
2.59LysGly: 2.59 ± 0.029
1.54LysHis: 1.54 ± 0.017
3.636LysIle: 3.636 ± 0.031
4.919LysLys: 4.919 ± 0.057
5.597LysLeu: 5.597 ± 0.044
1.516LysMet: 1.516 ± 0.016
2.977LysAsn: 2.977 ± 0.024
2.742LysPro: 2.742 ± 0.038
2.57LysGln: 2.57 ± 0.025
4.148LysArg: 4.148 ± 0.033
4.467LysSer: 4.467 ± 0.037
3.447LysThr: 3.447 ± 0.028
3.345LysVal: 3.345 ± 0.026
0.676LysTrp: 0.676 ± 0.011
2.044LysTyr: 2.044 ± 0.018
0.0LysXaa: 0.0 ± 0.0
Leu
6.017LeuAla: 6.017 ± 0.044
1.837LeuCys: 1.837 ± 0.02
4.754LeuAsp: 4.754 ± 0.036
5.979LeuGlu: 5.979 ± 0.051
3.214LeuPhe: 3.214 ± 0.035
4.37LeuGly: 4.37 ± 0.03
2.601LeuHis: 2.601 ± 0.025
4.606LeuIle: 4.606 ± 0.038
5.645LeuLys: 5.645 ± 0.042
8.974LeuLeu: 8.974 ± 0.063
2.071LeuMet: 2.071 ± 0.02
4.107LeuAsn: 4.107 ± 0.027
4.791LeuPro: 4.791 ± 0.029
4.478LeuGln: 4.478 ± 0.037
5.737LeuArg: 5.737 ± 0.039
7.123LeuSer: 7.123 ± 0.044
5.058LeuThr: 5.058 ± 0.033
5.068LeuVal: 5.068 ± 0.037
0.987LeuTrp: 0.987 ± 0.013
2.766LeuTyr: 2.766 ± 0.024
0.003LeuXaa: 0.003 ± 0.001
Met
1.575MetAla: 1.575 ± 0.017
0.472MetCys: 0.472 ± 0.009
1.342MetAsp: 1.342 ± 0.018
1.721MetGlu: 1.721 ± 0.017
0.855MetPhe: 0.855 ± 0.012
1.115MetGly: 1.115 ± 0.015
0.609MetHis: 0.609 ± 0.009
1.3MetIle: 1.3 ± 0.018
1.553MetLys: 1.553 ± 0.018
2.146MetLeu: 2.146 ± 0.023
0.685MetMet: 0.685 ± 0.013
1.092MetAsn: 1.092 ± 0.016
1.152MetPro: 1.152 ± 0.014
1.116MetGln: 1.116 ± 0.013
1.374MetArg: 1.374 ± 0.016
1.812MetSer: 1.812 ± 0.019
1.331MetThr: 1.331 ± 0.016
1.261MetVal: 1.261 ± 0.015
0.263MetTrp: 0.263 ± 0.007
0.776MetTyr: 0.776 ± 0.014
0.0MetXaa: 0.0 ± 0.0
Asn
3.186AsnAla: 3.186 ± 0.032
0.933AsnCys: 0.933 ± 0.018
2.637AsnAsp: 2.637 ± 0.024
3.119AsnGlu: 3.119 ± 0.024
1.77AsnPhe: 1.77 ± 0.017
2.85AsnGly: 2.85 ± 0.03
1.121AsnHis: 1.121 ± 0.015
3.111AsnIle: 3.111 ± 0.028
2.8AsnLys: 2.8 ± 0.024
4.288AsnLeu: 4.288 ± 0.031
1.135AsnMet: 1.135 ± 0.014
2.846AsnAsn: 2.846 ± 0.032
2.203AsnPro: 2.203 ± 0.03
1.719AsnGln: 1.719 ± 0.019
2.527AsnArg: 2.527 ± 0.02
3.731AsnSer: 3.731 ± 0.027
2.677AsnThr: 2.677 ± 0.026
3.544AsnVal: 3.544 ± 0.026
0.493AsnTrp: 0.493 ± 0.009
1.566AsnTyr: 1.566 ± 0.017
0.0AsnXaa: 0.0 ± 0.0
Pro
3.279ProAla: 3.279 ± 0.027
0.877ProCys: 0.877 ± 0.044
2.552ProAsp: 2.552 ± 0.022
3.121ProGlu: 3.121 ± 0.033
1.6ProPhe: 1.6 ± 0.018
2.884ProGly: 2.884 ± 0.058
1.268ProHis: 1.268 ± 0.016
2.55ProIle: 2.55 ± 0.023
2.542ProLys: 2.542 ± 0.027
4.131ProLeu: 4.131 ± 0.029
1.035ProMet: 1.035 ± 0.015
2.107ProAsn: 2.107 ± 0.025
4.143ProPro: 4.143 ± 0.067
2.021ProGln: 2.021 ± 0.028
2.954ProArg: 2.954 ± 0.029
4.326ProSer: 4.326 ± 0.043
3.11ProThr: 3.11 ± 0.031
3.196ProVal: 3.196 ± 0.03
0.492ProTrp: 0.492 ± 0.009
1.528ProTyr: 1.528 ± 0.018
0.0ProXaa: 0.0 ± 0.0
Gln
2.521GlnAla: 2.521 ± 0.025
0.811GlnCys: 0.811 ± 0.021
1.998GlnAsp: 1.998 ± 0.017
2.931GlnGlu: 2.931 ± 0.027
1.338GlnPhe: 1.338 ± 0.017
1.791GlnGly: 1.791 ± 0.023
1.25GlnHis: 1.25 ± 0.018
2.166GlnIle: 2.166 ± 0.02
2.482GlnLys: 2.482 ± 0.025
3.894GlnLeu: 3.894 ± 0.034
0.992GlnMet: 0.992 ± 0.016
2.046GlnAsn: 2.046 ± 0.023
2.053GlnPro: 2.053 ± 0.027
3.522GlnGln: 3.522 ± 0.083
2.756GlnArg: 2.756 ± 0.024
3.02GlnSer: 3.02 ± 0.032
2.238GlnThr: 2.238 ± 0.021
2.287GlnVal: 2.287 ± 0.021
0.473GlnTrp: 0.473 ± 0.009
1.289GlnTyr: 1.289 ± 0.015
0.0GlnXaa: 0.0 ± 0.0
Arg
3.675ArgAla: 3.675 ± 0.028
1.24ArgCys: 1.24 ± 0.023
3.32ArgAsp: 3.32 ± 0.03
4.18ArgGlu: 4.18 ± 0.032
2.11ArgPhe: 2.11 ± 0.019
3.247ArgGly: 3.247 ± 0.032
1.716ArgHis: 1.716 ± 0.015
3.345ArgIle: 3.345 ± 0.027
4.197ArgLys: 4.197 ± 0.034
5.317ArgLeu: 5.317 ± 0.035
1.407ArgMet: 1.407 ± 0.016
3.074ArgAsn: 3.074 ± 0.023
2.727ArgPro: 2.727 ± 0.03
2.552ArgGln: 2.552 ± 0.022
5.091ArgArg: 5.091 ± 0.045
4.848ArgSer: 4.848 ± 0.048
3.271ArgThr: 3.271 ± 0.024
3.391ArgVal: 3.391 ± 0.027
0.737ArgTrp: 0.737 ± 0.011
1.99ArgTyr: 1.99 ± 0.016
0.001ArgXaa: 0.001 ± 0.0
Ser
5.069SerAla: 5.069 ± 0.036
1.515SerCys: 1.515 ± 0.029
4.275SerAsp: 4.275 ± 0.032
4.675SerGlu: 4.675 ± 0.032
2.752SerPhe: 2.752 ± 0.028
4.579SerGly: 4.579 ± 0.037
1.894SerHis: 1.894 ± 0.021
4.132SerIle: 4.132 ± 0.027
4.416SerLys: 4.416 ± 0.033
7.071SerLeu: 7.071 ± 0.04
1.792SerMet: 1.792 ± 0.022
3.905SerAsn: 3.905 ± 0.032
4.505SerPro: 4.505 ± 0.049
3.109SerGln: 3.109 ± 0.033
4.839SerArg: 4.839 ± 0.044
8.506SerSer: 8.506 ± 0.077
5.273SerThr: 5.273 ± 0.04
4.879SerVal: 4.879 ± 0.027
0.847SerTrp: 0.847 ± 0.013
2.298SerTyr: 2.298 ± 0.02
0.0SerXaa: 0.0 ± 0.0
Thr
4.139ThrAla: 4.139 ± 0.034
1.264ThrCys: 1.264 ± 0.027
3.014ThrAsp: 3.014 ± 0.022
3.66ThrGlu: 3.66 ± 0.035
2.148ThrPhe: 2.148 ± 0.022
3.289ThrGly: 3.289 ± 0.027
1.383ThrHis: 1.383 ± 0.093
3.381ThrIle: 3.381 ± 0.025
3.279ThrLys: 3.279 ± 0.031
5.403ThrLeu: 5.403 ± 0.032
1.432ThrMet: 1.432 ± 0.014
2.662ThrAsn: 2.662 ± 0.024
3.3ThrPro: 3.3 ± 0.034
2.106ThrGln: 2.106 ± 0.019
3.296ThrArg: 3.296 ± 0.022
5.264ThrSer: 5.264 ± 0.041
4.411ThrThr: 4.411 ± 0.056
4.0ThrVal: 4.0 ± 0.029
0.663ThrTrp: 0.663 ± 0.011
1.776ThrTyr: 1.776 ± 0.016
0.001ThrXaa: 0.001 ± 0.0
Val
4.329ValAla: 4.329 ± 0.024
1.36ValCys: 1.36 ± 0.029
3.375ValAsp: 3.375 ± 0.023
3.955ValGlu: 3.955 ± 0.035
2.267ValPhe: 2.267 ± 0.021
3.082ValGly: 3.082 ± 0.029
1.592ValHis: 1.592 ± 0.018
3.579ValIle: 3.579 ± 0.024
3.603ValLys: 3.603 ± 0.029
5.556ValLeu: 5.556 ± 0.039
1.427ValMet: 1.427 ± 0.016
2.847ValAsn: 2.847 ± 0.029
3.304ValPro: 3.304 ± 0.031
2.587ValGln: 2.587 ± 0.023
3.416ValArg: 3.416 ± 0.025
4.754ValSer: 4.754 ± 0.031
4.049ValThr: 4.049 ± 0.028
3.975ValVal: 3.975 ± 0.034
0.686ValTrp: 0.686 ± 0.011
1.898ValTyr: 1.898 ± 0.019
0.001ValXaa: 0.001 ± 0.0
Trp
0.581TrpAla: 0.581 ± 0.009
0.235TrpCys: 0.235 ± 0.006
0.587TrpAsp: 0.587 ± 0.011
0.623TrpGlu: 0.623 ± 0.012
0.438TrpPhe: 0.438 ± 0.011
0.544TrpGly: 0.544 ± 0.011
0.277TrpHis: 0.277 ± 0.007
0.711TrpIle: 0.711 ± 0.012
0.736TrpLys: 0.736 ± 0.013
1.1TrpLeu: 1.1 ± 0.015
0.309TrpMet: 0.309 ± 0.011
0.572TrpAsn: 0.572 ± 0.011
0.509TrpPro: 0.509 ± 0.008
0.494TrpGln: 0.494 ± 0.008
0.758TrpArg: 0.758 ± 0.012
0.835TrpSer: 0.835 ± 0.011
0.625TrpThr: 0.625 ± 0.009
0.532TrpVal: 0.532 ± 0.009
0.201TrpTrp: 0.201 ± 0.006
0.411TrpTyr: 0.411 ± 0.008
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.907TyrAla: 1.907 ± 0.018
0.718TyrCys: 0.718 ± 0.012
1.699TyrAsp: 1.699 ± 0.02
1.878TyrGlu: 1.878 ± 0.019
1.416TyrPhe: 1.416 ± 0.018
1.838TyrGly: 1.838 ± 0.02
0.887TyrHis: 0.887 ± 0.012
1.82TyrIle: 1.82 ± 0.019
1.809TyrLys: 1.809 ± 0.017
2.9TyrLeu: 2.9 ± 0.023
0.752TyrMet: 0.752 ± 0.012
1.612TyrAsn: 1.612 ± 0.018
1.434TyrPro: 1.434 ± 0.018
1.25TyrGln: 1.25 ± 0.016
1.841TyrArg: 1.841 ± 0.019
2.245TyrSer: 2.245 ± 0.021
1.778TyrThr: 1.778 ± 0.019
2.098TyrVal: 2.098 ± 0.02
0.363TyrTrp: 0.363 ± 0.008
1.223TyrTyr: 1.223 ± 0.016
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.001XaaAla: 0.001 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.001XaaGlu: 0.001 ± 0.0
0.001XaaPhe: 0.001 ± 0.0
0.001XaaGly: 0.001 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.001XaaLys: 0.001 ± 0.0
0.001XaaLeu: 0.001 ± 0.0
0.001XaaMet: 0.001 ± 0.0
0.001XaaAsn: 0.001 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.001XaaArg: 0.001 ± 0.0
0.001XaaSer: 0.001 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.001XaaVal: 0.001 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
1.04XaaXaa: 1.04 ± 0.169
Statistics based on 16497 proteins (6790250 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski