Amino acid dipeptide frequency for Trachymyrmex septentrionalis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
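As an illustration of how such frequencies can be computed, here is a minimal Python sketch. It is not the Proteome-pI implementation: the function names are hypothetical, and it assumes that dipeptides are counted as overlapping windows within each protein (never across protein boundaries), that counts are pooled over the whole proteome, and that any non-standard or unknown residue is folded into Xaa (written as `X`).

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_counts(sequence):
    """Count overlapping dipeptides in a single protein sequence."""
    # Assumption: every non-standard or unknown residue becomes 'X' (Xaa).
    seq = "".join(aa if aa in STANDARD else "X" for aa in sequence.upper())
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def dipeptide_permille(sequences):
    """Pooled dipeptide frequencies, in per mille, over many proteins."""
    total = Counter()
    for seq in sequences:
        total.update(dipeptide_counts(seq))
    n = sum(total.values())
    if n == 0:
        return {}
    return {pair: 1000.0 * count / n for pair, count in total.items()}
```

For example, `dipeptide_permille(["AAC", "AC"])` pools three dipeptides (one `AA`, two `AC`), so `AA` accounts for one third of the total and `AC` for two thirds.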

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins and all amino acids were equally frequent, each of the 400 dipeptides would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
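The uniform expectation and the per mille convention can be checked with a short Python sketch. The function names are hypothetical, and the comparison threshold is an assumption: the page does not state whether the red/blue marking uses 1/400 or 1/441 as the expected value, so the number of residue types is left as a parameter.

```python
def expected_permille(n_residue_types=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (n_residue_types ** 2)


def classify(value_permille, n_residue_types=20):
    """Label a dipeptide as over- or underrepresented relative to uniform."""
    # Assumption: the uniform expectation itself is the threshold.
    expected = expected_permille(n_residue_types)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "expected"


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3
```

With 20 residue types the expectation is 2.5‰ (that is, 0.25%), and with 21 it is about 2.27‰. Under either threshold, LeuLeu at 8.846‰ is classified as "over" and TrpTrp at 0.182‰ as "under".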

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second. Each cell is the frequency in ‰ ± the estimated error.

| First / Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 5.118 ± 0.051 | 1.257 ± 0.03 | 2.843 ± 0.023 | 3.716 ± 0.028 | 2.128 ± 0.021 | 3.319 ± 0.027 | 1.376 ± 0.016 | 3.67 ± 0.025 | 3.556 ± 0.035 | 5.494 ± 0.044 | 1.475 ± 0.015 | 2.715 ± 0.02 | 2.628 ± 0.03 | 2.229 ± 0.023 | 3.685 ± 0.032 | 4.833 ± 0.036 | 3.969 ± 0.034 | 3.91 ± 0.03 | 0.62 ± 0.009 | 1.638 ± 0.018 | 0.001 ± 0.0 |
| Cys | 1.167 ± 0.019 | 0.517 ± 0.011 | 1.136 ± 0.023 | 1.189 ± 0.021 | 0.762 ± 0.012 | 1.318 ± 0.036 | 0.54 ± 0.012 | 1.298 ± 0.03 | 1.199 ± 0.017 | 1.846 ± 0.027 | 0.454 ± 0.01 | 1.065 ± 0.02 | 1.042 ± 0.035 | 0.769 ± 0.019 | 1.237 ± 0.034 | 1.616 ± 0.033 | 1.204 ± 0.026 | 1.308 ± 0.031 | 0.24 ± 0.007 | 0.615 ± 0.011 | 0.0 ± 0.0 |
| Asp | 3.025 ± 0.024 | 1.02 ± 0.021 | 3.678 ± 0.042 | 4.049 ± 0.03 | 2.045 ± 0.021 | 2.97 ± 0.031 | 1.111 ± 0.014 | 3.688 ± 0.031 | 3.333 ± 0.034 | 4.584 ± 0.031 | 1.194 ± 0.015 | 2.816 ± 0.027 | 2.255 ± 0.031 | 1.607 ± 0.016 | 2.865 ± 0.031 | 4.236 ± 0.032 | 3.007 ± 0.024 | 3.533 ± 0.03 | 0.607 ± 0.01 | 1.796 ± 0.018 | 0.001 ± 0.0 |
| Glu | 3.755 ± 0.03 | 1.218 ± 0.034 | 4.015 ± 0.031 | 6.237 ± 0.091 | 2.065 ± 0.02 | 2.899 ± 0.031 | 1.489 ± 0.017 | 4.313 ± 0.045 | 5.257 ± 0.061 | 5.524 ± 0.038 | 1.595 ± 0.017 | 3.885 ± 0.039 | 2.307 ± 0.028 | 2.694 ± 0.03 | 4.356 ± 0.042 | 4.641 ± 0.038 | 3.89 ± 0.035 | 3.463 ± 0.031 | 0.669 ± 0.011 | 1.968 ± 0.017 | 0.002 ± 0.001 |
| Phe | 2.134 ± 0.021 | 0.86 ± 0.013 | 2.007 ± 0.019 | 2.115 ± 0.018 | 1.612 ± 0.024 | 2.09 ± 0.022 | 1.022 ± 0.013 | 2.24 ± 0.022 | 2.044 ± 0.02 | 3.691 ± 0.033 | 0.807 ± 0.012 | 1.786 ± 0.02 | 1.69 ± 0.018 | 1.365 ± 0.014 | 1.931 ± 0.02 | 2.948 ± 0.025 | 2.139 ± 0.02 | 2.391 ± 0.023 | 0.427 ± 0.01 | 1.343 ± 0.014 | 0.001 ± 0.0 |
| Gly | 3.016 ± 0.028 | 1.035 ± 0.02 | 2.695 ± 0.027 | 3.092 ± 0.032 | 1.984 ± 0.021 | 4.251 ± 0.068 | 1.36 ± 0.02 | 3.252 ± 0.027 | 3.248 ± 0.023 | 4.284 ± 0.031 | 1.181 ± 0.017 | 2.664 ± 0.029 | 2.256 ± 0.038 | 1.906 ± 0.02 | 3.255 ± 0.029 | 4.369 ± 0.039 | 3.159 ± 0.029 | 3.063 ± 0.029 | 0.656 ± 0.01 | 1.888 ± 0.026 | 0.002 ± 0.001 |
| His | 1.416 ± 0.017 | 0.602 ± 0.011 | 1.15 ± 0.013 | 1.398 ± 0.016 | 0.979 ± 0.013 | 1.374 ± 0.017 | 1.118 ± 0.03 | 1.51 ± 0.016 | 1.362 ± 0.015 | 2.37 ± 0.025 | 0.586 ± 0.008 | 1.173 ± 0.016 | 1.373 ± 0.02 | 1.109 ± 0.016 | 1.596 ± 0.018 | 1.979 ± 0.021 | 1.406 ± 0.016 | 1.593 ± 0.015 | 0.285 ± 0.008 | 0.882 ± 0.011 | 0.0 ± 0.0 |
| Ile | 3.862 ± 0.027 | 1.372 ± 0.026 | 3.371 ± 0.024 | 3.933 ± 0.036 | 2.537 ± 0.027 | 3.048 ± 0.026 | 1.477 ± 0.018 | 3.971 ± 0.044 | 3.89 ± 0.035 | 5.848 ± 0.044 | 1.389 ± 0.016 | 3.266 ± 0.032 | 2.997 ± 0.032 | 2.366 ± 0.026 | 3.188 ± 0.023 | 4.936 ± 0.033 | 3.649 ± 0.034 | 3.846 ± 0.027 | 0.616 ± 0.011 | 2.035 ± 0.021 | 0.001 ± 0.0 |
| Lys | 3.22 ± 0.032 | 1.275 ± 0.025 | 3.538 ± 0.038 | 4.94 ± 0.061 | 2.181 ± 0.021 | 2.566 ± 0.028 | 1.543 ± 0.017 | 4.248 ± 0.035 | 5.477 ± 0.063 | 5.715 ± 0.037 | 1.543 ± 0.019 | 3.522 ± 0.029 | 2.686 ± 0.046 | 2.656 ± 0.026 | 4.066 ± 0.03 | 4.768 ± 0.043 | 3.639 ± 0.029 | 3.321 ± 0.029 | 0.677 ± 0.012 | 2.213 ± 0.021 | 0.002 ± 0.0 |
| Leu | 5.655 ± 0.04 | 1.851 ± 0.022 | 4.65 ± 0.033 | 5.858 ± 0.05 | 3.269 ± 0.028 | 4.196 ± 0.034 | 2.49 ± 0.022 | 5.002 ± 0.033 | 5.849 ± 0.038 | 8.846 ± 0.056 | 2.028 ± 0.022 | 4.341 ± 0.033 | 4.808 ± 0.036 | 4.353 ± 0.037 | 5.529 ± 0.04 | 7.436 ± 0.045 | 5.169 ± 0.032 | 4.825 ± 0.031 | 0.926 ± 0.014 | 2.857 ± 0.025 | 0.002 ± 0.001 |
| Met | 1.469 ± 0.017 | 0.447 ± 0.01 | 1.282 ± 0.016 | 1.654 ± 0.018 | 0.829 ± 0.013 | 1.06 ± 0.015 | 0.571 ± 0.008 | 1.336 ± 0.017 | 1.54 ± 0.016 | 2.075 ± 0.02 | 0.619 ± 0.011 | 1.133 ± 0.017 | 1.121 ± 0.014 | 1.088 ± 0.015 | 1.316 ± 0.015 | 1.815 ± 0.016 | 1.335 ± 0.016 | 1.188 ± 0.014 | 0.24 ± 0.006 | 0.759 ± 0.014 | 0.001 ± 0.0 |
| Asn | 2.971 ± 0.025 | 1.004 ± 0.018 | 2.865 ± 0.03 | 3.351 ± 0.037 | 1.966 ± 0.02 | 2.816 ± 0.032 | 1.181 ± 0.018 | 3.653 ± 0.031 | 3.254 ± 0.032 | 4.586 ± 0.035 | 1.212 ± 0.016 | 3.366 ± 0.035 | 2.21 ± 0.031 | 1.874 ± 0.022 | 2.559 ± 0.022 | 4.066 ± 0.033 | 2.923 ± 0.025 | 3.545 ± 0.024 | 0.514 ± 0.009 | 1.714 ± 0.018 | 0.001 ± 0.0 |
| Pro | 2.941 ± 0.027 | 0.864 ± 0.045 | 2.346 ± 0.021 | 3.003 ± 0.035 | 1.666 ± 0.018 | 2.78 ± 0.069 | 1.195 ± 0.018 | 2.752 ± 0.028 | 2.574 ± 0.027 | 4.159 ± 0.028 | 0.991 ± 0.015 | 2.188 ± 0.027 | 4.173 ± 0.089 | 1.976 ± 0.025 | 2.824 ± 0.029 | 4.365 ± 0.046 | 3.118 ± 0.026 | 2.996 ± 0.031 | 0.496 ± 0.008 | 1.505 ± 0.017 | 0.002 ± 0.001 |
| Gln | 2.274 ± 0.019 | 0.807 ± 0.022 | 1.917 ± 0.02 | 2.858 ± 0.034 | 1.354 ± 0.014 | 1.738 ± 0.019 | 1.209 ± 0.015 | 2.362 ± 0.026 | 2.556 ± 0.023 | 3.76 ± 0.033 | 0.962 ± 0.016 | 2.215 ± 0.022 | 1.966 ± 0.03 | 3.468 ± 0.078 | 2.574 ± 0.022 | 3.087 ± 0.03 | 2.305 ± 0.023 | 2.137 ± 0.025 | 0.464 ± 0.009 | 1.291 ± 0.014 | 0.001 ± 0.0 |
| Arg | 3.408 ± 0.026 | 1.235 ± 0.022 | 3.15 ± 0.031 | 4.133 ± 0.04 | 2.085 ± 0.019 | 3.205 ± 0.035 | 1.656 ± 0.018 | 3.412 ± 0.027 | 4.109 ± 0.034 | 5.193 ± 0.038 | 1.316 ± 0.016 | 3.07 ± 0.021 | 2.652 ± 0.036 | 2.433 ± 0.019 | 4.93 ± 0.051 | 4.709 ± 0.044 | 3.18 ± 0.026 | 3.189 ± 0.028 | 0.689 ± 0.012 | 1.931 ± 0.019 | 0.003 ± 0.001 |
| Ser | 4.638 ± 0.034 | 1.577 ± 0.029 | 4.273 ± 0.033 | 4.752 ± 0.035 | 2.916 ± 0.026 | 4.51 ± 0.035 | 1.931 ± 0.019 | 4.67 ± 0.033 | 4.695 ± 0.039 | 7.22 ± 0.043 | 1.773 ± 0.018 | 4.285 ± 0.039 | 4.5 ± 0.051 | 3.161 ± 0.028 | 4.802 ± 0.047 | 8.721 ± 0.075 | 5.462 ± 0.043 | 4.765 ± 0.028 | 0.835 ± 0.012 | 2.346 ± 0.023 | 0.003 ± 0.001 |
| Thr | 3.829 ± 0.029 | 1.267 ± 0.025 | 3.017 ± 0.024 | 3.702 ± 0.042 | 2.217 ± 0.02 | 3.206 ± 0.03 | 1.325 ± 0.015 | 3.742 ± 0.03 | 3.537 ± 0.034 | 5.384 ± 0.038 | 1.393 ± 0.016 | 2.981 ± 0.028 | 3.232 ± 0.038 | 2.127 ± 0.022 | 3.163 ± 0.028 | 5.455 ± 0.039 | 4.506 ± 0.06 | 3.872 ± 0.031 | 0.653 ± 0.012 | 1.798 ± 0.021 | 0.001 ± 0.0 |
| Val | 3.963 ± 0.031 | 1.347 ± 0.027 | 3.172 ± 0.024 | 3.752 ± 0.034 | 2.195 ± 0.021 | 2.961 ± 0.027 | 1.487 ± 0.016 | 3.694 ± 0.027 | 3.613 ± 0.028 | 5.278 ± 0.033 | 1.311 ± 0.013 | 2.846 ± 0.023 | 3.112 ± 0.03 | 2.424 ± 0.024 | 3.205 ± 0.025 | 4.664 ± 0.032 | 3.937 ± 0.039 | 3.712 ± 0.03 | 0.624 ± 0.011 | 1.88 ± 0.018 | 0.001 ± 0.0 |
| Trp | 0.536 ± 0.009 | 0.231 ± 0.006 | 0.576 ± 0.01 | 0.61 ± 0.011 | 0.421 ± 0.009 | 0.518 ± 0.009 | 0.294 ± 0.007 | 0.726 ± 0.011 | 0.745 ± 0.013 | 1.111 ± 0.016 | 0.289 ± 0.007 | 0.596 ± 0.01 | 0.42 ± 0.009 | 0.466 ± 0.01 | 0.749 ± 0.012 | 0.814 ± 0.012 | 0.601 ± 0.011 | 0.509 ± 0.01 | 0.182 ± 0.007 | 0.393 ± 0.009 | 0.0 ± 0.0 |
| Tyr | 1.781 ± 0.016 | 0.737 ± 0.013 | 1.737 ± 0.018 | 1.898 ± 0.018 | 1.433 ± 0.017 | 1.787 ± 0.023 | 0.884 ± 0.013 | 2.058 ± 0.022 | 1.958 ± 0.02 | 2.969 ± 0.025 | 0.761 ± 0.011 | 1.717 ± 0.018 | 1.484 ± 0.022 | 1.232 ± 0.015 | 1.856 ± 0.016 | 2.374 ± 0.024 | 1.81 ± 0.019 | 2.013 ± 0.02 | 0.364 ± 0.009 | 1.263 ± 0.017 | 0.001 ± 0.0 |
| Xaa | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.002 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.002 ± 0.001 | 0.0 ± 0.0 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.003 ± 0.001 | 0.002 ± 0.001 | 0.001 ± 0.0 | 0.001 ± 0.0 | 0.0 ± 0.0 | 0.003 ± 0.001 | 1.02 ± 0.095 |

Statistics based on 15167 proteins (6907062 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
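A protein-level bootstrap of this kind could look like the following Python sketch. It is an illustration under stated assumptions, not the database's own code: the function names are hypothetical, whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the reported error is taken to be the sample standard deviation across replicates (the page does not say which spread measure is used).

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, pooled over sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    n = sum(counts.values())
    return 1000.0 * counts[pair] / n if n else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Spread of a dipeptide frequency under protein-level resampling."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    # Assumption: the error is the standard deviation across replicates.
    return statistics.stdev(estimates)
```

Resampling whole proteins rather than individual dipeptides keeps the dipeptides of one protein together, so a few long or repetitive proteins show up as a larger error on the pairs they dominate.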

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski