Amino acid dipeptide frequency for Temnothorax curvispinosus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
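As a minimal sketch of how such a frequency can be computed, the Python snippet below counts overlapping pairs of adjacent residues in each protein and normalises by the total number of pairs. The function name `dipeptide_frequencies`, the one-letter sequence input, and the choice to normalise over all pairs in the proteome are assumptions made for illustration; the page does not state how the database performs the calculation.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: sequences use one-letter codes and overlapping pairs are
    counted, so a protein of length n contributes n - 1 dipeptides.
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["MKKL", "AKK"])
print(freqs["KK"])  # 2 of the 5 pairs are KK
```

Counting overlapping pairs means that a run such as KKK contributes two KK dipeptides, which is one common convention but not the only possible one.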

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
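The uniform expectation and the per-mille units can be made concrete with a short Python sketch. The helper names below are hypothetical, and the rule used here (compare each value with the uniform expectation of 1000/400 = 2.5‰) is only an assumption about how over- and underrepresentation might be decided; the page does not spell out the exact criterion behind the red and blue marking.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if every dipeptide were equally likely."""
    return 1000.0 / alphabet_size ** 2


def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Values taken from the table: LeuLeu = 8.652, TrpTrp = 0.192 (per mille).
print(uniform_expectation_per_mille())  # 2.5
print(classify(8.652))                  # over
print(classify(0.192))                  # under
```

Passing `alphabet_size=21` gives the expectation when Xaa is treated as a 21st amino acid (1000/441, roughly 2.27‰).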

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error), grouped by first residue. Within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.713 ± 0.04
AlaCys: 1.158 ± 0.017
AlaAsp: 3.293 ± 0.02
AlaGlu: 4.369 ± 0.031
AlaPhe: 2.042 ± 0.014
AlaGly: 3.578 ± 0.025
AlaHis: 1.445 ± 0.011
AlaIle: 3.581 ± 0.024
AlaLys: 3.901 ± 0.029
AlaLeu: 5.865 ± 0.036
AlaMet: 1.513 ± 0.012
AlaAsn: 2.802 ± 0.015
AlaPro: 2.963 ± 0.026
AlaGln: 2.585 ± 0.022
AlaArg: 3.673 ± 0.024
AlaSer: 5.28 ± 0.027
AlaThr: 4.263 ± 0.022
AlaVal: 4.225 ± 0.022
AlaTrp: 0.659 ± 0.008
AlaTyr: 1.604 ± 0.012
AlaXaa: 0.001 ± 0.0

Cys
CysAla: 1.073 ± 0.013
CysCys: 0.438 ± 0.008
CysAsp: 1.095 ± 0.014
CysGlu: 1.137 ± 0.016
CysPhe: 0.671 ± 0.008
CysGly: 1.233 ± 0.024
CysHis: 0.501 ± 0.009
CysIle: 1.016 ± 0.017
CysLys: 1.15 ± 0.013
CysLeu: 1.706 ± 0.02
CysMet: 0.369 ± 0.006
CysAsn: 0.95 ± 0.013
CysPro: 0.951 ± 0.021
CysGln: 0.752 ± 0.012
CysArg: 1.101 ± 0.02
CysSer: 1.462 ± 0.021
CysThr: 1.089 ± 0.016
CysVal: 1.186 ± 0.017
CysTrp: 0.213 ± 0.004
CysTyr: 0.553 ± 0.008
CysXaa: 0.001 ± 0.0

Asp
AspAla: 3.402 ± 0.02
AspCys: 0.992 ± 0.014
AspAsp: 3.863 ± 0.024
AspGlu: 4.333 ± 0.023
AspPhe: 1.987 ± 0.014
AspGly: 3.277 ± 0.024
AspHis: 1.153 ± 0.01
AspIle: 3.478 ± 0.022
AspLys: 3.495 ± 0.032
AspLeu: 4.893 ± 0.024
AspMet: 1.233 ± 0.012
AspAsn: 2.722 ± 0.018
AspPro: 2.61 ± 0.027
AspGln: 1.762 ± 0.012
AspArg: 2.966 ± 0.025
AspSer: 4.497 ± 0.024
AspThr: 3.246 ± 0.019
AspVal: 3.803 ± 0.021
AspTrp: 0.657 ± 0.008
AspTyr: 1.773 ± 0.014
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.395 ± 0.035
GluCys: 1.168 ± 0.022
GluAsp: 4.303 ± 0.024
GluGlu: 6.526 ± 0.053
GluPhe: 2.102 ± 0.014
GluGly: 3.148 ± 0.028
GluHis: 1.555 ± 0.011
GluIle: 4.083 ± 0.03
GluLys: 5.22 ± 0.043
GluLeu: 5.906 ± 0.032
GluMet: 1.621 ± 0.013
GluAsn: 3.763 ± 0.024
GluPro: 2.65 ± 0.031
GluGln: 2.905 ± 0.027
GluArg: 4.376 ± 0.03
GluSer: 4.857 ± 0.032
GluThr: 4.093 ± 0.031
GluVal: 3.789 ± 0.023
GluTrp: 0.675 ± 0.007
GluTyr: 1.935 ± 0.015
GluXaa: 0.001 ± 0.0

Phe
PheAla: 2.1 ± 0.015
PheCys: 0.715 ± 0.009
PheAsp: 1.938 ± 0.013
PheGlu: 2.04 ± 0.013
PhePhe: 1.277 ± 0.014
PheGly: 2.011 ± 0.017
PheHis: 0.893 ± 0.01
PheIle: 1.911 ± 0.015
PheLys: 1.883 ± 0.014
PheLeu: 3.271 ± 0.023
PheMet: 0.734 ± 0.008
PheAsn: 1.607 ± 0.012
PhePro: 1.519 ± 0.014
PheGln: 1.307 ± 0.01
PheArg: 1.891 ± 0.019
PheSer: 2.622 ± 0.017
PheThr: 2.026 ± 0.018
PheVal: 2.266 ± 0.016
PheTrp: 0.421 ± 0.006
PheTyr: 1.153 ± 0.011
PheXaa: 0.001 ± 0.0

Gly
GlyAla: 3.285 ± 0.021
GlyCys: 0.925 ± 0.015
GlyAsp: 2.952 ± 0.02
GlyGlu: 3.358 ± 0.029
GlyPhe: 1.924 ± 0.017
GlyGly: 4.413 ± 0.044
GlyHis: 1.449 ± 0.015
GlyIle: 2.971 ± 0.018
GlyLys: 3.319 ± 0.024
GlyLeu: 4.369 ± 0.025
GlyMet: 1.186 ± 0.011
GlyAsn: 2.602 ± 0.017
GlyPro: 2.485 ± 0.031
GlyGln: 2.147 ± 0.016
GlyArg: 3.209 ± 0.022
GlySer: 4.734 ± 0.032
GlyThr: 3.301 ± 0.022
GlyVal: 3.203 ± 0.02
GlyTrp: 0.627 ± 0.007
GlyTyr: 1.853 ± 0.018
GlyXaa: 0.001 ± 0.0

His
HisAla: 1.474 ± 0.014
HisCys: 0.552 ± 0.009
HisAsp: 1.158 ± 0.009
HisGlu: 1.444 ± 0.011
HisPhe: 0.917 ± 0.009
HisGly: 1.417 ± 0.012
HisHis: 1.156 ± 0.018
HisIle: 1.37 ± 0.011
HisLys: 1.305 ± 0.011
HisLeu: 2.4 ± 0.016
HisMet: 0.595 ± 0.009
HisAsn: 1.143 ± 0.008
HisPro: 1.409 ± 0.014
HisGln: 1.209 ± 0.013
HisArg: 1.563 ± 0.012
HisSer: 1.974 ± 0.017
HisThr: 1.353 ± 0.011
HisVal: 1.583 ± 0.01
HisTrp: 0.288 ± 0.004
HisTyr: 0.849 ± 0.008
HisXaa: 0.001 ± 0.0

Ile
IleAla: 3.62 ± 0.019
IleCys: 1.142 ± 0.016
IleAsp: 3.256 ± 0.02
IleGlu: 3.731 ± 0.032
IlePhe: 2.078 ± 0.017
IleGly: 2.846 ± 0.019
IleHis: 1.291 ± 0.012
IleIle: 3.229 ± 0.019
IleLys: 3.46 ± 0.027
IleLeu: 5.212 ± 0.029
IleMet: 1.241 ± 0.011
IleAsn: 2.776 ± 0.02
IlePro: 2.814 ± 0.018
IleGln: 2.272 ± 0.016
IleArg: 2.918 ± 0.016
IleSer: 4.469 ± 0.026
IleThr: 3.354 ± 0.021
IleVal: 3.595 ± 0.021
IleTrp: 0.541 ± 0.006
IleTyr: 1.604 ± 0.013
IleXaa: 0.001 ± 0.0

Lys
LysAla: 3.533 ± 0.027
LysCys: 1.175 ± 0.014
LysAsp: 3.695 ± 0.038
LysGlu: 5.002 ± 0.042
LysPhe: 2.004 ± 0.013
LysGly: 2.609 ± 0.021
LysHis: 1.515 ± 0.01
LysIle: 3.752 ± 0.026
LysLys: 5.039 ± 0.04
LysLeu: 5.696 ± 0.031
LysMet: 1.499 ± 0.013
LysAsn: 3.125 ± 0.024
LysPro: 2.964 ± 0.05
LysGln: 2.63 ± 0.019
LysArg: 3.952 ± 0.022
LysSer: 4.595 ± 0.029
LysThr: 3.649 ± 0.023
LysVal: 3.448 ± 0.023
LysTrp: 0.761 ± 0.011
LysTyr: 1.972 ± 0.016
LysXaa: 0.001 ± 0.0

Leu
LeuAla: 5.932 ± 0.029
LeuCys: 1.709 ± 0.016
LeuAsp: 4.877 ± 0.031
LeuGlu: 6.38 ± 0.041
LeuPhe: 2.811 ± 0.022
LeuGly: 4.228 ± 0.022
LeuHis: 2.481 ± 0.016
LeuIle: 4.523 ± 0.027
LeuLys: 5.8 ± 0.03
LeuLeu: 8.652 ± 0.052
LeuMet: 1.95 ± 0.015
LeuAsn: 4.232 ± 0.024
LeuPro: 4.721 ± 0.025
LeuGln: 4.713 ± 0.03
LeuArg: 5.669 ± 0.03
LeuSer: 7.172 ± 0.028
LeuThr: 5.227 ± 0.024
LeuVal: 4.875 ± 0.023
LeuTrp: 0.934 ± 0.01
LeuTyr: 2.597 ± 0.017
LeuXaa: 0.001 ± 0.0

Met
MetAla: 1.519 ± 0.01
MetCys: 0.406 ± 0.006
MetAsp: 1.367 ± 0.011
MetGlu: 1.746 ± 0.015
MetPhe: 0.8 ± 0.01
MetGly: 1.088 ± 0.012
MetHis: 0.542 ± 0.006
MetIle: 1.141 ± 0.011
MetLys: 1.42 ± 0.011
MetLeu: 2.018 ± 0.015
MetMet: 0.628 ± 0.007
MetAsn: 1.03 ± 0.01
MetPro: 1.122 ± 0.011
MetGln: 1.086 ± 0.012
MetArg: 1.287 ± 0.012
MetSer: 1.823 ± 0.014
MetThr: 1.364 ± 0.011
MetVal: 1.223 ± 0.011
MetTrp: 0.231 ± 0.004
MetTyr: 0.679 ± 0.007
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.191 ± 0.019
AsnCys: 0.892 ± 0.01
AsnAsp: 2.737 ± 0.02
AsnGlu: 3.259 ± 0.019
AsnPhe: 1.682 ± 0.015
AsnGly: 2.858 ± 0.02
AsnHis: 1.124 ± 0.012
AsnIle: 3.122 ± 0.021
AsnLys: 2.97 ± 0.022
AsnLeu: 4.324 ± 0.022
AsnMet: 1.13 ± 0.01
AsnAsn: 3.016 ± 0.025
AsnPro: 2.19 ± 0.018
AsnGln: 1.87 ± 0.014
AsnArg: 2.424 ± 0.013
AsnSer: 3.8 ± 0.024
AsnThr: 2.786 ± 0.016
AsnVal: 3.465 ± 0.019
AsnTrp: 0.487 ± 0.006
AsnTyr: 1.51 ± 0.013
AsnXaa: 0.001 ± 0.0

Pro
ProAla: 3.312 ± 0.023
ProCys: 0.765 ± 0.025
ProAsp: 2.618 ± 0.015
ProGlu: 3.296 ± 0.024
ProPhe: 1.537 ± 0.015
ProGly: 3.093 ± 0.038
ProHis: 1.291 ± 0.014
ProIle: 2.611 ± 0.019
ProLys: 2.736 ± 0.033
ProLeu: 4.246 ± 0.026
ProMet: 1.06 ± 0.01
ProAsn: 2.196 ± 0.018
ProPro: 4.593 ± 0.059
ProGln: 2.22 ± 0.022
ProArg: 2.867 ± 0.02
ProSer: 4.631 ± 0.033
ProThr: 3.347 ± 0.023
ProVal: 3.19 ± 0.022
ProTrp: 0.489 ± 0.006
ProTyr: 1.434 ± 0.012
ProXaa: 0.001 ± 0.0

Gln
GlnAla: 2.687 ± 0.021
GlnCys: 0.766 ± 0.014
GlnAsp: 2.096 ± 0.014
GlnGlu: 3.084 ± 0.024
GlnPhe: 1.329 ± 0.01
GlnGly: 1.91 ± 0.015
GlnHis: 1.335 ± 0.017
GlnIle: 2.273 ± 0.016
GlnLys: 2.489 ± 0.018
GlnLeu: 4.038 ± 0.027
GlnMet: 1.012 ± 0.012
GlnAsn: 2.174 ± 0.017
GlnPro: 2.201 ± 0.023
GlnGln: 4.245 ± 0.071
GlnArg: 2.813 ± 0.02
GlnSer: 3.332 ± 0.027
GlnThr: 2.404 ± 0.022
GlnVal: 2.337 ± 0.017
GlnTrp: 0.502 ± 0.006
GlnTyr: 1.277 ± 0.013
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 3.492 ± 0.02
ArgCys: 1.099 ± 0.015
ArgAsp: 3.373 ± 0.023
ArgGlu: 4.113 ± 0.03
ArgPhe: 2.0 ± 0.013
ArgGly: 3.091 ± 0.022
ArgHis: 1.63 ± 0.013
ArgIle: 3.103 ± 0.017
ArgLys: 4.042 ± 0.023
ArgLeu: 5.204 ± 0.029
ArgMet: 1.231 ± 0.01
ArgAsn: 2.862 ± 0.017
ArgPro: 2.709 ± 0.024
ArgGln: 2.565 ± 0.018
ArgArg: 4.635 ± 0.034
ArgSer: 4.74 ± 0.035
ArgThr: 3.155 ± 0.019
ArgVal: 3.309 ± 0.02
ArgTrp: 0.716 ± 0.009
ArgTyr: 1.856 ± 0.015
ArgXaa: 0.001 ± 0.0

Ser
SerAla: 5.06 ± 0.026
SerCys: 1.407 ± 0.019
SerAsp: 4.553 ± 0.024
SerGlu: 4.886 ± 0.028
SerPhe: 2.596 ± 0.015
SerGly: 4.742 ± 0.03
SerHis: 1.891 ± 0.017
SerIle: 4.16 ± 0.019
SerLys: 4.643 ± 0.029
SerLeu: 7.132 ± 0.032
SerMet: 1.754 ± 0.012
SerAsn: 4.0 ± 0.028
SerPro: 4.911 ± 0.043
SerGln: 3.426 ± 0.025
SerArg: 4.747 ± 0.029
SerSer: 8.924 ± 0.059
SerThr: 5.602 ± 0.03
SerVal: 4.837 ± 0.023
SerTrp: 0.825 ± 0.009
SerTyr: 2.152 ± 0.014
SerXaa: 0.001 ± 0.0

Thr
ThrAla: 4.213 ± 0.022
ThrCys: 1.198 ± 0.017
ThrAsp: 3.141 ± 0.018
ThrGlu: 3.786 ± 0.027
ThrPhe: 2.088 ± 0.015
ThrGly: 3.488 ± 0.029
ThrHis: 1.314 ± 0.011
ThrIle: 3.333 ± 0.016
ThrLys: 3.472 ± 0.025
ThrLeu: 5.449 ± 0.024
ThrMet: 1.402 ± 0.012
ThrAsn: 2.806 ± 0.017
ThrPro: 3.678 ± 0.029
ThrGln: 2.289 ± 0.016
ThrArg: 3.064 ± 0.019
ThrSer: 5.573 ± 0.029
ThrThr: 4.617 ± 0.036
ThrVal: 4.031 ± 0.022
ThrTrp: 0.681 ± 0.01
ThrTyr: 1.711 ± 0.014
ThrXaa: 0.001 ± 0.0

Val
ValAla: 4.197 ± 0.024
ValCys: 1.265 ± 0.017
ValAsp: 3.486 ± 0.02
ValGlu: 4.068 ± 0.029
ValPhe: 2.093 ± 0.017
ValGly: 3.039 ± 0.018
ValHis: 1.486 ± 0.01
ValIle: 3.44 ± 0.019
ValLys: 3.717 ± 0.025
ValLeu: 5.32 ± 0.025
ValMet: 1.358 ± 0.012
ValAsn: 2.934 ± 0.02
ValPro: 3.221 ± 0.019
ValGln: 2.645 ± 0.019
ValArg: 3.293 ± 0.019
ValSer: 4.753 ± 0.022
ValThr: 4.117 ± 0.021
ValVal: 3.852 ± 0.02
ValTrp: 0.631 ± 0.007
ValTyr: 1.748 ± 0.015
ValXaa: 0.001 ± 0.0

Trp
TrpAla: 0.588 ± 0.006
TrpCys: 0.204 ± 0.004
TrpAsp: 0.636 ± 0.009
TrpGlu: 0.671 ± 0.008
TrpPhe: 0.404 ± 0.006
TrpGly: 0.488 ± 0.007
TrpHis: 0.272 ± 0.004
TrpIle: 0.674 ± 0.008
TrpLys: 0.77 ± 0.009
TrpLeu: 1.122 ± 0.013
TrpMet: 0.279 ± 0.005
TrpAsn: 0.569 ± 0.007
TrpPro: 0.438 ± 0.006
TrpGln: 0.477 ± 0.006
TrpArg: 0.744 ± 0.007
TrpSer: 0.837 ± 0.01
TrpThr: 0.643 ± 0.008
TrpVal: 0.521 ± 0.008
TrpTrp: 0.192 ± 0.004
TrpTyr: 0.35 ± 0.006
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.725 ± 0.013
TyrCys: 0.67 ± 0.008
TyrAsp: 1.705 ± 0.013
TyrGlu: 1.839 ± 0.014
TyrPhe: 1.244 ± 0.012
TyrGly: 1.755 ± 0.017
TyrHis: 0.835 ± 0.008
TyrIle: 1.678 ± 0.013
TyrLys: 1.784 ± 0.012
TyrLeu: 2.663 ± 0.02
TyrMet: 0.724 ± 0.008
TyrAsn: 1.527 ± 0.012
TyrPro: 1.388 ± 0.014
TyrGln: 1.2 ± 0.01
TyrArg: 1.734 ± 0.012
TyrSer: 2.17 ± 0.017
TyrThr: 1.698 ± 0.015
TyrVal: 1.924 ± 0.015
TyrTrp: 0.347 ± 0.006
TyrTyr: 1.128 ± 0.011
TyrXaa: 0.001 ± 0.0

Xaa
XaaAla: 0.002 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.001 ± 0.0
XaaGlu: 0.001 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.0
XaaLys: 0.001 ± 0.0
XaaLeu: 0.002 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.001 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.001 ± 0.0
XaaThr: 0.001 ± 0.0
XaaVal: 0.001 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.015 ± 0.004

Statistics based on 21925 proteins (14401447 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
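A protein-level bootstrap of this kind can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is reported as the error. The function name `bootstrap_error`, the fixed seed, and the use of the sample standard deviation as the error measure are assumptions for illustration; the note above only states that 100 bootstrap rounds were run at the protein level.

```python
import random
from collections import Counter
from statistics import stdev


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Estimate the error of one dipeptide frequency (in per mille).

    Assumption: the error is the standard deviation of the frequency
    across resamples in which whole proteins are drawn with replacement.
    """
    if not proteins:
        return 0.0
    rng = random.Random(seed)
    # Count dipeptides once per protein so resampling only sums counters.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1)) for seq in proteins
    ]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        if total:
            hits = sum(c[pair] for c in sample)
            estimates.append(1000.0 * hits / total)
    return stdev(estimates) if len(estimates) > 1 else 0.0


# Toy input (hypothetical sequences, not taken from the proteome).
print(bootstrap_error(["MKKL", "AKK", "GGGG", "KKKK"], "KK"))
```

Resampling proteins rather than individual dipeptides keeps all pairs from one protein together, so the estimate reflects variation between proteins.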

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski