Amino acid dipeptide frequency for Candidatus Afipia apatlaquensis

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
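As a rough illustration of how such a frequency can be computed, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs within each protein and never across protein boundaries, which this page does not state explicitly. The function name `dipeptide_per_mille` and the toy sequences are hypothetical, and one-letter codes are used for brevity instead of the three-letter codes of the table.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} pooled over all sequences.

    Assumption: overlapping pairs are counted inside each sequence,
    and no pair spans two different sequences.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome (not real data from this organism).
toy_proteins = ["MAAL", "MALA"]
freqs = dipeptide_per_mille(toy_proteins)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

By construction the frequencies of all observed dipeptides sum to 1000‰.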

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
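The comparison against the uniform expectation can be sketched as follows. This is only an illustration: the function `classify` is hypothetical, the four sample values are taken from the table on this page, and the exact rule used for the red and blue marking (for instance how a value equal to the expectation is treated) is an assumption.

```python
N_STANDARD = 20
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / (N_STANDARD ** 2)  # 2.5


def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"      # would be marked red
    if freq_per_mille < expected:
        return "under"     # would be marked blue
    return "expected"      # tie handling is an assumption


# A few values copied from the table (per mille).
sample = {"AlaAla": 16.516, "CysCys": 0.119, "GluGlu": 2.573, "LysSer": 2.5}
labels = {pair: classify(value) for pair, value in sample.items()}
print(labels)
```

Under this rule AlaAla and GluGlu come out as overrepresented, CysCys as underrepresented, and LysSer sits exactly at the expected 2.5‰.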

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 16.516‰ corresponds to 0.016516.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 16.516 ± 0.155
AlaCys: 1.007 ± 0.029
AlaAsp: 6.536 ± 0.067
AlaGlu: 6.519 ± 0.062
AlaPhe: 4.475 ± 0.051
AlaGly: 9.707 ± 0.093
AlaHis: 2.142 ± 0.04
AlaIle: 6.782 ± 0.078
AlaLys: 5.086 ± 0.071
AlaLeu: 12.301 ± 0.111
AlaMet: 3.452 ± 0.051
AlaAsn: 3.25 ± 0.052
AlaPro: 5.483 ± 0.076
AlaGln: 4.009 ± 0.053
AlaArg: 7.917 ± 0.076
AlaSer: 6.9 ± 0.076
AlaThr: 6.195 ± 0.074
AlaVal: 8.553 ± 0.082
AlaTrp: 1.362 ± 0.032
AlaTyr: 2.587 ± 0.04
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.954 ± 0.028
CysCys: 0.119 ± 0.009
CysAsp: 0.493 ± 0.018
CysGlu: 0.416 ± 0.018
CysPhe: 0.343 ± 0.016
CysGly: 0.893 ± 0.027
CysHis: 0.221 ± 0.012
CysIle: 0.452 ± 0.017
CysLys: 0.246 ± 0.012
CysLeu: 0.701 ± 0.024
CysMet: 0.178 ± 0.011
CysAsn: 0.244 ± 0.013
CysPro: 0.43 ± 0.019
CysGln: 0.239 ± 0.014
CysArg: 0.566 ± 0.019
CysSer: 0.472 ± 0.02
CysThr: 0.418 ± 0.016
CysVal: 0.661 ± 0.022
CysTrp: 0.093 ± 0.008
CysTyr: 0.189 ± 0.012
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.575 ± 0.075
AspCys: 0.459 ± 0.017
AspAsp: 3.149 ± 0.054
AspGlu: 3.251 ± 0.045
AspPhe: 2.178 ± 0.035
AspGly: 4.714 ± 0.053
AspHis: 1.239 ± 0.032
AspIle: 3.209 ± 0.045
AspLys: 2.141 ± 0.038
AspLeu: 5.472 ± 0.057
AspMet: 1.286 ± 0.026
AspAsn: 1.48 ± 0.03
AspPro: 3.198 ± 0.051
AspGln: 1.755 ± 0.035
AspArg: 4.005 ± 0.052
AspSer: 2.394 ± 0.039
AspThr: 2.629 ± 0.042
AspVal: 4.249 ± 0.058
AspTrp: 0.84 ± 0.024
AspTyr: 1.475 ± 0.026
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.444 ± 0.07
GluCys: 0.356 ± 0.016
GluAsp: 2.492 ± 0.043
GluGlu: 2.573 ± 0.046
GluPhe: 1.871 ± 0.037
GluGly: 3.723 ± 0.056
GluHis: 1.07 ± 0.031
GluIle: 3.441 ± 0.054
GluLys: 2.598 ± 0.048
GluLeu: 4.961 ± 0.059
GluMet: 1.429 ± 0.033
GluAsn: 1.66 ± 0.035
GluPro: 2.429 ± 0.041
GluGln: 2.09 ± 0.035
GluArg: 4.504 ± 0.046
GluSer: 2.471 ± 0.047
GluThr: 3.243 ± 0.048
GluVal: 3.549 ± 0.05
GluTrp: 0.681 ± 0.021
GluTyr: 1.031 ± 0.027
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.694 ± 0.049
PheCys: 0.4 ± 0.017
PheAsp: 2.538 ± 0.038
PheGlu: 2.059 ± 0.035
PhePhe: 1.492 ± 0.036
PheGly: 3.839 ± 0.053
PheHis: 0.752 ± 0.023
PheIle: 1.932 ± 0.04
PheLys: 1.312 ± 0.027
PheLeu: 3.401 ± 0.051
PheMet: 0.823 ± 0.022
PheAsn: 1.224 ± 0.031
PhePro: 1.691 ± 0.034
PheGln: 1.01 ± 0.026
PheArg: 2.253 ± 0.041
PheSer: 2.339 ± 0.039
PheThr: 2.103 ± 0.041
PheVal: 3.094 ± 0.047
PheTrp: 0.523 ± 0.02
PheTyr: 0.916 ± 0.028
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.81 ± 0.084
GlyCys: 0.798 ± 0.026
GlyAsp: 4.174 ± 0.059
GlyGlu: 4.222 ± 0.06
GlyPhe: 3.672 ± 0.044
GlyGly: 7.244 ± 0.092
GlyHis: 1.771 ± 0.037
GlyIle: 4.842 ± 0.062
GlyLys: 3.743 ± 0.058
GlyLeu: 8.25 ± 0.07
GlyMet: 2.172 ± 0.038
GlyAsn: 2.326 ± 0.044
GlyPro: 3.384 ± 0.046
GlyGln: 2.723 ± 0.046
GlyArg: 5.437 ± 0.065
GlySer: 4.695 ± 0.063
GlyThr: 4.614 ± 0.065
GlyVal: 6.142 ± 0.071
GlyTrp: 1.323 ± 0.033
GlyTyr: 2.319 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.232 ± 0.043
HisCys: 0.217 ± 0.011
HisAsp: 1.208 ± 0.026
HisGlu: 0.942 ± 0.024
HisPhe: 0.78 ± 0.023
HisGly: 1.871 ± 0.037
HisHis: 0.587 ± 0.025
HisIle: 1.012 ± 0.028
HisLys: 0.547 ± 0.02
HisLeu: 1.922 ± 0.039
HisMet: 0.473 ± 0.016
HisAsn: 0.518 ± 0.018
HisPro: 1.217 ± 0.032
HisGln: 0.539 ± 0.019
HisArg: 1.427 ± 0.033
HisSer: 0.961 ± 0.027
HisThr: 0.844 ± 0.027
HisVal: 1.498 ± 0.032
HisTrp: 0.319 ± 0.014
HisTyr: 0.516 ± 0.023
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.859 ± 0.071
IleCys: 0.534 ± 0.019
IleAsp: 3.789 ± 0.05
IleGlu: 3.687 ± 0.057
IlePhe: 1.888 ± 0.044
IleGly: 5.365 ± 0.059
IleHis: 0.944 ± 0.023
IleIle: 2.64 ± 0.044
IleLys: 2.126 ± 0.044
IleLeu: 4.715 ± 0.062
IleMet: 1.1 ± 0.027
IleAsn: 1.738 ± 0.036
IlePro: 2.579 ± 0.041
IleGln: 1.422 ± 0.03
IleArg: 3.203 ± 0.048
IleSer: 3.325 ± 0.053
IleThr: 2.92 ± 0.052
IleVal: 4.878 ± 0.064
IleTrp: 0.637 ± 0.019
IleTyr: 1.301 ± 0.029
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.827 ± 0.063
LysCys: 0.206 ± 0.012
LysAsp: 2.172 ± 0.037
LysGlu: 1.929 ± 0.042
LysPhe: 1.284 ± 0.029
LysGly: 2.926 ± 0.05
LysHis: 0.783 ± 0.024
LysIle: 2.414 ± 0.045
LysLys: 1.883 ± 0.045
LysLeu: 4.085 ± 0.054
LysMet: 1.01 ± 0.027
LysAsn: 1.255 ± 0.033
LysPro: 2.597 ± 0.044
LysGln: 1.349 ± 0.031
LysArg: 2.728 ± 0.041
LysSer: 2.5 ± 0.041
LysThr: 2.457 ± 0.04
LysVal: 2.793 ± 0.046
LysTrp: 0.458 ± 0.017
LysTyr: 0.886 ± 0.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 12.35 ± 0.107
LeuCys: 0.831 ± 0.025
LeuAsp: 5.518 ± 0.065
LeuGlu: 4.674 ± 0.066
LeuPhe: 3.579 ± 0.057
LeuGly: 7.725 ± 0.075
LeuHis: 1.734 ± 0.036
LeuIle: 5.468 ± 0.058
LeuLys: 4.245 ± 0.049
LeuLeu: 9.058 ± 0.104
LeuMet: 2.368 ± 0.037
LeuAsn: 2.861 ± 0.042
LeuPro: 5.187 ± 0.058
LeuGln: 2.709 ± 0.043
LeuArg: 6.432 ± 0.072
LeuSer: 6.346 ± 0.077
LeuThr: 5.717 ± 0.06
LeuVal: 7.143 ± 0.082
LeuTrp: 1.045 ± 0.029
LeuTyr: 1.926 ± 0.04
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.915 ± 0.041
MetCys: 0.179 ± 0.012
MetAsp: 1.076 ± 0.024
MetGlu: 1.018 ± 0.03
MetPhe: 0.808 ± 0.021
MetGly: 1.679 ± 0.032
MetHis: 0.444 ± 0.017
MetIle: 1.461 ± 0.031
MetLys: 1.17 ± 0.029
MetLeu: 2.479 ± 0.047
MetMet: 0.686 ± 0.021
MetAsn: 0.813 ± 0.021
MetPro: 1.487 ± 0.033
MetGln: 0.845 ± 0.024
MetArg: 1.732 ± 0.035
MetSer: 1.783 ± 0.038
MetThr: 2.03 ± 0.035
MetVal: 1.659 ± 0.033
MetTrp: 0.253 ± 0.013
MetTyr: 0.343 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.441 ± 0.044
AsnCys: 0.269 ± 0.014
AsnAsp: 1.592 ± 0.038
AsnGlu: 1.503 ± 0.032
AsnPhe: 1.146 ± 0.028
AsnGly: 2.619 ± 0.045
AsnHis: 0.537 ± 0.019
AsnIle: 1.766 ± 0.038
AsnLys: 1.041 ± 0.027
AsnLeu: 2.785 ± 0.042
AsnMet: 0.652 ± 0.022
AsnAsn: 0.865 ± 0.024
AsnPro: 1.89 ± 0.036
AsnGln: 0.833 ± 0.025
AsnArg: 1.831 ± 0.033
AsnSer: 1.53 ± 0.031
AsnThr: 1.485 ± 0.031
AsnVal: 2.313 ± 0.041
AsnTrp: 0.468 ± 0.017
AsnTyr: 0.811 ± 0.025
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 6.12 ± 0.061
ProCys: 0.313 ± 0.016
ProAsp: 3.441 ± 0.049
ProGlu: 3.215 ± 0.048
ProPhe: 2.003 ± 0.039
ProGly: 4.337 ± 0.063
ProHis: 0.985 ± 0.025
ProIle: 2.449 ± 0.039
ProLys: 2.107 ± 0.043
ProLeu: 4.588 ± 0.052
ProMet: 1.195 ± 0.027
ProAsn: 1.572 ± 0.034
ProPro: 2.741 ± 0.061
ProGln: 1.755 ± 0.038
ProArg: 2.932 ± 0.045
ProSer: 3.054 ± 0.046
ProThr: 2.564 ± 0.049
ProVal: 4.088 ± 0.051
ProTrp: 0.679 ± 0.023
ProTyr: 1.174 ± 0.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.818 ± 0.052
GlnCys: 0.222 ± 0.013
GlnAsp: 1.478 ± 0.032
GlnGlu: 1.302 ± 0.03
GlnPhe: 1.179 ± 0.03
GlnGly: 2.23 ± 0.039
GlnHis: 0.672 ± 0.016
GlnIle: 2.051 ± 0.033
GlnLys: 1.356 ± 0.036
GlnLeu: 2.914 ± 0.047
GlnMet: 0.862 ± 0.026
GlnAsn: 0.99 ± 0.027
GlnPro: 1.78 ± 0.04
GlnGln: 1.345 ± 0.034
GlnArg: 2.479 ± 0.047
GlnSer: 1.952 ± 0.038
GlnThr: 1.806 ± 0.037
GlnVal: 2.203 ± 0.038
GlnTrp: 0.44 ± 0.018
GlnTyr: 0.695 ± 0.026
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.449 ± 0.078
ArgCys: 0.499 ± 0.017
ArgAsp: 4.002 ± 0.059
ArgGlu: 3.982 ± 0.055
ArgPhe: 2.713 ± 0.045
ArgGly: 4.605 ± 0.056
ArgHis: 1.534 ± 0.034
ArgIle: 3.944 ± 0.052
ArgLys: 2.727 ± 0.047
ArgLeu: 6.941 ± 0.071
ArgMet: 1.698 ± 0.028
ArgAsn: 2.019 ± 0.039
ArgPro: 3.326 ± 0.052
ArgGln: 2.308 ± 0.04
ArgArg: 5.061 ± 0.076
ArgSer: 3.799 ± 0.051
ArgThr: 3.45 ± 0.048
ArgVal: 4.638 ± 0.056
ArgTrp: 0.92 ± 0.026
ArgTyr: 1.628 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.632 ± 0.077
SerCys: 0.456 ± 0.016
SerAsp: 3.246 ± 0.041
SerGlu: 2.945 ± 0.048
SerPhe: 2.407 ± 0.042
SerGly: 5.563 ± 0.066
SerHis: 1.108 ± 0.024
SerIle: 3.336 ± 0.049
SerLys: 2.148 ± 0.041
SerLeu: 5.49 ± 0.057
SerMet: 1.395 ± 0.033
SerAsn: 1.717 ± 0.037
SerPro: 3.056 ± 0.044
SerGln: 1.797 ± 0.033
SerArg: 3.708 ± 0.053
SerSer: 3.513 ± 0.058
SerThr: 2.896 ± 0.048
SerVal: 4.287 ± 0.06
SerTrp: 0.761 ± 0.021
SerTyr: 1.412 ± 0.032
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.339 ± 0.063
ThrCys: 0.45 ± 0.019
ThrAsp: 2.823 ± 0.044
ThrGlu: 2.593 ± 0.037
ThrPhe: 2.23 ± 0.042
ThrGly: 5.058 ± 0.061
ThrHis: 0.992 ± 0.029
ThrIle: 3.243 ± 0.054
ThrLys: 1.91 ± 0.04
ThrLeu: 5.652 ± 0.066
ThrMet: 1.351 ± 0.029
ThrAsn: 1.502 ± 0.036
ThrPro: 3.314 ± 0.042
ThrGln: 1.531 ± 0.029
ThrArg: 3.327 ± 0.043
ThrSer: 3.232 ± 0.049
ThrThr: 3.069 ± 0.055
ThrVal: 4.367 ± 0.056
ThrTrp: 0.673 ± 0.021
ThrTyr: 1.24 ± 0.028
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.118 ± 0.074
ValCys: 0.648 ± 0.021
ValAsp: 3.913 ± 0.046
ValGlu: 4.143 ± 0.06
ValPhe: 2.817 ± 0.048
ValGly: 5.563 ± 0.069
ValHis: 1.351 ± 0.029
ValIle: 4.42 ± 0.051
ValLys: 2.832 ± 0.048
ValLeu: 7.301 ± 0.071
ValMet: 1.926 ± 0.035
ValAsn: 2.164 ± 0.041
ValPro: 3.77 ± 0.05
ValGln: 2.214 ± 0.043
ValArg: 4.852 ± 0.054
ValSer: 4.539 ± 0.057
ValThr: 4.548 ± 0.054
ValVal: 5.986 ± 0.083
ValTrp: 0.923 ± 0.025
ValTyr: 1.539 ± 0.033
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.139 ± 0.029
TrpCys: 0.137 ± 0.011
TrpAsp: 0.612 ± 0.022
TrpGlu: 0.492 ± 0.017
TrpPhe: 0.531 ± 0.019
TrpGly: 0.876 ± 0.025
TrpHis: 0.322 ± 0.013
TrpIle: 0.751 ± 0.021
TrpLys: 0.549 ± 0.02
TrpLeu: 1.571 ± 0.035
TrpMet: 0.367 ± 0.017
TrpAsn: 0.486 ± 0.016
TrpPro: 0.648 ± 0.022
TrpGln: 0.537 ± 0.017
TrpArg: 1.092 ± 0.026
TrpSer: 0.833 ± 0.023
TrpThr: 0.806 ± 0.02
TrpVal: 0.751 ± 0.025
TrpTrp: 0.219 ± 0.013
TrpTyr: 0.26 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.547 ± 0.043
TyrCys: 0.227 ± 0.012
TyrAsp: 1.447 ± 0.031
TyrGlu: 1.152 ± 0.027
TyrPhe: 1.001 ± 0.025
TyrGly: 2.125 ± 0.042
TyrHis: 0.44 ± 0.016
TyrIle: 1.04 ± 0.032
TyrLys: 0.773 ± 0.022
TyrLeu: 2.328 ± 0.043
TyrMet: 0.438 ± 0.017
TyrAsn: 0.693 ± 0.021
TyrPro: 1.105 ± 0.028
TyrGln: 0.762 ± 0.024
TyrArg: 1.757 ± 0.034
TyrSer: 1.239 ± 0.028
TyrThr: 1.101 ± 0.027
TyrVal: 1.715 ± 0.034
TyrTrp: 0.361 ± 0.016
TyrTyr: 0.656 ± 0.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6186 proteins (1551791 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
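A protein-level bootstrap of this kind might look like the sketch below. It is not the database's actual code: the function names, the toy sequences, and the choice of the standard deviation across replicates as the reported error are all assumptions. Only the number of rounds (100) and the resampling unit (whole proteins) come from the note above.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency (per mille).

    Whole proteins are resampled with replacement; the error is taken
    as the standard deviation over the replicates (an assumption).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        resampled = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in resampled)
        hits = sum(c[pair] for c in resampled)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy proteome in one-letter code (not real data).
toy = ["MAAL", "MALA", "AAAA", "MKLV"]
print(bootstrap_error(toy, "AA"))
```

Resampling whole proteins rather than individual residues keeps the dipeptides of a protein together, so the estimate reflects variation between proteins.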

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski