Amino acid dipepetide frequency for Gallaecimonas xiamenensis 3-C-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.926AlaAla: 11.926 ± 0.147
1.06AlaCys: 1.06 ± 0.038
6.063AlaAsp: 6.063 ± 0.092
6.356AlaGlu: 6.356 ± 0.094
3.995AlaPhe: 3.995 ± 0.065
8.721AlaGly: 8.721 ± 0.095
1.867AlaHis: 1.867 ± 0.038
5.261AlaIle: 5.261 ± 0.072
5.271AlaLys: 5.271 ± 0.086
15.545AlaLeu: 15.545 ± 0.172
3.299AlaMet: 3.299 ± 0.051
3.21AlaAsn: 3.21 ± 0.063
4.677AlaPro: 4.677 ± 0.073
5.706AlaGln: 5.706 ± 0.084
5.893AlaArg: 5.893 ± 0.081
6.124AlaSer: 6.124 ± 0.083
4.726AlaThr: 4.726 ± 0.061
7.104AlaVal: 7.104 ± 0.081
1.577AlaTrp: 1.577 ± 0.039
2.496AlaTyr: 2.496 ± 0.056
0.0AlaXaa: 0.0 ± 0.0
Cys
0.834CysAla: 0.834 ± 0.032
0.134CysCys: 0.134 ± 0.011
0.523CysAsp: 0.523 ± 0.033
0.412CysGlu: 0.412 ± 0.017
0.374CysPhe: 0.374 ± 0.017
0.831CysGly: 0.831 ± 0.027
0.294CysHis: 0.294 ± 0.015
0.395CysIle: 0.395 ± 0.017
0.249CysLys: 0.249 ± 0.016
1.122CysLeu: 1.122 ± 0.028
0.169CysMet: 0.169 ± 0.011
0.267CysAsn: 0.267 ± 0.016
0.536CysPro: 0.536 ± 0.033
0.642CysGln: 0.642 ± 0.034
0.618CysArg: 0.618 ± 0.026
0.539CysSer: 0.539 ± 0.021
0.444CysThr: 0.444 ± 0.018
0.491CysVal: 0.491 ± 0.02
0.149CysTrp: 0.149 ± 0.011
0.319CysTyr: 0.319 ± 0.016
0.0CysXaa: 0.0 ± 0.0
Asp
5.956AspAla: 5.956 ± 0.078
0.503AspCys: 0.503 ± 0.018
3.451AspAsp: 3.451 ± 0.074
3.292AspGlu: 3.292 ± 0.054
2.549AspPhe: 2.549 ± 0.05
5.122AspGly: 5.122 ± 0.128
1.148AspHis: 1.148 ± 0.034
2.941AspIle: 2.941 ± 0.052
3.065AspLys: 3.065 ± 0.051
6.126AspLeu: 6.126 ± 0.073
1.37AspMet: 1.37 ± 0.032
2.249AspAsn: 2.249 ± 0.065
2.872AspPro: 2.872 ± 0.045
2.787AspGln: 2.787 ± 0.06
2.966AspArg: 2.966 ± 0.048
3.452AspSer: 3.452 ± 0.06
2.597AspThr: 2.597 ± 0.057
3.24AspVal: 3.24 ± 0.057
1.206AspTrp: 1.206 ± 0.034
1.897AspTyr: 1.897 ± 0.042
0.0AspXaa: 0.0 ± 0.0
Glu
6.468GluAla: 6.468 ± 0.091
0.323GluCys: 0.323 ± 0.017
2.787GluAsp: 2.787 ± 0.055
3.243GluGlu: 3.243 ± 0.074
1.503GluPhe: 1.503 ± 0.035
4.018GluGly: 4.018 ± 0.062
1.232GluHis: 1.232 ± 0.034
2.169GluIle: 2.169 ± 0.046
2.391GluLys: 2.391 ± 0.053
6.821GluLeu: 6.821 ± 0.095
1.192GluMet: 1.192 ± 0.037
1.436GluAsn: 1.436 ± 0.03
2.304GluPro: 2.304 ± 0.043
3.406GluGln: 3.406 ± 0.065
3.756GluArg: 3.756 ± 0.074
2.502GluSer: 2.502 ± 0.045
2.088GluThr: 2.088 ± 0.043
4.127GluVal: 4.127 ± 0.066
0.505GluTrp: 0.505 ± 0.021
1.05GluTyr: 1.05 ± 0.032
0.0GluXaa: 0.0 ± 0.0
Phe
3.997PheAla: 3.997 ± 0.06
0.447PheCys: 0.447 ± 0.018
2.764PheAsp: 2.764 ± 0.06
2.203PheGlu: 2.203 ± 0.044
1.566PhePhe: 1.566 ± 0.044
3.305PheGly: 3.305 ± 0.06
0.72PheHis: 0.72 ± 0.027
1.712PheIle: 1.712 ± 0.044
1.587PheLys: 1.587 ± 0.035
3.333PheLeu: 3.333 ± 0.058
0.859PheMet: 0.859 ± 0.028
1.478PheAsn: 1.478 ± 0.034
1.378PhePro: 1.378 ± 0.034
1.255PheGln: 1.255 ± 0.034
1.675PheArg: 1.675 ± 0.038
2.565PheSer: 2.565 ± 0.045
1.93PheThr: 1.93 ± 0.044
2.424PheVal: 2.424 ± 0.052
0.642PheTrp: 0.642 ± 0.025
1.193PheTyr: 1.193 ± 0.035
0.0PheXaa: 0.0 ± 0.0
Gly
7.468GlyAla: 7.468 ± 0.097
0.869GlyCys: 0.869 ± 0.026
4.71GlyAsp: 4.71 ± 0.098
4.57GlyGlu: 4.57 ± 0.064
3.458GlyPhe: 3.458 ± 0.056
5.972GlyGly: 5.972 ± 0.084
1.955GlyHis: 1.955 ± 0.049
4.007GlyIle: 4.007 ± 0.07
3.738GlyLys: 3.738 ± 0.06
9.775GlyLeu: 9.775 ± 0.102
1.948GlyMet: 1.948 ± 0.039
2.513GlyAsn: 2.513 ± 0.057
2.937GlyPro: 2.937 ± 0.055
4.972GlyGln: 4.972 ± 0.075
4.65GlyArg: 4.65 ± 0.066
4.601GlySer: 4.601 ± 0.076
3.789GlyThr: 3.789 ± 0.06
5.41GlyVal: 5.41 ± 0.075
1.338GlyTrp: 1.338 ± 0.037
2.538GlyTyr: 2.538 ± 0.056
0.0GlyXaa: 0.0 ± 0.0
His
1.485HisAla: 1.485 ± 0.033
0.355HisCys: 0.355 ± 0.017
0.968HisAsp: 0.968 ± 0.029
0.839HisGlu: 0.839 ± 0.029
1.132HisPhe: 1.132 ± 0.033
1.744HisGly: 1.744 ± 0.04
0.707HisHis: 0.707 ± 0.025
0.982HisIle: 0.982 ± 0.028
0.83HisLys: 0.83 ± 0.026
2.678HisLeu: 2.678 ± 0.049
0.473HisMet: 0.473 ± 0.02
0.682HisAsn: 0.682 ± 0.022
1.365HisPro: 1.365 ± 0.035
1.467HisGln: 1.467 ± 0.034
1.281HisArg: 1.281 ± 0.039
1.296HisSer: 1.296 ± 0.032
0.762HisThr: 0.762 ± 0.027
0.955HisVal: 0.955 ± 0.031
0.557HisTrp: 0.557 ± 0.025
0.932HisTyr: 0.932 ± 0.028
0.0HisXaa: 0.0 ± 0.0
Ile
4.815IleAla: 4.815 ± 0.067
0.455IleCys: 0.455 ± 0.019
3.013IleAsp: 3.013 ± 0.052
2.7IleGlu: 2.7 ± 0.054
1.409IlePhe: 1.409 ± 0.037
3.555IleGly: 3.555 ± 0.07
0.888IleHis: 0.888 ± 0.026
1.928IleIle: 1.928 ± 0.048
2.164IleLys: 2.164 ± 0.051
3.793IleLeu: 3.793 ± 0.057
0.777IleMet: 0.777 ± 0.026
1.797IleAsn: 1.797 ± 0.044
1.981IlePro: 1.981 ± 0.05
1.607IleGln: 1.607 ± 0.037
2.438IleArg: 2.438 ± 0.052
2.78IleSer: 2.78 ± 0.052
2.339IleThr: 2.339 ± 0.046
2.394IleVal: 2.394 ± 0.051
0.515IleTrp: 0.515 ± 0.021
1.173IleTyr: 1.173 ± 0.03
0.0IleXaa: 0.0 ± 0.0
Lys
6.228LysAla: 6.228 ± 0.086
0.207LysCys: 0.207 ± 0.015
2.838LysAsp: 2.838 ± 0.058
2.397LysGlu: 2.397 ± 0.05
0.885LysPhe: 0.885 ± 0.03
3.886LysGly: 3.886 ± 0.068
0.8LysHis: 0.8 ± 0.025
1.458LysIle: 1.458 ± 0.039
1.732LysLys: 1.732 ± 0.047
4.413LysLeu: 4.413 ± 0.071
0.816LysMet: 0.816 ± 0.028
1.031LysAsn: 1.031 ± 0.033
2.127LysPro: 2.127 ± 0.043
1.469LysGln: 1.469 ± 0.04
2.369LysArg: 2.369 ± 0.044
1.923LysSer: 1.923 ± 0.045
2.009LysThr: 2.009 ± 0.04
3.753LysVal: 3.753 ± 0.057
0.419LysTrp: 0.419 ± 0.016
0.779LysTyr: 0.779 ± 0.03
0.0LysXaa: 0.0 ± 0.0
Leu
17.358LeuAla: 17.358 ± 0.183
1.271LeuCys: 1.271 ± 0.035
7.574LeuAsp: 7.574 ± 0.08
6.042LeuGlu: 6.042 ± 0.087
4.198LeuPhe: 4.198 ± 0.061
10.411LeuGly: 10.411 ± 0.109
2.23LeuHis: 2.23 ± 0.049
4.428LeuIle: 4.428 ± 0.07
5.234LeuLys: 5.234 ± 0.071
14.22LeuLeu: 14.22 ± 0.181
2.921LeuMet: 2.921 ± 0.056
3.408LeuAsn: 3.408 ± 0.052
6.359LeuPro: 6.359 ± 0.092
3.844LeuGln: 3.844 ± 0.063
5.743LeuArg: 5.743 ± 0.087
7.597LeuSer: 7.597 ± 0.095
6.147LeuThr: 6.147 ± 0.073
9.518LeuVal: 9.518 ± 0.112
1.788LeuTrp: 1.788 ± 0.046
2.929LeuTyr: 2.929 ± 0.051
0.0LeuXaa: 0.0 ± 0.0
Met
3.161MetAla: 3.161 ± 0.048
0.098MetCys: 0.098 ± 0.008
1.361MetAsp: 1.361 ± 0.035
1.191MetGlu: 1.191 ± 0.03
0.559MetPhe: 0.559 ± 0.019
1.792MetGly: 1.792 ± 0.036
0.412MetHis: 0.412 ± 0.021
0.8MetIle: 0.8 ± 0.029
1.106MetLys: 1.106 ± 0.033
2.671MetLeu: 2.671 ± 0.046
0.613MetMet: 0.613 ± 0.025
0.711MetAsn: 0.711 ± 0.025
1.144MetPro: 1.144 ± 0.028
0.956MetGln: 0.956 ± 0.027
1.16MetArg: 1.16 ± 0.028
1.46MetSer: 1.46 ± 0.032
1.465MetThr: 1.465 ± 0.034
1.782MetVal: 1.782 ± 0.037
0.179MetTrp: 0.179 ± 0.013
0.355MetTyr: 0.355 ± 0.018
0.0MetXaa: 0.0 ± 0.0
Asn
2.934AsnAla: 2.934 ± 0.054
0.336AsnCys: 0.336 ± 0.038
1.74AsnAsp: 1.74 ± 0.043
1.305AsnGlu: 1.305 ± 0.037
1.04AsnPhe: 1.04 ± 0.037
2.639AsnGly: 2.639 ± 0.056
0.681AsnHis: 0.681 ± 0.024
1.434AsnIle: 1.434 ± 0.036
1.184AsnLys: 1.184 ± 0.03
3.559AsnLeu: 3.559 ± 0.06
0.602AsnMet: 0.602 ± 0.024
1.059AsnAsn: 1.059 ± 0.034
1.942AsnPro: 1.942 ± 0.041
1.529AsnGln: 1.529 ± 0.037
1.761AsnArg: 1.761 ± 0.042
1.667AsnSer: 1.667 ± 0.04
1.457AsnThr: 1.457 ± 0.036
1.67AsnVal: 1.67 ± 0.037
0.558AsnTrp: 0.558 ± 0.025
0.928AsnTyr: 0.928 ± 0.032
0.0AsnXaa: 0.0 ± 0.0
Pro
5.392ProAla: 5.392 ± 0.079
0.373ProCys: 0.373 ± 0.02
2.942ProAsp: 2.942 ± 0.055
2.852ProGlu: 2.852 ± 0.043
1.871ProPhe: 1.871 ± 0.038
4.005ProGly: 4.005 ± 0.066
0.953ProHis: 0.953 ± 0.031
1.763ProIle: 1.763 ± 0.043
2.08ProLys: 2.08 ± 0.035
5.915ProLeu: 5.915 ± 0.076
1.133ProMet: 1.133 ± 0.031
1.302ProAsn: 1.302 ± 0.031
1.706ProPro: 1.706 ± 0.038
2.459ProGln: 2.459 ± 0.048
2.092ProArg: 2.092 ± 0.048
2.584ProSer: 2.584 ± 0.048
1.755ProThr: 1.755 ± 0.05
3.627ProVal: 3.627 ± 0.058
0.875ProTrp: 0.875 ± 0.027
1.373ProTyr: 1.373 ± 0.036
0.0ProXaa: 0.0 ± 0.0
Gln
6.625GlnAla: 6.625 ± 0.092
0.451GlnCys: 0.451 ± 0.021
2.576GlnAsp: 2.576 ± 0.051
2.331GlnGlu: 2.331 ± 0.048
1.564GlnPhe: 1.564 ± 0.039
5.026GlnGly: 5.026 ± 0.07
1.147GlnHis: 1.147 ± 0.035
1.716GlnIle: 1.716 ± 0.036
1.593GlnLys: 1.593 ± 0.036
6.34GlnLeu: 6.34 ± 0.079
1.106GlnMet: 1.106 ± 0.032
1.09GlnAsn: 1.09 ± 0.03
2.432GlnPro: 2.432 ± 0.046
3.439GlnGln: 3.439 ± 0.083
2.872GlnArg: 2.872 ± 0.055
2.861GlnSer: 2.861 ± 0.055
1.9GlnThr: 1.9 ± 0.044
3.854GlnVal: 3.854 ± 0.053
0.975GlnTrp: 0.975 ± 0.032
1.468GlnTyr: 1.468 ± 0.037
0.0GlnXaa: 0.0 ± 0.0
Arg
4.722ArgAla: 4.722 ± 0.068
0.511ArgCys: 0.511 ± 0.02
3.071ArgAsp: 3.071 ± 0.055
2.862ArgGlu: 2.862 ± 0.059
2.914ArgPhe: 2.914 ± 0.062
3.263ArgGly: 3.263 ± 0.055
1.756ArgHis: 1.756 ± 0.04
2.758ArgIle: 2.758 ± 0.053
1.81ArgLys: 1.81 ± 0.046
7.745ArgLeu: 7.745 ± 0.108
1.187ArgMet: 1.187 ± 0.027
1.535ArgAsn: 1.535 ± 0.033
2.407ArgPro: 2.407 ± 0.04
3.867ArgGln: 3.867 ± 0.068
3.702ArgArg: 3.702 ± 0.06
2.682ArgSer: 2.682 ± 0.042
2.207ArgThr: 2.207 ± 0.042
3.43ArgVal: 3.43 ± 0.058
1.007ArgTrp: 1.007 ± 0.028
2.181ArgTyr: 2.181 ± 0.049
0.0ArgXaa: 0.0 ± 0.0
Ser
5.728SerAla: 5.728 ± 0.071
0.536SerCys: 0.536 ± 0.023
3.315SerAsp: 3.315 ± 0.06
2.807SerGlu: 2.807 ± 0.049
2.329SerPhe: 2.329 ± 0.046
4.895SerGly: 4.895 ± 0.075
1.433SerHis: 1.433 ± 0.036
2.291SerIle: 2.291 ± 0.051
1.894SerLys: 1.894 ± 0.04
7.595SerLeu: 7.595 ± 0.099
1.102SerMet: 1.102 ± 0.031
1.654SerAsn: 1.654 ± 0.048
2.988SerPro: 2.988 ± 0.053
3.122SerGln: 3.122 ± 0.056
3.438SerArg: 3.438 ± 0.053
3.167SerSer: 3.167 ± 0.069
2.49SerThr: 2.49 ± 0.046
3.441SerVal: 3.441 ± 0.057
0.931SerTrp: 0.931 ± 0.028
1.712SerTyr: 1.712 ± 0.044
0.0SerXaa: 0.0 ± 0.0
Thr
4.517ThrAla: 4.517 ± 0.068
0.314ThrCys: 0.314 ± 0.017
2.796ThrAsp: 2.796 ± 0.059
2.406ThrGlu: 2.406 ± 0.048
1.698ThrPhe: 1.698 ± 0.043
4.136ThrGly: 4.136 ± 0.062
0.841ThrHis: 0.841 ± 0.028
1.871ThrIle: 1.871 ± 0.049
1.448ThrLys: 1.448 ± 0.04
7.034ThrLeu: 7.034 ± 0.089
0.824ThrMet: 0.824 ± 0.029
1.212ThrAsn: 1.212 ± 0.032
2.553ThrPro: 2.553 ± 0.046
2.371ThrGln: 2.371 ± 0.047
2.326ThrArg: 2.326 ± 0.048
2.359ThrSer: 2.359 ± 0.048
2.077ThrThr: 2.077 ± 0.048
3.278ThrVal: 3.278 ± 0.059
0.601ThrTrp: 0.601 ± 0.021
1.075ThrTyr: 1.075 ± 0.032
0.0ThrXaa: 0.0 ± 0.0
Val
7.908ValAla: 7.908 ± 0.082
0.598ValCys: 0.598 ± 0.022
4.059ValAsp: 4.059 ± 0.062
3.889ValGlu: 3.889 ± 0.055
2.292ValPhe: 2.292 ± 0.046
4.606ValGly: 4.606 ± 0.064
1.261ValHis: 1.261 ± 0.036
3.19ValIle: 3.19 ± 0.057
2.983ValLys: 2.983 ± 0.055
8.235ValLeu: 8.235 ± 0.093
1.795ValMet: 1.795 ± 0.041
2.255ValAsn: 2.255 ± 0.046
3.154ValPro: 3.154 ± 0.056
2.758ValGln: 2.758 ± 0.048
3.72ValArg: 3.72 ± 0.059
4.333ValSer: 4.333 ± 0.056
3.92ValThr: 3.92 ± 0.065
5.365ValVal: 5.365 ± 0.082
0.721ValTrp: 0.721 ± 0.028
1.464ValTyr: 1.464 ± 0.043
0.0ValXaa: 0.0 ± 0.0
Trp
1.191TrpAla: 1.191 ± 0.032
0.176TrpCys: 0.176 ± 0.014
0.74TrpAsp: 0.74 ± 0.027
0.519TrpGlu: 0.519 ± 0.022
0.554TrpPhe: 0.554 ± 0.019
0.963TrpGly: 0.963 ± 0.029
0.48TrpHis: 0.48 ± 0.022
0.456TrpIle: 0.456 ± 0.02
0.311TrpLys: 0.311 ± 0.019
2.727TrpLeu: 2.727 ± 0.064
0.313TrpMet: 0.313 ± 0.017
0.347TrpAsn: 0.347 ± 0.016
0.789TrpPro: 0.789 ± 0.028
1.605TrpGln: 1.605 ± 0.043
1.071TrpArg: 1.071 ± 0.034
0.712TrpSer: 0.712 ± 0.025
0.618TrpThr: 0.618 ± 0.021
1.085TrpVal: 1.085 ± 0.028
0.27TrpTrp: 0.27 ± 0.016
0.4TrpTyr: 0.4 ± 0.019
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.167TyrAla: 2.167 ± 0.047
0.327TyrCys: 0.327 ± 0.016
1.498TyrAsp: 1.498 ± 0.046
1.103TyrGlu: 1.103 ± 0.033
1.142TyrPhe: 1.142 ± 0.032
2.326TyrGly: 2.326 ± 0.049
0.682TyrHis: 0.682 ± 0.027
0.96TyrIle: 0.96 ± 0.034
0.868TyrLys: 0.868 ± 0.035
3.573TyrLeu: 3.573 ± 0.061
0.481TyrMet: 0.481 ± 0.018
0.786TyrAsn: 0.786 ± 0.027
1.403TyrPro: 1.403 ± 0.029
2.105TyrGln: 2.105 ± 0.046
2.087TyrArg: 2.087 ± 0.051
1.665TyrSer: 1.665 ± 0.043
1.017TyrThr: 1.017 ± 0.032
1.558TyrVal: 1.558 ± 0.036
0.514TyrTrp: 0.514 ± 0.021
0.91TyrTyr: 0.91 ± 0.031
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3798 proteins (1224611 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski