Amino acid dipeptide frequency for Polymorphobacter fuscus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
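As an illustration of what such a calculation might look like, here is a minimal Python sketch. It is not the code used by Proteome-pI. It assumes that dipeptides are counted as overlapping pairs within each protein (never across two proteins) and that any residue outside the 20 standard ones is mapped to Xaa. The function name `dipeptide_permille` is made up for this example.

```python
from collections import Counter

# One-letter to three-letter codes for the 20 standard amino acids.
ONE_TO_THREE = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr",
}


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Anything that is not a standard residue becomes Xaa (assumption).
        codes = [ONE_TO_THREE.get(aa, "Xaa") for aa in seq.upper()]
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(a + b for a, b in zip(codes, codes[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: two short, invented sequences.
freqs = dipeptide_permille(["MAAGL", "AAXA"])
```

With the toy input above, `freqs["AlaAla"]` is 2 out of 7 dipeptides, expressed in per mille.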

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
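The arithmetic behind the uniform expectation can be sketched as follows. This assumes that the red/blue marking simply compares each value with the uniform expectation, as the text suggests; the helper names are hypothetical, and the two observed values are taken from the table on this page.

```python
def uniform_expectation_permille(n_residue_types):
    """Expected frequency (per mille) if all ordered pairs were equally likely."""
    return 1000.0 / (n_residue_types ** 2)


def classify(observed_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected_permille:
        return "overrepresented"
    if observed_permille < expected_permille:
        return "underrepresented"
    return "as expected"


expected_20 = uniform_expectation_permille(20)  # 400 dipeptides
expected_21 = uniform_expectation_permille(21)  # 441 dipeptides, with Xaa

ala_ala = classify(25.457, expected_20)  # AlaAla value from the table
cys_cys = classify(0.1, expected_20)     # CysCys value from the table
```

For 20 residue types the expectation is 2.5‰ (0.25%); including Xaa lowers it slightly, to about 2.27‰.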

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Rows: first residue; columns: second residue. Each entry gives the dipeptide, its frequency (‰) and the estimated error.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 25.457 ± 0.286
AlaCys: 1.02 ± 0.033
AlaAsp: 9.027 ± 0.097
AlaGlu: 7.262 ± 0.105
AlaPhe: 4.371 ± 0.078
AlaGly: 13.763 ± 0.128
AlaHis: 2.429 ± 0.057
AlaIle: 7.14 ± 0.092
AlaLys: 3.353 ± 0.073
AlaLeu: 15.303 ± 0.159
AlaMet: 4.269 ± 0.08
AlaAsn: 3.214 ± 0.062
AlaPro: 7.636 ± 0.113
AlaGln: 4.327 ± 0.077
AlaArg: 11.261 ± 0.148
AlaSer: 6.491 ± 0.087
AlaThr: 7.831 ± 0.08
AlaVal: 10.492 ± 0.13
AlaTrp: 1.817 ± 0.04
AlaTyr: 2.504 ± 0.055
AlaXaa: 0.001 ± 0.001

Cys
CysAla: 0.979 ± 0.032
CysCys: 0.1 ± 0.009
CysAsp: 0.484 ± 0.02
CysGlu: 0.305 ± 0.017
CysPhe: 0.258 ± 0.019
CysGly: 0.797 ± 0.031
CysHis: 0.185 ± 0.014
CysIle: 0.323 ± 0.019
CysLys: 0.119 ± 0.011
CysLeu: 0.641 ± 0.026
CysMet: 0.111 ± 0.01
CysAsn: 0.211 ± 0.014
CysPro: 0.437 ± 0.022
CysGln: 0.189 ± 0.014
CysArg: 0.576 ± 0.029
CysSer: 0.355 ± 0.017
CysThr: 0.397 ± 0.023
CysVal: 0.503 ± 0.021
CysTrp: 0.106 ± 0.01
CysTyr: 0.14 ± 0.011
CysXaa: 0.0 ± 0.0

Asp
AspAla: 8.789 ± 0.095
AspCys: 0.453 ± 0.02
AspAsp: 3.459 ± 0.061
AspGlu: 2.522 ± 0.055
AspPhe: 2.321 ± 0.052
AspGly: 5.774 ± 0.093
AspHis: 1.275 ± 0.039
AspIle: 2.965 ± 0.063
AspLys: 1.477 ± 0.035
AspLeu: 5.618 ± 0.073
AspMet: 1.332 ± 0.035
AspAsn: 1.399 ± 0.043
AspPro: 3.797 ± 0.059
AspGln: 1.548 ± 0.038
AspArg: 5.056 ± 0.076
AspSer: 2.466 ± 0.053
AspThr: 2.951 ± 0.053
AspVal: 4.291 ± 0.073
AspTrp: 1.177 ± 0.039
AspTyr: 1.537 ± 0.047
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.822 ± 0.093
GluCys: 0.237 ± 0.016
GluAsp: 1.997 ± 0.044
GluGlu: 1.594 ± 0.047
GluPhe: 1.314 ± 0.038
GluGly: 3.439 ± 0.065
GluHis: 0.852 ± 0.031
GluIle: 2.559 ± 0.057
GluLys: 1.342 ± 0.039
GluLeu: 4.038 ± 0.065
GluMet: 1.159 ± 0.036
GluAsn: 1.081 ± 0.032
GluPro: 2.23 ± 0.042
GluGln: 1.378 ± 0.034
GluArg: 3.812 ± 0.062
GluSer: 1.921 ± 0.043
GluThr: 3.029 ± 0.051
GluVal: 3.084 ± 0.058
GluTrp: 0.655 ± 0.022
GluTyr: 0.795 ± 0.03
GluXaa: 0.0 ± 0.0

Phe
PheAla: 5.175 ± 0.071
PheCys: 0.297 ± 0.018
PheAsp: 2.858 ± 0.045
PheGlu: 1.724 ± 0.046
PhePhe: 1.206 ± 0.044
PheGly: 3.788 ± 0.07
PheHis: 0.697 ± 0.028
PheIle: 1.453 ± 0.041
PheLys: 0.824 ± 0.029
PheLeu: 2.832 ± 0.062
PheMet: 0.633 ± 0.025
PheAsn: 1.048 ± 0.039
PhePro: 1.496 ± 0.042
PheGln: 0.905 ± 0.031
PheArg: 2.125 ± 0.044
PheSer: 1.789 ± 0.045
PheThr: 2.086 ± 0.044
PheVal: 2.652 ± 0.053
PheTrp: 0.506 ± 0.025
PheTyr: 0.87 ± 0.031
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 11.292 ± 0.132
GlyCys: 0.79 ± 0.029
GlyAsp: 5.319 ± 0.082
GlyGlu: 3.959 ± 0.066
GlyPhe: 3.875 ± 0.063
GlyGly: 8.931 ± 0.294
GlyHis: 1.98 ± 0.046
GlyIle: 4.551 ± 0.078
GlyLys: 2.891 ± 0.067
GlyLeu: 9.247 ± 0.111
GlyMet: 2.202 ± 0.047
GlyAsn: 2.461 ± 0.076
GlyPro: 4.088 ± 0.07
GlyGln: 2.963 ± 0.054
GlyArg: 6.72 ± 0.092
GlySer: 4.911 ± 0.079
GlyThr: 5.173 ± 0.097
GlyVal: 7.03 ± 0.087
GlyTrp: 1.724 ± 0.047
GlyTyr: 2.235 ± 0.062
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.44 ± 0.054
HisCys: 0.185 ± 0.013
HisAsp: 1.274 ± 0.039
HisGlu: 0.754 ± 0.031
HisPhe: 0.763 ± 0.033
HisGly: 2.055 ± 0.047
HisHis: 0.487 ± 0.024
HisIle: 0.893 ± 0.031
HisLys: 0.374 ± 0.021
HisLeu: 1.75 ± 0.04
HisMet: 0.43 ± 0.024
HisAsn: 0.446 ± 0.023
HisPro: 1.231 ± 0.038
HisGln: 0.501 ± 0.022
HisArg: 1.46 ± 0.039
HisSer: 0.873 ± 0.034
HisThr: 0.667 ± 0.025
HisVal: 1.529 ± 0.041
HisTrp: 0.342 ± 0.021
HisTyr: 0.499 ± 0.021
HisXaa: 0.0 ± 0.0

Ile
IleAla: 8.139 ± 0.097
IleCys: 0.354 ± 0.02
IleAsp: 4.043 ± 0.065
IleGlu: 2.819 ± 0.06
IlePhe: 1.355 ± 0.041
IleGly: 5.364 ± 0.088
IleHis: 0.795 ± 0.032
IleIle: 2.302 ± 0.056
IleLys: 1.127 ± 0.036
IleLeu: 3.595 ± 0.061
IleMet: 0.77 ± 0.028
IleAsn: 1.372 ± 0.034
IlePro: 2.054 ± 0.042
IleGln: 0.912 ± 0.036
IleArg: 2.773 ± 0.056
IleSer: 2.22 ± 0.048
IleThr: 2.7 ± 0.054
IleVal: 4.168 ± 0.063
IleTrp: 0.584 ± 0.024
IleTyr: 0.897 ± 0.028
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.788 ± 0.071
LysCys: 0.134 ± 0.013
LysAsp: 1.327 ± 0.044
LysGlu: 0.793 ± 0.028
LysPhe: 0.747 ± 0.026
LysGly: 2.193 ± 0.059
LysHis: 0.422 ± 0.02
LysIle: 1.122 ± 0.037
LysLys: 0.766 ± 0.031
LysLeu: 2.678 ± 0.06
LysMet: 0.639 ± 0.023
LysAsn: 0.565 ± 0.026
LysPro: 1.739 ± 0.049
LysGln: 0.728 ± 0.029
LysArg: 1.851 ± 0.044
LysSer: 1.303 ± 0.042
LysThr: 1.581 ± 0.041
LysVal: 1.937 ± 0.049
LysTrp: 0.326 ± 0.018
LysTyr: 0.507 ± 0.024
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 16.436 ± 0.171
LeuCys: 0.702 ± 0.029
LeuAsp: 5.799 ± 0.093
LeuGlu: 3.728 ± 0.067
LeuPhe: 3.394 ± 0.06
LeuGly: 8.577 ± 0.101
LeuHis: 1.802 ± 0.045
LeuIle: 4.216 ± 0.075
LeuLys: 2.805 ± 0.062
LeuLeu: 9.557 ± 0.123
LeuMet: 1.959 ± 0.047
LeuAsn: 2.129 ± 0.045
LeuPro: 6.052 ± 0.076
LeuGln: 2.512 ± 0.047
LeuArg: 6.785 ± 0.08
LeuSer: 5.131 ± 0.08
LeuThr: 5.859 ± 0.077
LeuVal: 8.182 ± 0.11
LeuTrp: 1.302 ± 0.045
LeuTyr: 1.828 ± 0.05
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.685 ± 0.067
MetCys: 0.133 ± 0.012
MetAsp: 0.979 ± 0.032
MetGlu: 0.798 ± 0.029
MetPhe: 0.712 ± 0.026
MetGly: 1.785 ± 0.047
MetHis: 0.408 ± 0.017
MetIle: 1.23 ± 0.035
MetLys: 0.728 ± 0.025
MetLeu: 2.451 ± 0.059
MetMet: 0.638 ± 0.028
MetAsn: 0.556 ± 0.024
MetPro: 1.458 ± 0.033
MetGln: 0.67 ± 0.027
MetArg: 1.688 ± 0.039
MetSer: 1.213 ± 0.036
MetThr: 1.951 ± 0.042
MetVal: 1.73 ± 0.046
MetTrp: 0.198 ± 0.015
MetTyr: 0.28 ± 0.017
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.21 ± 0.066
AsnCys: 0.235 ± 0.015
AsnAsp: 1.32 ± 0.035
AsnGlu: 0.875 ± 0.034
AsnPhe: 0.985 ± 0.035
AsnGly: 2.387 ± 0.082
AsnHis: 0.442 ± 0.021
AsnIle: 1.367 ± 0.041
AsnLys: 0.575 ± 0.027
AsnLeu: 2.377 ± 0.06
AsnMet: 0.523 ± 0.022
AsnAsn: 0.762 ± 0.042
AsnPro: 1.78 ± 0.039
AsnGln: 0.638 ± 0.026
AsnArg: 1.778 ± 0.047
AsnSer: 1.204 ± 0.042
AsnThr: 1.142 ± 0.043
AsnVal: 1.877 ± 0.05
AsnTrp: 0.41 ± 0.022
AsnTyr: 0.675 ± 0.033
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 8.836 ± 0.122
ProCys: 0.336 ± 0.018
ProAsp: 4.069 ± 0.079
ProGlu: 2.946 ± 0.055
ProPhe: 1.929 ± 0.046
ProGly: 5.702 ± 0.071
ProHis: 0.997 ± 0.031
ProIle: 2.226 ± 0.04
ProLys: 1.301 ± 0.039
ProLeu: 5.176 ± 0.083
ProMet: 1.275 ± 0.037
ProAsn: 1.206 ± 0.038
ProPro: 3.236 ± 0.086
ProGln: 1.646 ± 0.037
ProArg: 3.531 ± 0.062
ProSer: 2.418 ± 0.056
ProThr: 2.743 ± 0.06
ProVal: 4.923 ± 0.073
ProTrp: 0.767 ± 0.031
ProTyr: 1.012 ± 0.038
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.033 ± 0.061
GlnCys: 0.204 ± 0.014
GlnAsp: 1.216 ± 0.035
GlnGlu: 0.898 ± 0.034
GlnPhe: 1.002 ± 0.027
GlnGly: 2.411 ± 0.044
GlnHis: 0.481 ± 0.025
GlnIle: 1.393 ± 0.04
GlnLys: 0.727 ± 0.029
GlnLeu: 3.08 ± 0.057
GlnMet: 0.714 ± 0.02
GlnAsn: 0.657 ± 0.026
GlnPro: 1.86 ± 0.047
GlnGln: 0.987 ± 0.036
GlnArg: 2.421 ± 0.052
GlnSer: 1.542 ± 0.044
GlnThr: 1.541 ± 0.042
GlnVal: 2.258 ± 0.04
GlnTrp: 0.413 ± 0.018
GlnTyr: 0.58 ± 0.026
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 9.48 ± 0.107
ArgCys: 0.504 ± 0.022
ArgAsp: 4.437 ± 0.072
ArgGlu: 2.949 ± 0.055
ArgPhe: 3.141 ± 0.056
ArgGly: 5.304 ± 0.077
ArgHis: 1.763 ± 0.047
ArgIle: 4.04 ± 0.074
ArgLys: 1.591 ± 0.049
ArgLeu: 8.6 ± 0.115
ArgMet: 1.884 ± 0.048
ArgAsn: 1.837 ± 0.042
ArgPro: 4.083 ± 0.066
ArgGln: 2.405 ± 0.053
ArgArg: 5.992 ± 0.091
ArgSer: 3.626 ± 0.059
ArgThr: 3.673 ± 0.066
ArgVal: 5.129 ± 0.071
ArgTrp: 1.21 ± 0.037
ArgTyr: 1.765 ± 0.043
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.287 ± 0.082
SerCys: 0.351 ± 0.02
SerAsp: 2.906 ± 0.053
SerGlu: 2.031 ± 0.051
SerPhe: 1.999 ± 0.049
SerGly: 5.2 ± 0.088
SerHis: 0.913 ± 0.029
SerIle: 2.37 ± 0.053
SerLys: 1.176 ± 0.041
SerLeu: 4.753 ± 0.076
SerMet: 1.041 ± 0.035
SerAsn: 1.352 ± 0.049
SerPro: 2.727 ± 0.054
SerGln: 1.414 ± 0.032
SerArg: 3.288 ± 0.057
SerSer: 2.241 ± 0.065
SerThr: 2.417 ± 0.062
SerVal: 3.787 ± 0.056
SerTrp: 0.712 ± 0.028
SerTyr: 1.193 ± 0.038
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 8.006 ± 0.088
ThrCys: 0.357 ± 0.02
ThrAsp: 3.095 ± 0.055
ThrGlu: 2.206 ± 0.044
ThrPhe: 1.701 ± 0.044
ThrGly: 5.889 ± 0.091
ThrHis: 0.907 ± 0.027
ThrIle: 2.932 ± 0.062
ThrLys: 1.138 ± 0.035
ThrLeu: 6.034 ± 0.084
ThrMet: 1.276 ± 0.034
ThrAsn: 1.334 ± 0.042
ThrPro: 4.03 ± 0.064
ThrGln: 1.357 ± 0.037
ThrArg: 3.842 ± 0.059
ThrSer: 2.686 ± 0.069
ThrThr: 3.254 ± 0.065
ThrVal: 4.448 ± 0.068
ThrTrp: 0.628 ± 0.028
ThrTyr: 1.068 ± 0.039
ThrXaa: 0.001 ± 0.001

Val
ValAla: 11.867 ± 0.124
ValCys: 0.524 ± 0.024
ValAsp: 4.424 ± 0.065
ValGlu: 3.778 ± 0.065
ValPhe: 2.534 ± 0.052
ValGly: 5.978 ± 0.093
ValHis: 1.362 ± 0.039
ValIle: 3.923 ± 0.056
ValLys: 1.906 ± 0.045
ValLeu: 7.135 ± 0.092
ValMet: 1.71 ± 0.045
ValAsn: 1.916 ± 0.057
ValPro: 4.543 ± 0.079
ValGln: 1.958 ± 0.047
ValArg: 5.455 ± 0.088
ValSer: 3.902 ± 0.071
ValThr: 5.312 ± 0.075
ValVal: 6.045 ± 0.105
ValTrp: 0.909 ± 0.031
ValTyr: 1.337 ± 0.036
ValXaa: 0.001 ± 0.001

Trp
TrpAla: 1.576 ± 0.048
TrpCys: 0.13 ± 0.013
TrpAsp: 0.781 ± 0.028
TrpGlu: 0.501 ± 0.023
TrpPhe: 0.528 ± 0.025
TrpGly: 0.995 ± 0.036
TrpHis: 0.334 ± 0.02
TrpIle: 0.584 ± 0.022
TrpLys: 0.38 ± 0.019
TrpLeu: 1.849 ± 0.048
TrpMet: 0.346 ± 0.02
TrpAsn: 0.408 ± 0.019
TrpPro: 0.804 ± 0.027
TrpGln: 0.719 ± 0.027
TrpArg: 1.255 ± 0.038
TrpSer: 0.868 ± 0.032
TrpThr: 0.873 ± 0.031
TrpVal: 0.913 ± 0.03
TrpTrp: 0.268 ± 0.019
TrpTyr: 0.283 ± 0.017
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.668 ± 0.058
TyrCys: 0.17 ± 0.013
TyrAsp: 1.393 ± 0.043
TyrGlu: 0.898 ± 0.031
TyrPhe: 0.828 ± 0.031
TyrGly: 2.004 ± 0.053
TyrHis: 0.424 ± 0.022
TyrIle: 0.756 ± 0.029
TyrLys: 0.547 ± 0.027
TyrLeu: 2.04 ± 0.048
TyrMet: 0.38 ± 0.02
TyrAsn: 0.609 ± 0.027
TyrPro: 0.995 ± 0.033
TyrGln: 0.661 ± 0.022
TyrArg: 1.746 ± 0.052
TyrSer: 1.001 ± 0.034
TyrThr: 1.009 ± 0.035
TyrVal: 1.539 ± 0.042
TyrTrp: 0.34 ± 0.018
TyrTyr: 0.53 ± 0.024
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.001 ± 0.001
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.001
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.001
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3101 proteins (1023288 amino acids)
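
To relate the per mille values to absolute numbers, one can sketch the conversion below. It assumes that dipeptides are counted as overlapping pairs within each protein, so a protein of length n contributes n - 1 dipeptides; the page does not state this explicitly, and the resulting counts are therefore only rough estimates.

```python
N_PROTEINS = 3101        # from the statistics line on this page
N_RESIDUES = 1_023_288   # from the statistics line on this page

# Assumption: each protein of length n contributes n - 1 overlapping dipeptides.
n_dipeptides = N_RESIDUES - N_PROTEINS


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def approx_count(value_permille):
    """Rough absolute number of occurrences implied by a per mille value."""
    return round(permille_to_fraction(value_permille) * n_dipeptides)


ala_ala_count = approx_count(25.457)  # AlaAla value from the table
```

Under this assumption the proteome contains 1020187 dipeptides, and the AlaAla value corresponds to roughly 26 thousand occurrences.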

Note: The error has been estimated by bootstrapping (×100) at the protein level.
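
A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled values is reported. This is an illustration under stated assumptions, not the Proteome-pI implementation. In particular, the use of the standard deviation as the error measure, the one-letter pair notation, and the function names are assumptions.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs (one-letter notation) in one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`
    over `n_rounds` resamples of whole proteins (n_rounds must be >= 2)."""
    rng = random.Random(seed)
    per_protein = [pair_counts(seq) for seq in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Resample proteins, not individual dipeptides, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input: four short, invented sequences.
mean_aa, err_aa = bootstrap_error(["AAAA", "AAGA", "GGGG", "MAAL"], "AA")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than between individual positions.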

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski