Amino acid dipeptide frequency for Lacipirellula parvula

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
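As a minimal sketch of how such frequencies could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name, the toy sequences, and the use of one-letter codes (the table uses three-letter codes) are assumptions made for illustration; this is not necessarily how Proteome-pI performs the calculation.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given proteins."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" in one-letter codes (not real data).
toy_proteome = ["MAAL", "MAG"]
freqs = dipeptide_per_mille(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 1))
```

The order of the two residues matters, so "AG" and "GA" are counted separately, just as AlaGly and GlyAla are separate entries in the table.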

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
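The expectation above can be written out as a short calculation: 1/400 equals 0.25%, which is 2.5‰ in the units used by the table. The sketch below applies that threshold to three values taken from the table. The function name and the exact colouring rule (a strict comparison against 2.5‰) are assumptions; the page may use a different cut-off.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "red"   # more frequent than expected
    if observed_per_mille < expected:
        return "blue"  # underrepresented
    return "neutral"


# Values copied from the table, in per mille.
examples = {"AlaAla": 17.835, "CysCys": 0.265, "GluPhe": 2.327}
labels = {name: classify(value) for name, value in examples.items()}
print(labels)
```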

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
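For readers who prefer fractions or percentages, the conversion is a single multiplication. The helper name below is hypothetical; the input value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


ala_ala_fraction = per_mille_to_fraction(17.835)  # AlaAla from the table
ala_ala_percent = ala_ala_fraction * 100.0
print(ala_ala_fraction, ala_ala_percent)
```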

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 17.835 ± 0.151
AlaCys: 1.267 ± 0.028
AlaAsp: 6.769 ± 0.069
AlaGlu: 7.612 ± 0.085
AlaPhe: 3.78 ± 0.048
AlaGly: 9.802 ± 0.093
AlaHis: 1.888 ± 0.031
AlaIle: 6.059 ± 0.064
AlaLys: 4.469 ± 0.066
AlaLeu: 10.015 ± 0.095
AlaMet: 2.79 ± 0.043
AlaAsn: 3.861 ± 0.055
AlaPro: 5.73 ± 0.074
AlaGln: 3.634 ± 0.044
AlaArg: 6.163 ± 0.068
AlaSer: 7.227 ± 0.066
AlaThr: 7.077 ± 0.067
AlaVal: 8.318 ± 0.071
AlaTrp: 1.731 ± 0.036
AlaTyr: 2.375 ± 0.037
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.87 ± 0.021
CysCys: 0.265 ± 0.014
CysAsp: 0.693 ± 0.022
CysGlu: 0.668 ± 0.024
CysPhe: 0.373 ± 0.015
CysGly: 1.096 ± 0.03
CysHis: 0.339 ± 0.017
CysIle: 0.462 ± 0.016
CysLys: 0.301 ± 0.018
CysLeu: 0.956 ± 0.022
CysMet: 0.189 ± 0.009
CysAsn: 0.302 ± 0.014
CysPro: 0.553 ± 0.021
CysGln: 0.342 ± 0.013
CysArg: 0.849 ± 0.023
CysSer: 0.614 ± 0.022
CysThr: 0.488 ± 0.015
CysVal: 0.726 ± 0.019
CysTrp: 0.174 ± 0.01
CysTyr: 0.287 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.942 ± 0.067
AspCys: 0.52 ± 0.02
AspAsp: 3.635 ± 0.054
AspGlu: 3.88 ± 0.056
AspPhe: 2.444 ± 0.033
AspGly: 5.819 ± 0.086
AspHis: 1.179 ± 0.027
AspIle: 1.919 ± 0.033
AspLys: 1.591 ± 0.033
AspLeu: 5.331 ± 0.05
AspMet: 0.979 ± 0.019
AspAsn: 1.57 ± 0.033
AspPro: 3.137 ± 0.044
AspGln: 1.979 ± 0.031
AspArg: 3.784 ± 0.044
AspSer: 2.937 ± 0.041
AspThr: 1.809 ± 0.035
AspVal: 4.299 ± 0.055
AspTrp: 1.054 ± 0.024
AspTyr: 1.596 ± 0.032
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.551 ± 0.069
GluCys: 0.52 ± 0.018
GluAsp: 2.287 ± 0.04
GluGlu: 3.441 ± 0.067
GluPhe: 2.327 ± 0.035
GluGly: 3.826 ± 0.051
GluHis: 1.249 ± 0.028
GluIle: 3.167 ± 0.043
GluLys: 2.256 ± 0.049
GluLeu: 6.998 ± 0.07
GluMet: 1.472 ± 0.027
GluAsn: 1.574 ± 0.027
GluPro: 3.121 ± 0.045
GluGln: 3.043 ± 0.048
GluArg: 4.516 ± 0.057
GluSer: 3.432 ± 0.046
GluThr: 3.137 ± 0.048
GluVal: 4.006 ± 0.052
GluTrp: 0.913 ± 0.021
GluTyr: 1.444 ± 0.027
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.288 ± 0.048
PheCys: 0.457 ± 0.017
PheAsp: 2.643 ± 0.044
PheGlu: 2.127 ± 0.035
PhePhe: 1.32 ± 0.03
PheGly: 3.352 ± 0.043
PheHis: 0.785 ± 0.019
PheIle: 1.491 ± 0.032
PheLys: 1.062 ± 0.026
PheLeu: 3.101 ± 0.044
PheMet: 0.687 ± 0.02
PheAsn: 1.448 ± 0.03
PhePro: 1.575 ± 0.028
PheGln: 1.135 ± 0.026
PheArg: 2.18 ± 0.032
PheSer: 2.279 ± 0.037
PheThr: 2.215 ± 0.038
PheVal: 2.776 ± 0.041
PheTrp: 0.543 ± 0.02
PheTyr: 0.967 ± 0.025
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.633 ± 0.099
GlyCys: 0.963 ± 0.025
GlyAsp: 5.098 ± 0.067
GlyGlu: 4.884 ± 0.059
GlyPhe: 3.048 ± 0.043
GlyGly: 7.741 ± 0.135
GlyHis: 1.551 ± 0.029
GlyIle: 3.717 ± 0.051
GlyLys: 3.372 ± 0.051
GlyLeu: 6.974 ± 0.061
GlyMet: 2.056 ± 0.036
GlyAsn: 2.872 ± 0.067
GlyPro: 3.307 ± 0.052
GlyGln: 2.975 ± 0.047
GlyArg: 5.134 ± 0.051
GlySer: 5.153 ± 0.071
GlyThr: 4.726 ± 0.09
GlyVal: 6.678 ± 0.065
GlyTrp: 1.433 ± 0.026
GlyTyr: 2.262 ± 0.036
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.23 ± 0.04
HisCys: 0.274 ± 0.012
HisAsp: 1.225 ± 0.025
HisGlu: 1.243 ± 0.026
HisPhe: 0.821 ± 0.022
HisGly: 1.806 ± 0.035
HisHis: 0.573 ± 0.019
HisIle: 0.805 ± 0.021
HisLys: 0.482 ± 0.016
HisLeu: 1.966 ± 0.034
HisMet: 0.366 ± 0.014
HisAsn: 0.577 ± 0.017
HisPro: 1.251 ± 0.028
HisGln: 0.733 ± 0.021
HisArg: 1.535 ± 0.028
HisSer: 1.172 ± 0.027
HisThr: 0.848 ± 0.021
HisVal: 1.435 ± 0.03
HisTrp: 0.393 ± 0.015
HisTyr: 0.589 ± 0.018
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.127 ± 0.067
IleCys: 0.551 ± 0.018
IleAsp: 3.525 ± 0.044
IleGlu: 3.309 ± 0.049
IlePhe: 1.44 ± 0.024
IleGly: 4.221 ± 0.053
IleHis: 0.935 ± 0.021
IleIle: 1.873 ± 0.038
IleLys: 1.292 ± 0.031
IleLeu: 3.601 ± 0.048
IleMet: 0.693 ± 0.021
IleAsn: 1.592 ± 0.035
IlePro: 2.257 ± 0.038
IleGln: 1.299 ± 0.03
IleArg: 2.911 ± 0.038
IleSer: 2.794 ± 0.046
IleThr: 2.464 ± 0.044
IleVal: 3.887 ± 0.05
IleTrp: 0.583 ± 0.019
IleTyr: 1.122 ± 0.026
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.27 ± 0.051
LysCys: 0.319 ± 0.016
LysAsp: 1.446 ± 0.031
LysGlu: 1.849 ± 0.04
LysPhe: 1.318 ± 0.03
LysGly: 1.992 ± 0.036
LysHis: 0.703 ± 0.021
LysIle: 1.669 ± 0.033
LysLys: 1.592 ± 0.043
LysLeu: 3.867 ± 0.053
LysMet: 0.824 ± 0.022
LysAsn: 1.045 ± 0.026
LysPro: 2.197 ± 0.04
LysGln: 1.534 ± 0.032
LysArg: 2.298 ± 0.037
LysSer: 2.16 ± 0.038
LysThr: 1.874 ± 0.037
LysVal: 2.211 ± 0.038
LysTrp: 0.494 ± 0.017
LysTyr: 0.96 ± 0.023
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.118 ± 0.109
LeuCys: 1.022 ± 0.022
LeuAsp: 5.364 ± 0.057
LeuGlu: 5.328 ± 0.057
LeuPhe: 3.15 ± 0.044
LeuGly: 7.426 ± 0.062
LeuHis: 1.937 ± 0.036
LeuIle: 4.143 ± 0.054
LeuLys: 3.301 ± 0.044
LeuLeu: 9.778 ± 0.102
LeuMet: 1.863 ± 0.032
LeuAsn: 3.062 ± 0.042
LeuPro: 5.482 ± 0.064
LeuGln: 3.511 ± 0.047
LeuArg: 6.667 ± 0.08
LeuSer: 5.79 ± 0.05
LeuThr: 5.618 ± 0.056
LeuVal: 6.859 ± 0.064
LeuTrp: 1.362 ± 0.039
LeuTyr: 2.135 ± 0.04
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.354 ± 0.037
MetCys: 0.187 ± 0.01
MetAsp: 0.78 ± 0.02
MetGlu: 1.039 ± 0.021
MetPhe: 0.745 ± 0.02
MetGly: 1.418 ± 0.029
MetHis: 0.417 ± 0.016
MetIle: 1.078 ± 0.026
MetLys: 0.907 ± 0.024
MetLeu: 2.362 ± 0.034
MetMet: 0.515 ± 0.018
MetAsn: 0.766 ± 0.024
MetPro: 1.393 ± 0.029
MetGln: 0.833 ± 0.021
MetArg: 1.524 ± 0.03
MetSer: 1.541 ± 0.027
MetThr: 1.413 ± 0.027
MetVal: 1.34 ± 0.028
MetTrp: 0.248 ± 0.013
MetTyr: 0.38 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.445 ± 0.045
AsnCys: 0.349 ± 0.014
AsnAsp: 1.939 ± 0.043
AsnGlu: 1.682 ± 0.029
AsnPhe: 1.307 ± 0.03
AsnGly: 3.353 ± 0.061
AsnHis: 0.668 ± 0.018
AsnIle: 1.374 ± 0.026
AsnLys: 0.79 ± 0.023
AsnLeu: 2.991 ± 0.041
AsnMet: 0.553 ± 0.018
AsnAsn: 1.211 ± 0.044
AsnPro: 2.03 ± 0.039
AsnGln: 1.14 ± 0.027
AsnArg: 2.13 ± 0.035
AsnSer: 2.026 ± 0.048
AsnThr: 1.494 ± 0.036
AsnVal: 2.529 ± 0.045
AsnTrp: 0.577 ± 0.019
AsnTyr: 0.978 ± 0.026
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.01 ± 0.083
ProCys: 0.406 ± 0.014
ProAsp: 2.763 ± 0.042
ProGlu: 3.585 ± 0.045
ProPhe: 1.809 ± 0.03
ProGly: 4.248 ± 0.055
ProHis: 1.109 ± 0.027
ProIle: 2.437 ± 0.04
ProLys: 1.737 ± 0.034
ProLeu: 4.939 ± 0.054
ProMet: 1.047 ± 0.026
ProAsn: 1.788 ± 0.037
ProPro: 3.102 ± 0.057
ProGln: 2.109 ± 0.039
ProArg: 2.989 ± 0.044
ProSer: 3.364 ± 0.045
ProThr: 3.409 ± 0.038
ProVal: 3.615 ± 0.051
ProTrp: 0.774 ± 0.019
ProTyr: 1.163 ± 0.025
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.091 ± 0.046
GlnCys: 0.338 ± 0.015
GlnAsp: 1.26 ± 0.025
GlnGlu: 1.782 ± 0.034
GlnPhe: 1.545 ± 0.028
GlnGly: 2.456 ± 0.039
GlnHis: 0.826 ± 0.022
GlnIle: 1.895 ± 0.035
GlnLys: 1.147 ± 0.03
GlnLeu: 4.53 ± 0.053
GlnMet: 0.881 ± 0.018
GlnAsn: 1.083 ± 0.027
GlnPro: 2.235 ± 0.034
GlnGln: 2.046 ± 0.041
GlnArg: 3.033 ± 0.041
GlnSer: 2.209 ± 0.034
GlnThr: 2.078 ± 0.032
GlnVal: 2.511 ± 0.038
GlnTrp: 0.61 ± 0.019
GlnTyr: 0.881 ± 0.023
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.77 ± 0.059
ArgCys: 0.695 ± 0.018
ArgAsp: 3.572 ± 0.049
ArgGlu: 4.213 ± 0.055
ArgPhe: 2.649 ± 0.039
ArgGly: 4.58 ± 0.048
ArgHis: 1.521 ± 0.029
ArgIle: 3.348 ± 0.043
ArgLys: 2.165 ± 0.039
ArgLeu: 7.092 ± 0.083
ArgMet: 1.706 ± 0.027
ArgAsn: 2.016 ± 0.035
ArgPro: 3.218 ± 0.053
ArgGln: 2.919 ± 0.048
ArgArg: 5.992 ± 0.08
ArgSer: 3.965 ± 0.055
ArgThr: 3.45 ± 0.042
ArgVal: 4.455 ± 0.046
ArgTrp: 1.204 ± 0.026
ArgTyr: 1.754 ± 0.029
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.469 ± 0.062
SerCys: 0.575 ± 0.022
SerAsp: 3.383 ± 0.05
SerGlu: 3.136 ± 0.045
SerPhe: 2.294 ± 0.037
SerGly: 5.555 ± 0.073
SerHis: 1.304 ± 0.025
SerIle: 2.918 ± 0.039
SerLys: 1.782 ± 0.034
SerLeu: 6.112 ± 0.059
SerMet: 1.311 ± 0.027
SerAsn: 2.031 ± 0.042
SerPro: 3.601 ± 0.051
SerGln: 2.299 ± 0.039
SerArg: 3.864 ± 0.047
SerSer: 4.058 ± 0.067
SerThr: 3.592 ± 0.059
SerVal: 4.038 ± 0.046
SerTrp: 0.865 ± 0.023
SerTyr: 1.47 ± 0.031
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.512 ± 0.07
ThrCys: 0.519 ± 0.02
ThrAsp: 2.741 ± 0.037
ThrGlu: 2.565 ± 0.037
ThrPhe: 2.069 ± 0.037
ThrGly: 4.944 ± 0.068
ThrHis: 1.055 ± 0.023
ThrIle: 3.007 ± 0.047
ThrLys: 1.662 ± 0.03
ThrLeu: 6.016 ± 0.06
ThrMet: 1.043 ± 0.021
ThrAsn: 1.831 ± 0.038
ThrPro: 3.97 ± 0.055
ThrGln: 1.775 ± 0.03
ThrArg: 2.875 ± 0.04
ThrSer: 3.268 ± 0.046
ThrThr: 3.389 ± 0.055
ThrVal: 4.017 ± 0.053
ThrTrp: 0.791 ± 0.023
ThrTyr: 1.335 ± 0.034
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.105 ± 0.084
ValCys: 0.82 ± 0.031
ValAsp: 4.913 ± 0.052
ValGlu: 4.766 ± 0.053
ValPhe: 2.254 ± 0.035
ValGly: 5.719 ± 0.064
ValHis: 1.325 ± 0.026
ValIle: 3.45 ± 0.046
ValLys: 2.357 ± 0.041
ValLeu: 6.148 ± 0.066
ValMet: 1.448 ± 0.03
ValAsn: 2.452 ± 0.042
ValPro: 3.425 ± 0.05
ValGln: 2.251 ± 0.033
ValArg: 4.647 ± 0.048
ValSer: 4.184 ± 0.057
ValThr: 4.288 ± 0.062
ValVal: 5.799 ± 0.065
ValTrp: 1.041 ± 0.024
ValTyr: 1.65 ± 0.035
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.248 ± 0.029
TrpCys: 0.17 ± 0.01
TrpAsp: 0.76 ± 0.018
TrpGlu: 0.771 ± 0.02
TrpPhe: 0.608 ± 0.02
TrpGly: 1.025 ± 0.024
TrpHis: 0.399 ± 0.014
TrpIle: 0.767 ± 0.021
TrpLys: 0.717 ± 0.027
TrpLeu: 1.711 ± 0.035
TrpMet: 0.388 ± 0.014
TrpAsn: 0.635 ± 0.017
TrpPro: 0.726 ± 0.022
TrpGln: 0.878 ± 0.022
TrpArg: 1.309 ± 0.03
TrpSer: 1.033 ± 0.024
TrpThr: 0.859 ± 0.023
TrpVal: 0.851 ± 0.022
TrpTrp: 0.292 ± 0.015
TrpTyr: 0.372 ± 0.013
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.536 ± 0.04
TyrCys: 0.329 ± 0.014
TyrAsp: 1.61 ± 0.036
TyrGlu: 1.398 ± 0.025
TyrPhe: 1.131 ± 0.036
TyrGly: 2.2 ± 0.039
TyrHis: 0.561 ± 0.017
TyrIle: 0.895 ± 0.023
TyrLys: 0.641 ± 0.021
TyrLeu: 2.428 ± 0.037
TyrMet: 0.428 ± 0.014
TyrAsn: 0.849 ± 0.026
TyrPro: 1.094 ± 0.025
TyrGln: 1.002 ± 0.021
TyrArg: 1.888 ± 0.03
TyrSer: 1.449 ± 0.033
TyrThr: 1.158 ± 0.028
TyrVal: 1.689 ± 0.031
TyrTrp: 0.434 ± 0.015
TyrTyr: 0.764 ± 0.02
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6194 proteins (1979840 amino acids)
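To get a feel for what a per mille value means in absolute terms, one can estimate the number of occurrences of a dipeptide from the totals above. The sketch assumes that dipeptides are counted as overlapping pairs within each protein, so that a protein of length n contributes n - 1 pairs; that counting scheme and the variable names are assumptions, and the result is only an approximation.

```python
N_PROTEINS = 6194
N_AMINO_ACIDS = 1979840

# Assumption: each protein of length n contributes n - 1 overlapping pairs.
total_dipeptides = N_AMINO_ACIDS - N_PROTEINS


def estimated_count(value_per_mille, total=total_dipeptides):
    """Approximate number of occurrences behind a per mille frequency."""
    return value_per_mille * 1e-3 * total


ala_ala_count = estimated_count(17.835)  # AlaAla from the table
print(total_dipeptides, round(ala_ala_count))
```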

Note: The error has been estimated by bootstrapping (x100) at the protein level
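A minimal sketch of bootstrapping at the protein level is given below: whole proteins are resampled with replacement 100 times, the frequency is recomputed for each resample, and the spread of those estimates serves as the error. Reporting the standard deviation as the "±" value, the function names, and the toy sequences are assumptions; the database's exact procedure is not described here.

```python
import random
from collections import Counter
from statistics import mean, stdev


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_per_mille(sequences, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return mean(estimates), stdev(estimates)


# Hypothetical toy "proteome" in one-letter codes (not real data).
toy_proteome = ["MAAL", "MAGA", "MKLV", "AAAA"]
avg, err = bootstrap_per_mille(toy_proteome, "AA")
print(round(avg, 1), "±", round(err, 1))
```

Resampling proteins rather than single dipeptides keeps the pairs from one protein together, which respects the fact that they are not independent of each other.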

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski