Amino acid dipeptide frequency for Psychromonas sp. CNPT3

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
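As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is only a minimal sketch: the function name `dipeptide_per_mille`, the use of one-letter codes, the mapping of every non-standard residue to `X` (Xaa), and the choice not to count pairs spanning two proteins are all assumptions, not a description of how Proteome-pI computes its tables.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any non-standard or unknown residue is mapped to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, not taken from the proteome).
toy = ["MALLK", "MLLA"]
freqs = dipeptide_per_mille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The frequencies returned for one set of proteins always sum to 1000‰, which is a convenient sanity check when applying the same idea to a real FASTA file.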

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins randomly and with equal probability, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
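The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and the exact rule the page uses to colour cells is not stated, so the simple comparison against the uniform expectation is an assumption. The four example values are taken from the table on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, expected_per_mille):
    """Assumed rule: compare a value directly with the uniform expectation."""
    if value_per_mille > expected_per_mille:
        return "over"
    if value_per_mille < expected_per_mille:
        return "under"
    return "as expected"


expected = uniform_expectation_per_mille()  # 1000 / 400
examples = {"LeuLeu": 12.701, "AlaAla": 5.231, "CysCys": 0.179, "TrpTrp": 0.14}
labels = {name: classify(value, expected) for name, value in examples.items()}
for name, label in labels.items():
    print(name, examples[name], label)
```

With 21 symbols (Xaa included) the same function gives 1000/441, roughly 2.27‰, so the choice of alphabet slightly shifts the threshold.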

All values are given in per mille (‰); to convert them to fractions, multiply by 10⁻³.
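The unit conversion can be sketched in a few lines. The helper names are hypothetical. The approximate count additionally assumes that the total number of dipeptides equals the number of amino acids minus the number of proteins (overlapping pairs that never span two proteins), using the totals reported in the statistics line of this page; the page itself does not state how the total was obtained.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent."""
    return value / 10.0


def approx_count(value_per_mille, n_dipeptides):
    """Rough number of occurrences implied by a per mille frequency."""
    return round(per_mille_to_fraction(value_per_mille) * n_dipeptides)


# Totals from the statistics line: 2578 proteins, 852767 amino acids.
# Assumption: one fewer dipeptide than residues in each protein.
n_dipeptides = 852767 - 2578

leu_leu = 12.701  # LeuLeu value from the table, in per mille
print(per_mille_to_fraction(leu_leu))
print(per_mille_to_percent(leu_leu))
print(approx_count(leu_leu, n_dipeptides))
```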

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 5.231 ± 0.103
AlaCys: 1.105 ± 0.037
AlaAsp: 4.206 ± 0.084
AlaGlu: 4.22 ± 0.082
AlaPhe: 3.529 ± 0.067
AlaGly: 5.148 ± 0.101
AlaHis: 1.779 ± 0.049
AlaIle: 6.28 ± 0.095
AlaLys: 5.041 ± 0.081
AlaLeu: 10.568 ± 0.131
AlaMet: 2.398 ± 0.061
AlaAsn: 3.186 ± 0.08
AlaPro: 2.74 ± 0.075
AlaGln: 4.348 ± 0.071
AlaArg: 3.263 ± 0.073
AlaSer: 5.507 ± 0.081
AlaThr: 4.525 ± 0.098
AlaVal: 4.512 ± 0.085
AlaTrp: 0.814 ± 0.032
AlaTyr: 2.465 ± 0.051
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.111 ± 0.04
CysCys: 0.179 ± 0.016
CysAsp: 0.711 ± 0.031
CysGlu: 0.59 ± 0.03
CysPhe: 0.614 ± 0.027
CysGly: 0.87 ± 0.033
CysHis: 0.383 ± 0.026
CysIle: 0.996 ± 0.039
CysLys: 0.563 ± 0.028
CysLeu: 1.303 ± 0.042
CysMet: 0.233 ± 0.018
CysAsn: 0.419 ± 0.025
CysPro: 0.483 ± 0.029
CysGln: 0.509 ± 0.025
CysArg: 0.429 ± 0.024
CysSer: 0.869 ± 0.036
CysThr: 0.57 ± 0.024
CysVal: 0.792 ± 0.029
CysTrp: 0.127 ± 0.012
CysTyr: 0.398 ± 0.024
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.859 ± 0.09
AspCys: 0.614 ± 0.024
AspAsp: 2.889 ± 0.072
AspGlu: 3.423 ± 0.074
AspPhe: 2.697 ± 0.065
AspGly: 3.043 ± 0.123
AspHis: 1.231 ± 0.038
AspIle: 4.589 ± 0.08
AspLys: 3.918 ± 0.073
AspLeu: 5.431 ± 0.085
AspMet: 1.331 ± 0.043
AspAsn: 2.556 ± 0.059
AspPro: 2.086 ± 0.054
AspGln: 1.723 ± 0.045
AspArg: 1.819 ± 0.045
AspSer: 3.043 ± 0.073
AspThr: 2.486 ± 0.065
AspVal: 3.56 ± 0.079
AspTrp: 0.62 ± 0.023
AspTyr: 1.65 ± 0.05
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.25 ± 0.09
GluCys: 0.548 ± 0.027
GluAsp: 2.676 ± 0.064
GluGlu: 2.966 ± 0.134
GluPhe: 2.249 ± 0.052
GluGly: 3.044 ± 0.07
GluHis: 1.425 ± 0.043
GluIle: 4.402 ± 0.079
GluLys: 4.463 ± 0.072
GluLeu: 6.061 ± 0.11
GluMet: 1.704 ± 0.051
GluAsn: 3.228 ± 0.067
GluPro: 1.493 ± 0.061
GluGln: 3.161 ± 0.074
GluArg: 2.494 ± 0.064
GluSer: 3.292 ± 0.062
GluThr: 2.777 ± 0.069
GluVal: 3.626 ± 0.079
GluTrp: 0.468 ± 0.024
GluTyr: 1.682 ± 0.048
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.383 ± 0.065
PheCys: 0.632 ± 0.027
PheAsp: 2.653 ± 0.066
PheGlu: 2.392 ± 0.058
PhePhe: 2.13 ± 0.071
PheGly: 2.573 ± 0.062
PheHis: 0.815 ± 0.031
PheIle: 3.765 ± 0.081
PheLys: 2.797 ± 0.061
PheLeu: 4.164 ± 0.086
PheMet: 1.161 ± 0.044
PheAsn: 2.457 ± 0.056
PhePro: 1.33 ± 0.038
PheGln: 1.16 ± 0.034
PheArg: 1.275 ± 0.047
PheSer: 4.115 ± 0.08
PheThr: 2.377 ± 0.064
PheVal: 2.746 ± 0.062
PheTrp: 0.491 ± 0.024
PheTyr: 1.567 ± 0.044
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.006 ± 0.102
GlyCys: 0.881 ± 0.031
GlyAsp: 3.268 ± 0.077
GlyGlu: 3.572 ± 0.08
GlyPhe: 3.07 ± 0.065
GlyGly: 4.103 ± 0.097
GlyHis: 1.347 ± 0.044
GlyIle: 4.858 ± 0.089
GlyLys: 4.124 ± 0.078
GlyLeu: 6.462 ± 0.096
GlyMet: 1.746 ± 0.055
GlyAsn: 2.37 ± 0.074
GlyPro: 1.453 ± 0.044
GlyGln: 2.336 ± 0.052
GlyArg: 2.635 ± 0.061
GlySer: 3.886 ± 0.074
GlyThr: 3.004 ± 0.094
GlyVal: 4.593 ± 0.091
GlyTrp: 0.749 ± 0.032
GlyTyr: 2.323 ± 0.057
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.664 ± 0.04
HisCys: 0.426 ± 0.029
HisAsp: 1.084 ± 0.036
HisGlu: 1.025 ± 0.033
HisPhe: 1.248 ± 0.037
HisGly: 1.298 ± 0.037
HisHis: 0.643 ± 0.032
HisIle: 1.648 ± 0.047
HisLys: 1.385 ± 0.036
HisLeu: 2.546 ± 0.052
HisMet: 0.45 ± 0.023
HisAsn: 1.005 ± 0.038
HisPro: 1.011 ± 0.036
HisGln: 1.156 ± 0.039
HisArg: 0.897 ± 0.03
HisSer: 1.48 ± 0.045
HisThr: 0.938 ± 0.038
HisVal: 1.25 ± 0.038
HisTrp: 0.305 ± 0.02
HisTyr: 0.974 ± 0.035
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.891 ± 0.089
IleCys: 0.976 ± 0.033
IleAsp: 4.882 ± 0.083
IleGlu: 5.116 ± 0.083
IlePhe: 3.062 ± 0.068
IleGly: 4.762 ± 0.081
IleHis: 1.404 ± 0.043
IleIle: 5.37 ± 0.098
IleLys: 5.181 ± 0.083
IleLeu: 7.234 ± 0.11
IleMet: 1.565 ± 0.05
IleAsn: 4.068 ± 0.074
IlePro: 2.66 ± 0.062
IleGln: 2.493 ± 0.058
IleArg: 2.685 ± 0.061
IleSer: 5.89 ± 0.1
IleThr: 4.233 ± 0.079
IleVal: 4.456 ± 0.083
IleTrp: 0.657 ± 0.031
IleTyr: 2.282 ± 0.054
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.89 ± 0.091
LysCys: 0.476 ± 0.023
LysAsp: 3.335 ± 0.06
LysGlu: 4.109 ± 0.071
LysPhe: 1.96 ± 0.045
LysGly: 3.743 ± 0.074
LysHis: 1.577 ± 0.046
LysIle: 5.014 ± 0.085
LysLys: 5.283 ± 0.1
LysLeu: 6.113 ± 0.096
LysMet: 2.015 ± 0.052
LysAsn: 3.851 ± 0.082
LysPro: 1.955 ± 0.043
LysGln: 3.266 ± 0.073
LysArg: 3.007 ± 0.063
LysSer: 4.1 ± 0.078
LysThr: 3.684 ± 0.071
LysVal: 4.209 ± 0.076
LysTrp: 0.575 ± 0.026
LysTyr: 1.866 ± 0.048
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.223 ± 0.132
LeuCys: 1.48 ± 0.041
LeuAsp: 5.848 ± 0.085
LeuGlu: 5.751 ± 0.088
LeuPhe: 4.98 ± 0.105
LeuGly: 6.662 ± 0.102
LeuHis: 2.373 ± 0.054
LeuIle: 7.673 ± 0.112
LeuLys: 6.805 ± 0.104
LeuLeu: 12.701 ± 0.181
LeuMet: 2.82 ± 0.059
LeuAsn: 5.358 ± 0.084
LeuPro: 4.345 ± 0.075
LeuGln: 4.947 ± 0.108
LeuArg: 4.314 ± 0.077
LeuSer: 9.163 ± 0.127
LeuThr: 5.798 ± 0.083
LeuVal: 6.193 ± 0.097
LeuTrp: 1.01 ± 0.039
LeuTyr: 2.907 ± 0.057
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.287 ± 0.063
MetCys: 0.256 ± 0.018
MetAsp: 1.2 ± 0.039
MetGlu: 0.943 ± 0.038
MetPhe: 0.983 ± 0.036
MetGly: 1.812 ± 0.058
MetHis: 0.657 ± 0.026
MetIle: 1.865 ± 0.052
MetLys: 1.494 ± 0.038
MetLeu: 3.063 ± 0.063
MetMet: 0.7 ± 0.032
MetAsn: 1.024 ± 0.036
MetPro: 1.177 ± 0.032
MetGln: 1.685 ± 0.042
MetArg: 1.159 ± 0.038
MetSer: 1.942 ± 0.051
MetThr: 1.311 ± 0.035
MetVal: 1.394 ± 0.046
MetTrp: 0.182 ± 0.014
MetTyr: 0.537 ± 0.025
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.095 ± 0.079
AsnCys: 0.507 ± 0.025
AsnAsp: 2.621 ± 0.1
AsnGlu: 2.561 ± 0.063
AsnPhe: 1.805 ± 0.051
AsnGly: 2.661 ± 0.074
AsnHis: 0.806 ± 0.028
AsnIle: 4.256 ± 0.085
AsnLys: 3.662 ± 0.069
AsnLeu: 4.21 ± 0.071
AsnMet: 1.156 ± 0.04
AsnAsn: 2.772 ± 0.078
AsnPro: 1.614 ± 0.045
AsnGln: 1.656 ± 0.052
AsnArg: 1.611 ± 0.049
AsnSer: 3.075 ± 0.071
AsnThr: 2.541 ± 0.064
AsnVal: 3.105 ± 0.069
AsnTrp: 0.545 ± 0.027
AsnTyr: 1.558 ± 0.049
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.594 ± 0.062
ProCys: 0.395 ± 0.022
ProAsp: 1.809 ± 0.048
ProGlu: 2.443 ± 0.066
ProPhe: 1.713 ± 0.046
ProGly: 1.907 ± 0.052
ProHis: 0.758 ± 0.03
ProIle: 2.596 ± 0.062
ProLys: 2.081 ± 0.057
ProLeu: 4.034 ± 0.07
ProMet: 0.919 ± 0.035
ProAsn: 1.467 ± 0.05
ProPro: 0.883 ± 0.036
ProGln: 1.415 ± 0.042
ProArg: 1.25 ± 0.045
ProSer: 2.21 ± 0.056
ProThr: 1.831 ± 0.045
ProVal: 2.412 ± 0.067
ProTrp: 0.403 ± 0.022
ProTyr: 1.146 ± 0.042
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.969 ± 0.071
GlnCys: 0.522 ± 0.026
GlnAsp: 2.016 ± 0.052
GlnGlu: 2.3 ± 0.057
GlnPhe: 1.631 ± 0.046
GlnGly: 3.161 ± 0.07
GlnHis: 1.147 ± 0.037
GlnIle: 2.925 ± 0.066
GlnLys: 3.181 ± 0.072
GlnLeu: 5.222 ± 0.105
GlnMet: 1.008 ± 0.038
GlnAsn: 1.806 ± 0.049
GlnPro: 1.242 ± 0.037
GlnGln: 3.112 ± 0.081
GlnArg: 2.295 ± 0.055
GlnSer: 2.868 ± 0.061
GlnThr: 2.201 ± 0.052
GlnVal: 2.834 ± 0.057
GlnTrp: 0.672 ± 0.031
GlnTyr: 1.512 ± 0.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.025 ± 0.067
ArgCys: 0.466 ± 0.024
ArgAsp: 2.135 ± 0.054
ArgGlu: 2.334 ± 0.053
ArgPhe: 2.002 ± 0.05
ArgGly: 2.223 ± 0.059
ArgHis: 0.999 ± 0.038
ArgIle: 3.13 ± 0.076
ArgLys: 2.295 ± 0.052
ArgLeu: 4.677 ± 0.088
ArgMet: 0.958 ± 0.033
ArgAsn: 1.673 ± 0.043
ArgPro: 1.367 ± 0.04
ArgGln: 1.829 ± 0.053
ArgArg: 1.927 ± 0.058
ArgSer: 2.472 ± 0.047
ArgThr: 1.773 ± 0.046
ArgVal: 2.634 ± 0.053
ArgTrp: 0.495 ± 0.025
ArgTyr: 1.66 ± 0.051
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.044 ± 0.093
SerCys: 0.769 ± 0.033
SerAsp: 3.673 ± 0.076
SerGlu: 3.772 ± 0.079
SerPhe: 3.108 ± 0.067
SerGly: 4.638 ± 0.089
SerHis: 1.487 ± 0.041
SerIle: 5.346 ± 0.094
SerLys: 3.893 ± 0.068
SerLeu: 7.782 ± 0.124
SerMet: 1.818 ± 0.048
SerAsn: 3.027 ± 0.07
SerPro: 2.31 ± 0.051
SerGln: 2.751 ± 0.06
SerArg: 2.586 ± 0.065
SerSer: 4.626 ± 0.092
SerThr: 3.813 ± 0.077
SerVal: 4.811 ± 0.091
SerTrp: 0.705 ± 0.03
SerTyr: 2.131 ± 0.048
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.676 ± 0.084
ThrCys: 0.549 ± 0.028
ThrAsp: 2.806 ± 0.081
ThrGlu: 2.905 ± 0.077
ThrPhe: 2.269 ± 0.061
ThrGly: 3.628 ± 0.083
ThrHis: 1.303 ± 0.037
ThrIle: 3.358 ± 0.072
ThrLys: 2.789 ± 0.067
ThrLeu: 7.227 ± 0.102
ThrMet: 1.072 ± 0.036
ThrAsn: 1.982 ± 0.059
ThrPro: 2.348 ± 0.057
ThrGln: 2.864 ± 0.062
ThrArg: 2.053 ± 0.053
ThrSer: 3.163 ± 0.065
ThrThr: 2.697 ± 0.061
ThrVal: 2.903 ± 0.115
ThrTrp: 0.586 ± 0.027
ThrTyr: 1.646 ± 0.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.495 ± 0.104
ValCys: 0.741 ± 0.031
ValAsp: 3.741 ± 0.09
ValGlu: 3.756 ± 0.074
ValPhe: 2.804 ± 0.063
ValGly: 3.953 ± 0.084
ValHis: 1.211 ± 0.04
ValIle: 4.923 ± 0.075
ValLys: 3.684 ± 0.07
ValLeu: 6.423 ± 0.091
ValMet: 1.662 ± 0.048
ValAsn: 2.947 ± 0.104
ValPro: 2.064 ± 0.051
ValGln: 2.423 ± 0.058
ValArg: 2.422 ± 0.059
ValSer: 4.538 ± 0.077
ValThr: 3.312 ± 0.082
ValVal: 4.215 ± 0.091
ValTrp: 0.51 ± 0.024
ValTyr: 1.764 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.682 ± 0.028
TrpCys: 0.136 ± 0.013
TrpAsp: 0.538 ± 0.024
TrpGlu: 0.451 ± 0.025
TrpPhe: 0.504 ± 0.027
TrpGly: 0.694 ± 0.031
TrpHis: 0.287 ± 0.017
TrpIle: 0.72 ± 0.029
TrpLys: 0.556 ± 0.023
TrpLeu: 1.463 ± 0.044
TrpMet: 0.27 ± 0.018
TrpAsn: 0.372 ± 0.021
TrpPro: 0.388 ± 0.02
TrpGln: 0.731 ± 0.032
TrpArg: 0.491 ± 0.024
TrpSer: 0.623 ± 0.029
TrpThr: 0.426 ± 0.027
TrpVal: 0.677 ± 0.03
TrpTrp: 0.14 ± 0.014
TrpTyr: 0.321 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.491 ± 0.061
TyrCys: 0.491 ± 0.023
TyrAsp: 1.479 ± 0.046
TyrGlu: 1.38 ± 0.046
TyrPhe: 1.625 ± 0.044
TyrGly: 1.865 ± 0.05
TyrHis: 0.8 ± 0.034
TyrIle: 2.154 ± 0.05
TyrLys: 1.832 ± 0.045
TyrLeu: 3.667 ± 0.068
TyrMet: 0.68 ± 0.03
TyrAsn: 1.249 ± 0.045
TyrPro: 1.319 ± 0.043
TyrGln: 2.059 ± 0.054
TyrArg: 1.462 ± 0.044
TyrSer: 2.228 ± 0.059
TyrThr: 1.527 ± 0.062
TyrVal: 1.663 ± 0.038
TyrTrp: 0.417 ± 0.023
TyrTyr: 1.156 ± 0.039
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2578 proteins (852767 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
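One way such an error estimate could be produced is sketched below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported. This is only a sketch under stated assumptions: the function names, the fixed seed, the toy sequences, and the use of the standard deviation as the reported ± value are all hypothetical, since the page does not specify these details.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Mean and standard deviation (per mille) of one dipeptide frequency
    over n_boot resamples drawn at the protein level."""
    rng = random.Random(seed)
    # Pre-compute (hits, total pairs) once per protein.
    per_protein = []
    for seq in proteins:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        if total:
            estimates.append(1000.0 * hits / total)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome (hypothetical sequences, one-letter codes).
toy_proteome = ["MALLK", "MLLA", "MKKLLE", "MAAG"]
mean_ll, sd_ll = bootstrap_error(toy_proteome, "LL")
print(round(mean_ll, 3), "±", round(sd_ll, 3))
```

Resampling proteins rather than single dipeptides keeps the pairs within a protein together, so the estimate reflects variation between proteins.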

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski