Amino acid dipeptide frequency for Gilvibacter sp. SZ-19

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins uniformly at random, each amino acid dipeptide would be present with a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
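The calculation described above can be sketched in a few lines of Python. This is a minimal illustration and not the Proteome-pI implementation: the function names `dipeptide_permille` and `classify` are hypothetical, sequences are assumed to use one-letter codes, any non-standard letter is assumed to map to Xaa (X), and the over/under labels simply compare against the uniform expectation of 2.5‰.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # 20 standard residues, one-letter codes
UNIFORM_EXPECTED = 1000.0 / 400         # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the 20 standard letters is treated as Xaa ("X").
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Each sequence is scanned on its own, so pairs never span two proteins.
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_EXPECTED):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    toy = ["MALLKV", "MLLA"]  # made-up sequences, not taken from the proteome
    for pair, freq in sorted(dipeptide_permille(toy).items()):
        print(pair, round(freq, 1), classify(freq))
```

With the values from the table, `classify(7.266)` (AlaAla) returns "over" and `classify(0.715)` (AlaCys) returns "under".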

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
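As a short worked example of this unit conversion, the sketch below turns a few per mille values from the table into fractions and compares them with the uniform expectation of 2.5‰. The helper names `permille_to_fraction` and `fold_vs_uniform` are hypothetical and introduced only for illustration.

```python
UNIFORM_EXPECTED_PERMILLE = 2.5  # 1/400 expressed in per mille


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_vs_uniform(value_permille):
    """Ratio of an observed frequency to the uniform expectation."""
    return value_permille / UNIFORM_EXPECTED_PERMILLE


if __name__ == "__main__":
    # Values copied from the table: LeuLeu, AlaAla, CysTrp.
    for name, value in [("LeuLeu", 9.88), ("AlaAla", 7.266), ("CysTrp", 0.065)]:
        fraction = permille_to_fraction(value)
        print(f"{name}: {value} per mille = {fraction:.6f} "
              f"({fraction * 100:.4f}%), {fold_vs_uniform(value):.2f}x uniform")
```

For instance, LeuLeu at 9.88‰ corresponds to a fraction of 0.00988, which is roughly 3.95 times the uniform expectation.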

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.266 ± 0.124
AlaCys: 0.715 ± 0.031
AlaAsp: 4.753 ± 0.089
AlaGlu: 4.844 ± 0.082
AlaPhe: 3.689 ± 0.068
AlaGly: 5.388 ± 0.078
AlaHis: 1.373 ± 0.04
AlaIle: 5.517 ± 0.09
AlaLys: 4.801 ± 0.086
AlaLeu: 8.156 ± 0.122
AlaMet: 1.998 ± 0.044
AlaAsn: 3.665 ± 0.071
AlaPro: 2.51 ± 0.058
AlaGln: 3.381 ± 0.063
AlaArg: 2.749 ± 0.053
AlaSer: 4.528 ± 0.071
AlaThr: 4.414 ± 0.076
AlaVal: 5.634 ± 0.097
AlaTrp: 0.823 ± 0.033
AlaTyr: 2.914 ± 0.052
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.494 ± 0.025
CysCys: 0.133 ± 0.012
CysAsp: 0.528 ± 0.036
CysGlu: 0.51 ± 0.026
CysPhe: 0.418 ± 0.024
CysGly: 0.657 ± 0.031
CysHis: 0.189 ± 0.016
CysIle: 0.483 ± 0.027
CysLys: 0.406 ± 0.026
CysLeu: 0.658 ± 0.026
CysMet: 0.159 ± 0.012
CysAsn: 0.355 ± 0.019
CysPro: 0.338 ± 0.022
CysGln: 0.235 ± 0.018
CysArg: 0.217 ± 0.016
CysSer: 0.544 ± 0.028
CysThr: 0.416 ± 0.023
CysVal: 0.484 ± 0.024
CysTrp: 0.065 ± 0.008
CysTyr: 0.272 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.662 ± 0.095
AspCys: 0.423 ± 0.021
AspAsp: 3.22 ± 0.084
AspGlu: 3.572 ± 0.066
AspPhe: 3.599 ± 0.067
AspGly: 4.643 ± 0.108
AspHis: 1.12 ± 0.036
AspIle: 3.932 ± 0.062
AspLys: 3.503 ± 0.068
AspLeu: 5.769 ± 0.084
AspMet: 1.29 ± 0.04
AspAsn: 2.9 ± 0.07
AspPro: 2.769 ± 0.067
AspGln: 2.284 ± 0.053
AspArg: 2.384 ± 0.042
AspSer: 3.437 ± 0.064
AspThr: 3.001 ± 0.07
AspVal: 3.531 ± 0.073
AspTrp: 0.81 ± 0.028
AspTyr: 2.928 ± 0.063
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.79 ± 0.095
GluCys: 0.315 ± 0.019
GluAsp: 3.69 ± 0.069
GluGlu: 4.452 ± 0.097
GluPhe: 3.066 ± 0.063
GluGly: 3.74 ± 0.078
GluHis: 1.185 ± 0.043
GluIle: 4.564 ± 0.075
GluLys: 4.015 ± 0.082
GluLeu: 6.623 ± 0.099
GluMet: 1.44 ± 0.046
GluAsn: 3.33 ± 0.066
GluPro: 1.792 ± 0.054
GluGln: 3.007 ± 0.062
GluArg: 2.884 ± 0.066
GluSer: 3.377 ± 0.055
GluThr: 3.586 ± 0.057
GluVal: 4.749 ± 0.085
GluTrp: 0.517 ± 0.025
GluTyr: 2.186 ± 0.05
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.581 ± 0.072
PheCys: 0.408 ± 0.021
PheAsp: 3.45 ± 0.06
PheGlu: 3.484 ± 0.06
PhePhe: 2.54 ± 0.069
PheGly: 3.834 ± 0.077
PheHis: 0.835 ± 0.029
PheIle: 3.159 ± 0.067
PheLys: 3.171 ± 0.056
PheLeu: 4.367 ± 0.079
PheMet: 1.117 ± 0.032
PheAsn: 2.821 ± 0.065
PhePro: 1.746 ± 0.046
PheGln: 1.467 ± 0.039
PheArg: 1.757 ± 0.055
PheSer: 3.441 ± 0.068
PheThr: 3.07 ± 0.059
PheVal: 3.23 ± 0.066
PheTrp: 0.631 ± 0.027
PheTyr: 2.116 ± 0.05
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.175 ± 0.085
GlyCys: 0.656 ± 0.038
GlyAsp: 4.058 ± 0.078
GlyGlu: 3.731 ± 0.075
GlyPhe: 3.875 ± 0.076
GlyGly: 5.231 ± 0.105
GlyHis: 1.28 ± 0.034
GlyIle: 5.09 ± 0.085
GlyLys: 4.192 ± 0.077
GlyLeu: 6.74 ± 0.101
GlyMet: 1.756 ± 0.04
GlyAsn: 3.43 ± 0.078
GlyPro: 1.807 ± 0.05
GlyGln: 2.567 ± 0.054
GlyArg: 2.642 ± 0.058
GlySer: 4.324 ± 0.081
GlyThr: 4.273 ± 0.093
GlyVal: 4.845 ± 0.074
GlyTrp: 0.848 ± 0.033
GlyTyr: 2.855 ± 0.061
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.054 ± 0.032
HisCys: 0.189 ± 0.015
HisAsp: 0.845 ± 0.03
HisGlu: 0.993 ± 0.036
HisPhe: 1.094 ± 0.038
HisGly: 1.147 ± 0.034
HisHis: 0.454 ± 0.026
HisIle: 1.212 ± 0.037
HisLys: 1.163 ± 0.036
HisLeu: 1.93 ± 0.051
HisMet: 0.382 ± 0.022
HisAsn: 0.801 ± 0.029
HisPro: 1.022 ± 0.034
HisGln: 0.81 ± 0.032
HisArg: 0.796 ± 0.029
HisSer: 1.008 ± 0.032
HisThr: 0.891 ± 0.03
HisVal: 0.883 ± 0.034
HisTrp: 0.26 ± 0.015
HisTyr: 0.865 ± 0.032
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.043 ± 0.09
IleCys: 0.566 ± 0.028
IleAsp: 4.518 ± 0.073
IleGlu: 4.288 ± 0.079
IlePhe: 2.931 ± 0.063
IleGly: 4.749 ± 0.077
IleHis: 1.153 ± 0.039
IleIle: 4.261 ± 0.084
IleLys: 4.052 ± 0.067
IleLeu: 5.751 ± 0.082
IleMet: 1.266 ± 0.04
IleAsn: 3.557 ± 0.062
IlePro: 3.089 ± 0.063
IleGln: 2.308 ± 0.051
IleArg: 2.757 ± 0.057
IleSer: 4.507 ± 0.066
IleThr: 4.089 ± 0.066
IleVal: 4.278 ± 0.077
IleTrp: 0.665 ± 0.03
IleTyr: 2.428 ± 0.056
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.258 ± 0.09
LysCys: 0.248 ± 0.017
LysAsp: 3.684 ± 0.066
LysGlu: 4.646 ± 0.094
LysPhe: 2.179 ± 0.051
LysGly: 3.747 ± 0.062
LysHis: 1.186 ± 0.043
LysIle: 3.963 ± 0.069
LysLys: 4.548 ± 0.091
LysLeu: 5.483 ± 0.072
LysMet: 1.517 ± 0.038
LysAsn: 3.176 ± 0.067
LysPro: 2.203 ± 0.047
LysGln: 2.682 ± 0.065
LysArg: 3.102 ± 0.065
LysSer: 3.564 ± 0.059
LysThr: 3.665 ± 0.074
LysVal: 3.936 ± 0.075
LysTrp: 0.612 ± 0.025
LysTyr: 2.29 ± 0.054
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.379 ± 0.105
LeuCys: 0.692 ± 0.032
LeuAsp: 5.894 ± 0.086
LeuGlu: 6.746 ± 0.086
LeuPhe: 4.695 ± 0.082
LeuGly: 6.657 ± 0.09
LeuHis: 1.602 ± 0.044
LeuIle: 6.445 ± 0.095
LeuLys: 6.319 ± 0.109
LeuLeu: 9.88 ± 0.156
LeuMet: 2.237 ± 0.045
LeuAsn: 5.011 ± 0.083
LeuPro: 3.902 ± 0.071
LeuGln: 3.711 ± 0.08
LeuArg: 4.018 ± 0.08
LeuSer: 6.565 ± 0.1
LeuThr: 5.058 ± 0.076
LeuVal: 6.01 ± 0.086
LeuTrp: 1.045 ± 0.042
LeuTyr: 3.191 ± 0.067
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.006 ± 0.054
MetCys: 0.121 ± 0.011
MetAsp: 1.301 ± 0.039
MetGlu: 1.629 ± 0.041
MetPhe: 0.782 ± 0.026
MetGly: 1.597 ± 0.045
MetHis: 0.453 ± 0.022
MetIle: 1.469 ± 0.042
MetLys: 1.692 ± 0.042
MetLeu: 2.102 ± 0.046
MetMet: 0.568 ± 0.03
MetAsn: 1.198 ± 0.036
MetPro: 0.909 ± 0.036
MetGln: 1.035 ± 0.032
MetArg: 1.122 ± 0.031
MetSer: 1.415 ± 0.036
MetThr: 1.183 ± 0.04
MetVal: 1.481 ± 0.046
MetTrp: 0.169 ± 0.012
MetTyr: 0.704 ± 0.027
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.769 ± 0.061
AsnCys: 0.391 ± 0.023
AsnAsp: 2.867 ± 0.066
AsnGlu: 2.897 ± 0.062
AsnPhe: 2.638 ± 0.063
AsnGly: 3.593 ± 0.079
AsnHis: 0.839 ± 0.034
AsnIle: 3.447 ± 0.071
AsnLys: 2.855 ± 0.055
AsnLeu: 4.615 ± 0.075
AsnMet: 1.191 ± 0.04
AsnAsn: 2.838 ± 0.071
AsnPro: 2.85 ± 0.065
AsnGln: 1.967 ± 0.047
AsnArg: 2.183 ± 0.048
AsnSer: 3.12 ± 0.072
AsnThr: 3.176 ± 0.083
AsnVal: 2.812 ± 0.062
AsnTrp: 0.728 ± 0.028
AsnTyr: 2.349 ± 0.055
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.655 ± 0.059
ProCys: 0.221 ± 0.016
ProAsp: 2.588 ± 0.061
ProGlu: 3.291 ± 0.068
ProPhe: 1.928 ± 0.047
ProGly: 2.473 ± 0.06
ProHis: 0.638 ± 0.026
ProIle: 2.46 ± 0.06
ProLys: 2.442 ± 0.05
ProLeu: 3.494 ± 0.066
ProMet: 0.853 ± 0.032
ProAsn: 2.135 ± 0.053
ProPro: 1.02 ± 0.042
ProGln: 1.663 ± 0.043
ProArg: 1.179 ± 0.035
ProSer: 2.052 ± 0.044
ProThr: 2.138 ± 0.049
ProVal: 2.712 ± 0.056
ProTrp: 0.42 ± 0.022
ProTyr: 1.498 ± 0.043
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.083 ± 0.065
GlnCys: 0.214 ± 0.014
GlnAsp: 2.132 ± 0.053
GlnGlu: 2.904 ± 0.059
GlnPhe: 1.806 ± 0.048
GlnGly: 2.484 ± 0.06
GlnHis: 0.754 ± 0.026
GlnIle: 2.626 ± 0.053
GlnLys: 2.343 ± 0.057
GlnLeu: 4.201 ± 0.071
GlnMet: 1.058 ± 0.035
GlnAsn: 1.817 ± 0.052
GlnPro: 1.248 ± 0.038
GlnGln: 2.246 ± 0.059
GlnArg: 1.766 ± 0.043
GlnSer: 1.909 ± 0.044
GlnThr: 2.149 ± 0.05
GlnVal: 2.41 ± 0.055
GlnTrp: 0.493 ± 0.024
GlnTyr: 1.516 ± 0.04
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.839 ± 0.061
ArgCys: 0.267 ± 0.019
ArgAsp: 2.211 ± 0.054
ArgGlu: 2.507 ± 0.059
ArgPhe: 2.389 ± 0.058
ArgGly: 2.38 ± 0.053
ArgHis: 0.65 ± 0.029
ArgIle: 3.165 ± 0.053
ArgLys: 2.723 ± 0.059
ArgLeu: 4.092 ± 0.067
ArgMet: 1.099 ± 0.034
ArgAsn: 2.086 ± 0.05
ArgPro: 1.446 ± 0.037
ArgGln: 1.386 ± 0.036
ArgArg: 1.729 ± 0.05
ArgSer: 2.548 ± 0.067
ArgThr: 2.02 ± 0.046
ArgVal: 2.633 ± 0.058
ArgTrp: 0.494 ± 0.02
ArgTyr: 1.815 ± 0.046
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.45 ± 0.082
SerCys: 0.637 ± 0.031
SerAsp: 3.485 ± 0.067
SerGlu: 3.818 ± 0.063
SerPhe: 3.534 ± 0.06
SerGly: 5.09 ± 0.087
SerHis: 0.93 ± 0.036
SerIle: 4.006 ± 0.068
SerLys: 3.693 ± 0.069
SerLeu: 6.152 ± 0.101
SerMet: 1.388 ± 0.042
SerAsn: 2.987 ± 0.064
SerPro: 2.252 ± 0.052
SerGln: 2.028 ± 0.051
SerArg: 2.332 ± 0.056
SerSer: 3.795 ± 0.077
SerThr: 3.232 ± 0.063
SerVal: 3.742 ± 0.067
SerTrp: 0.79 ± 0.031
SerTyr: 2.506 ± 0.056
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.963 ± 0.082
ThrCys: 0.331 ± 0.019
ThrAsp: 3.491 ± 0.078
ThrGlu: 3.313 ± 0.066
ThrPhe: 2.775 ± 0.068
ThrGly: 4.17 ± 0.084
ThrHis: 1.016 ± 0.033
ThrIle: 3.724 ± 0.07
ThrLys: 2.889 ± 0.06
ThrLeu: 5.486 ± 0.085
ThrMet: 0.988 ± 0.033
ThrAsn: 2.645 ± 0.063
ThrPro: 2.669 ± 0.062
ThrGln: 2.112 ± 0.049
ThrArg: 1.936 ± 0.045
ThrSer: 3.23 ± 0.062
ThrThr: 3.591 ± 0.073
ThrVal: 4.318 ± 0.095
ThrTrp: 0.621 ± 0.033
ThrTyr: 2.442 ± 0.058
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.242 ± 0.083
ValCys: 0.58 ± 0.026
ValAsp: 4.016 ± 0.076
ValGlu: 3.723 ± 0.068
ValPhe: 3.42 ± 0.067
ValGly: 4.326 ± 0.073
ValHis: 1.113 ± 0.039
ValIle: 4.673 ± 0.076
ValLys: 3.725 ± 0.071
ValLeu: 6.543 ± 0.089
ValMet: 1.475 ± 0.042
ValAsn: 3.474 ± 0.066
ValPro: 2.43 ± 0.051
ValGln: 2.106 ± 0.05
ValArg: 2.451 ± 0.052
ValSer: 4.433 ± 0.067
ValThr: 3.721 ± 0.086
ValVal: 4.987 ± 0.096
ValTrp: 0.647 ± 0.024
ValTyr: 2.621 ± 0.061
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.771 ± 0.027
TrpCys: 0.089 ± 0.01
TrpAsp: 0.684 ± 0.029
TrpGlu: 0.713 ± 0.029
TrpPhe: 0.613 ± 0.03
TrpGly: 0.724 ± 0.032
TrpHis: 0.24 ± 0.016
TrpIle: 0.761 ± 0.03
TrpLys: 0.679 ± 0.026
TrpLeu: 1.086 ± 0.034
TrpMet: 0.365 ± 0.02
TrpAsn: 0.699 ± 0.029
TrpPro: 0.313 ± 0.017
TrpGln: 0.462 ± 0.024
TrpArg: 0.491 ± 0.024
TrpSer: 0.688 ± 0.032
TrpThr: 0.597 ± 0.026
TrpVal: 0.71 ± 0.026
TrpTrp: 0.171 ± 0.013
TrpTyr: 0.47 ± 0.025
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.636 ± 0.05
TyrCys: 0.365 ± 0.022
TyrAsp: 2.364 ± 0.057
TyrGlu: 2.245 ± 0.045
TyrPhe: 2.255 ± 0.049
TyrGly: 2.745 ± 0.052
TyrHis: 0.791 ± 0.03
TyrIle: 2.301 ± 0.055
TyrLys: 2.477 ± 0.052
TyrLeu: 3.925 ± 0.069
TyrMet: 0.793 ± 0.033
TyrAsn: 2.197 ± 0.054
TyrPro: 1.548 ± 0.04
TyrGln: 1.7 ± 0.044
TyrArg: 1.985 ± 0.05
TyrSer: 2.362 ± 0.05
TyrThr: 2.438 ± 0.066
TyrVal: 2.323 ± 0.055
TyrTrp: 0.518 ± 0.024
TyrTyr: 1.719 ± 0.05
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2809 proteins (944669 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
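A protein-level bootstrap of this kind could look roughly like the sketch below. The exact procedure used by Proteome-pI is not described here, so this is an assumption-laden illustration: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is taken as the error. The function names `pair_counts` and `bootstrap_error`, the fixed seed, and the use of the standard deviation are all hypothetical choices.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per mille frequency of `pair`.

    Resampling is done at the protein level: each replicate draws
    len(proteins) whole proteins with replacement.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 for absent pairs
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy = ["MALLKV", "MLLA", "AALLLG", "MKVA"]  # made-up sequences
    mean, sd = bootstrap_error(toy, "LL")
    print(f"LL: {mean:.1f} ± {sd:.1f} per mille")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same protein together, which is presumably why the note specifies the protein level.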

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski