Amino acid dipeptide frequency for Mucilaginibacter sp. PPCGB 2223

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
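As an illustration of the calculation described above, the following minimal Python sketch counts overlapping adjacent residue pairs and reports them in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_permille`, the one-letter input alphabet, the mapping of non-standard residues to `X`, and the toy sequences are all assumptions made for this example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein: positions (0,1), (1,2), ...
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for 400 equally likely dipeptides: 2.5 per mille (0.25%).
UNIFORM_PERMILLE = 1000.0 / 400

toy = ["MKLLA", "MALLK"]  # hypothetical sequences, not from this proteome
freqs = dipeptide_permille(toy)
for pair, value in sorted(freqs.items()):
    label = "over" if value > UNIFORM_PERMILLE else "under"
    print(pair, round(value, 1), label)
```

With only two short toy sequences every observed pair is far above the uniform expectation; on a full proteome the same comparison separates overrepresented from underrepresented dipeptides.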

All values are given in per mille (‰); multiply by 10^-3 to obtain fractions.
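The unit conversion can be checked with a short Python sketch. The helper names `permille_to_fraction` and `fold_vs_uniform` are hypothetical; the two input values are taken from the AlaAla and CysCys cells of the table, and the comparison assumes the uniform expectation of 1/400 per dipeptide.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def fold_vs_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1/n_dipeptides."""
    return permille_to_fraction(value_permille) * n_dipeptides

ala_ala = 7.202  # AlaAla, per mille, from the table
cys_cys = 0.137  # CysCys, per mille, from the table

print(permille_to_fraction(ala_ala))  # fraction of all dipeptides that are AlaAla
print(fold_vs_uniform(ala_ala))       # above 1: more frequent than uniform
print(fold_vs_uniform(cys_cys))       # below 1: less frequent than uniform
```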

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.202 ± 0.104
AlaCys: 0.741 ± 0.021
AlaAsp: 5.132 ± 0.058
AlaGlu: 4.384 ± 0.065
AlaPhe: 3.583 ± 0.046
AlaGly: 6.354 ± 0.085
AlaHis: 1.367 ± 0.026
AlaIle: 5.659 ± 0.067
AlaLys: 4.904 ± 0.071
AlaLeu: 7.201 ± 0.076
AlaMet: 1.897 ± 0.041
AlaAsn: 4.012 ± 0.062
AlaPro: 2.741 ± 0.05
AlaGln: 3.329 ± 0.05
AlaArg: 2.783 ± 0.04
AlaSer: 4.664 ± 0.074
AlaThr: 4.69 ± 0.109
AlaVal: 5.224 ± 0.067
AlaTrp: 0.874 ± 0.022
AlaTyr: 3.091 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.616 ± 0.02
CysCys: 0.137 ± 0.011
CysAsp: 0.467 ± 0.019
CysGlu: 0.377 ± 0.017
CysPhe: 0.471 ± 0.017
CysGly: 0.668 ± 0.024
CysHis: 0.205 ± 0.011
CysIle: 0.649 ± 0.02
CysLys: 0.489 ± 0.015
CysLeu: 0.796 ± 0.023
CysMet: 0.189 ± 0.012
CysAsn: 0.405 ± 0.019
CysPro: 0.368 ± 0.014
CysGln: 0.221 ± 0.011
CysArg: 0.37 ± 0.017
CysSer: 0.554 ± 0.022
CysThr: 0.502 ± 0.02
CysVal: 0.489 ± 0.017
CysTrp: 0.093 ± 0.008
CysTyr: 0.338 ± 0.014
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.279 ± 0.052
AspCys: 0.406 ± 0.018
AspAsp: 2.899 ± 0.053
AspGlu: 3.384 ± 0.055
AspPhe: 2.914 ± 0.046
AspGly: 4.045 ± 0.06
AspHis: 1.193 ± 0.028
AspIle: 4.172 ± 0.055
AspLys: 3.889 ± 0.057
AspLeu: 4.833 ± 0.062
AspMet: 1.335 ± 0.027
AspAsn: 2.887 ± 0.045
AspPro: 2.302 ± 0.045
AspGln: 1.923 ± 0.039
AspArg: 2.15 ± 0.036
AspSer: 2.894 ± 0.048
AspThr: 2.983 ± 0.048
AspVal: 3.583 ± 0.052
AspTrp: 0.783 ± 0.023
AspTyr: 2.548 ± 0.044
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.019 ± 0.057
GluCys: 0.335 ± 0.015
GluAsp: 2.408 ± 0.044
GluGlu: 2.907 ± 0.058
GluPhe: 2.163 ± 0.038
GluGly: 2.992 ± 0.048
GluHis: 1.174 ± 0.03
GluIle: 3.672 ± 0.058
GluLys: 3.964 ± 0.066
GluLeu: 5.328 ± 0.066
GluMet: 1.315 ± 0.03
GluAsn: 2.713 ± 0.048
GluPro: 1.716 ± 0.033
GluGln: 2.294 ± 0.044
GluArg: 2.3 ± 0.047
GluSer: 2.472 ± 0.039
GluThr: 2.719 ± 0.043
GluVal: 3.422 ± 0.058
GluTrp: 0.612 ± 0.019
GluTyr: 1.965 ± 0.041
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.639 ± 0.045
PheCys: 0.497 ± 0.019
PheAsp: 3.008 ± 0.044
PheGlu: 2.511 ± 0.044
PhePhe: 2.302 ± 0.041
PheGly: 3.487 ± 0.052
PheHis: 0.761 ± 0.02
PheIle: 3.322 ± 0.044
PheLys: 3.303 ± 0.053
PheLeu: 3.956 ± 0.061
PheMet: 1.015 ± 0.024
PheAsn: 2.987 ± 0.043
PhePro: 1.672 ± 0.034
PheGln: 1.182 ± 0.028
PheArg: 1.762 ± 0.036
PheSer: 3.239 ± 0.048
PheThr: 3.301 ± 0.056
PheVal: 2.825 ± 0.049
PheTrp: 0.597 ± 0.021
PheTyr: 2.122 ± 0.037
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.039 ± 0.068
GlyCys: 0.688 ± 0.029
GlyAsp: 3.633 ± 0.055
GlyGlu: 3.247 ± 0.049
GlyPhe: 3.735 ± 0.054
GlyGly: 5.354 ± 0.084
GlyHis: 1.37 ± 0.028
GlyIle: 5.413 ± 0.064
GlyLys: 5.205 ± 0.062
GlyLeu: 6.344 ± 0.072
GlyMet: 1.721 ± 0.036
GlyAsn: 3.78 ± 0.062
GlyPro: 1.851 ± 0.04
GlyGln: 2.457 ± 0.043
GlyArg: 2.632 ± 0.045
GlySer: 4.683 ± 0.083
GlyThr: 4.654 ± 0.105
GlyVal: 4.578 ± 0.063
GlyTrp: 0.966 ± 0.028
GlyTyr: 3.27 ± 0.054
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.269 ± 0.031
HisCys: 0.204 ± 0.012
HisAsp: 1.029 ± 0.025
HisGlu: 1.0 ± 0.025
HisPhe: 1.108 ± 0.03
HisGly: 1.294 ± 0.033
HisHis: 0.571 ± 0.018
HisIle: 1.558 ± 0.029
HisLys: 1.011 ± 0.024
HisLeu: 1.933 ± 0.038
HisMet: 0.393 ± 0.016
HisAsn: 1.014 ± 0.025
HisPro: 1.121 ± 0.026
HisGln: 0.824 ± 0.02
HisArg: 0.812 ± 0.023
HisSer: 1.087 ± 0.027
HisThr: 1.171 ± 0.027
HisVal: 1.068 ± 0.027
HisTrp: 0.297 ± 0.016
HisTyr: 0.95 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.08 ± 0.073
IleCys: 0.759 ± 0.021
IleAsp: 4.333 ± 0.054
IleGlu: 3.701 ± 0.059
IlePhe: 2.926 ± 0.044
IleGly: 4.908 ± 0.062
IleHis: 1.315 ± 0.03
IleIle: 4.858 ± 0.07
IleLys: 4.741 ± 0.055
IleLeu: 5.834 ± 0.078
IleMet: 1.32 ± 0.033
IleAsn: 4.181 ± 0.052
IlePro: 3.078 ± 0.045
IleGln: 2.122 ± 0.041
IleArg: 2.993 ± 0.045
IleSer: 4.885 ± 0.057
IleThr: 4.999 ± 0.088
IleVal: 4.345 ± 0.056
IleTrp: 0.748 ± 0.024
IleTyr: 2.643 ± 0.039
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.497 ± 0.072
LysCys: 0.326 ± 0.016
LysAsp: 3.732 ± 0.048
LysGlu: 3.633 ± 0.058
LysPhe: 2.427 ± 0.045
LysGly: 4.232 ± 0.056
LysHis: 1.312 ± 0.031
LysIle: 4.619 ± 0.062
LysLys: 4.955 ± 0.077
LysLeu: 6.059 ± 0.072
LysMet: 1.787 ± 0.039
LysAsn: 3.92 ± 0.051
LysPro: 3.086 ± 0.049
LysGln: 2.879 ± 0.042
LysArg: 2.592 ± 0.046
LysSer: 3.502 ± 0.048
LysThr: 4.277 ± 0.048
LysVal: 4.252 ± 0.059
LysTrp: 0.751 ± 0.022
LysTyr: 2.855 ± 0.043
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.168 ± 0.07
LeuCys: 0.829 ± 0.024
LeuAsp: 4.653 ± 0.068
LeuGlu: 4.185 ± 0.067
LeuPhe: 4.486 ± 0.071
LeuGly: 5.723 ± 0.074
LeuHis: 1.798 ± 0.031
LeuIle: 6.347 ± 0.06
LeuLys: 6.866 ± 0.07
LeuLeu: 8.943 ± 0.099
LeuMet: 2.144 ± 0.041
LeuAsn: 5.441 ± 0.065
LeuPro: 4.221 ± 0.056
LeuGln: 3.604 ± 0.057
LeuArg: 3.582 ± 0.053
LeuSer: 6.47 ± 0.06
LeuThr: 5.78 ± 0.074
LeuVal: 5.406 ± 0.058
LeuTrp: 0.952 ± 0.026
LeuTyr: 3.368 ± 0.051
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.066 ± 0.037
MetCys: 0.143 ± 0.009
MetAsp: 1.191 ± 0.027
MetGlu: 1.19 ± 0.027
MetPhe: 0.837 ± 0.025
MetGly: 1.55 ± 0.033
MetHis: 0.497 ± 0.019
MetIle: 1.5 ± 0.035
MetLys: 1.823 ± 0.038
MetLeu: 2.232 ± 0.043
MetMet: 0.597 ± 0.021
MetAsn: 1.198 ± 0.027
MetPro: 1.144 ± 0.027
MetGln: 1.0 ± 0.025
MetArg: 0.986 ± 0.026
MetSer: 1.304 ± 0.031
MetThr: 1.144 ± 0.025
MetVal: 1.496 ± 0.031
MetTrp: 0.219 ± 0.013
MetTyr: 0.683 ± 0.022
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.188 ± 0.053
AsnCys: 0.473 ± 0.019
AsnAsp: 2.851 ± 0.041
AsnGlu: 2.598 ± 0.043
AsnPhe: 2.627 ± 0.045
AsnGly: 4.292 ± 0.056
AsnHis: 1.052 ± 0.029
AsnIle: 4.19 ± 0.06
AsnLys: 3.39 ± 0.047
AsnLeu: 4.867 ± 0.062
AsnMet: 1.22 ± 0.023
AsnAsn: 3.471 ± 0.06
AsnPro: 2.839 ± 0.047
AsnGln: 2.046 ± 0.035
AsnArg: 2.219 ± 0.037
AsnSer: 3.232 ± 0.053
AsnThr: 3.594 ± 0.063
AsnVal: 3.245 ± 0.058
AsnTrp: 0.782 ± 0.023
AsnTyr: 2.644 ± 0.047
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.999 ± 0.061
ProCys: 0.252 ± 0.013
ProAsp: 2.886 ± 0.047
ProGlu: 2.666 ± 0.043
ProPhe: 1.928 ± 0.034
ProGly: 3.358 ± 0.052
ProHis: 0.777 ± 0.023
ProIle: 2.231 ± 0.04
ProLys: 2.228 ± 0.034
ProLeu: 3.33 ± 0.046
ProMet: 0.831 ± 0.023
ProAsn: 2.103 ± 0.037
ProPro: 1.363 ± 0.039
ProGln: 1.604 ± 0.032
ProArg: 1.21 ± 0.023
ProSer: 2.157 ± 0.039
ProThr: 2.128 ± 0.046
ProVal: 3.727 ± 0.057
ProTrp: 0.425 ± 0.016
ProTyr: 1.694 ± 0.034
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.924 ± 0.042
GlnCys: 0.216 ± 0.012
GlnAsp: 1.651 ± 0.033
GlnGlu: 1.574 ± 0.034
GlnPhe: 1.694 ± 0.034
GlnGly: 2.237 ± 0.038
GlnHis: 0.837 ± 0.025
GlnIle: 2.494 ± 0.043
GlnLys: 2.66 ± 0.042
GlnLeu: 3.831 ± 0.057
GlnMet: 0.976 ± 0.026
GlnAsn: 2.218 ± 0.034
GlnPro: 1.628 ± 0.035
GlnGln: 2.24 ± 0.072
GlnArg: 1.536 ± 0.036
GlnSer: 2.192 ± 0.039
GlnThr: 2.38 ± 0.038
GlnVal: 2.509 ± 0.04
GlnTrp: 0.459 ± 0.017
GlnTyr: 1.669 ± 0.035
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.699 ± 0.042
ArgCys: 0.248 ± 0.012
ArgAsp: 2.086 ± 0.043
ArgGlu: 2.22 ± 0.043
ArgPhe: 2.1 ± 0.035
ArgGly: 2.225 ± 0.041
ArgHis: 0.763 ± 0.021
ArgIle: 2.987 ± 0.05
ArgLys: 2.634 ± 0.049
ArgLeu: 3.841 ± 0.054
ArgMet: 1.083 ± 0.027
ArgAsn: 2.066 ± 0.037
ArgPro: 1.477 ± 0.03
ArgGln: 1.545 ± 0.032
ArgArg: 1.657 ± 0.033
ArgSer: 2.285 ± 0.041
ArgThr: 2.045 ± 0.04
ArgVal: 2.452 ± 0.045
ArgTrp: 0.538 ± 0.018
ArgTyr: 1.882 ± 0.035
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.072 ± 0.075
SerCys: 0.547 ± 0.018
SerAsp: 3.153 ± 0.045
SerGlu: 2.633 ± 0.042
SerPhe: 3.339 ± 0.048
SerGly: 5.034 ± 0.079
SerHis: 1.133 ± 0.027
SerIle: 4.371 ± 0.061
SerLys: 3.596 ± 0.049
SerLeu: 5.718 ± 0.066
SerMet: 1.242 ± 0.025
SerAsn: 3.159 ± 0.06
SerPro: 2.469 ± 0.038
SerGln: 2.026 ± 0.04
SerArg: 2.277 ± 0.038
SerSer: 3.837 ± 0.077
SerThr: 3.669 ± 0.087
SerVal: 4.322 ± 0.06
SerTrp: 0.717 ± 0.022
SerTyr: 2.719 ± 0.051
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.329 ± 0.107
ThrCys: 0.428 ± 0.016
ThrAsp: 3.739 ± 0.054
ThrGlu: 2.876 ± 0.052
ThrPhe: 2.797 ± 0.045
ThrGly: 5.387 ± 0.101
ThrHis: 1.114 ± 0.031
ThrIle: 4.536 ± 0.072
ThrLys: 3.068 ± 0.051
ThrLeu: 5.636 ± 0.068
ThrMet: 1.068 ± 0.027
ThrAsn: 3.055 ± 0.056
ThrPro: 2.917 ± 0.065
ThrGln: 2.103 ± 0.042
ThrArg: 2.117 ± 0.036
ThrSer: 3.788 ± 0.089
ThrThr: 4.029 ± 0.097
ThrVal: 4.437 ± 0.084
ThrTrp: 0.738 ± 0.023
ThrTyr: 2.755 ± 0.083
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.928 ± 0.063
ValCys: 0.653 ± 0.022
ValAsp: 3.484 ± 0.048
ValGlu: 3.04 ± 0.048
ValPhe: 3.218 ± 0.048
ValGly: 3.735 ± 0.057
ValHis: 1.113 ± 0.026
ValIle: 4.803 ± 0.053
ValLys: 4.608 ± 0.053
ValLeu: 6.086 ± 0.073
ValMet: 1.508 ± 0.033
ValAsn: 3.928 ± 0.066
ValPro: 2.676 ± 0.045
ValGln: 2.099 ± 0.033
ValArg: 2.378 ± 0.041
ValSer: 4.412 ± 0.059
ValThr: 4.355 ± 0.088
ValVal: 4.348 ± 0.06
ValTrp: 0.744 ± 0.024
ValTyr: 2.729 ± 0.042
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.881 ± 0.026
TrpCys: 0.123 ± 0.008
TrpAsp: 0.686 ± 0.022
TrpGlu: 0.576 ± 0.019
TrpPhe: 0.615 ± 0.021
TrpGly: 0.82 ± 0.022
TrpHis: 0.293 ± 0.014
TrpIle: 0.748 ± 0.021
TrpLys: 0.774 ± 0.022
TrpLeu: 1.178 ± 0.027
TrpMet: 0.367 ± 0.016
TrpAsn: 0.666 ± 0.021
TrpPro: 0.417 ± 0.017
TrpGln: 0.577 ± 0.02
TrpArg: 0.511 ± 0.016
TrpSer: 0.671 ± 0.023
TrpThr: 0.667 ± 0.021
TrpVal: 0.777 ± 0.025
TrpTrp: 0.215 ± 0.013
TrpTyr: 0.503 ± 0.02
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.938 ± 0.045
TyrCys: 0.398 ± 0.017
TyrAsp: 2.383 ± 0.047
TyrGlu: 1.786 ± 0.033
TyrPhe: 2.219 ± 0.037
TyrGly: 2.927 ± 0.046
TyrHis: 1.068 ± 0.025
TyrIle: 2.643 ± 0.039
TyrLys: 2.584 ± 0.035
TyrLeu: 4.019 ± 0.054
TyrMet: 0.791 ± 0.023
TyrAsn: 2.616 ± 0.044
TyrPro: 1.808 ± 0.037
TyrGln: 1.852 ± 0.036
TyrArg: 1.965 ± 0.029
TyrSer: 2.709 ± 0.045
TyrThr: 2.827 ± 0.067
TyrVal: 2.339 ± 0.036
TyrTrp: 0.556 ± 0.021
TyrTyr: 2.053 ± 0.042
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4665 proteins (1631383 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
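One way such a protein-level bootstrap could work is sketched below in Python. This is an assumption-laden illustration, not the Proteome-pI code: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 replicates, and that the reported error is the standard deviation across replicates. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Assumption: whole proteins are resampled with replacement, so the
    resampling unit is the protein rather than the individual dipeptide.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter returns 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

toy = ["MKLLA", "MALLK", "LLLL", "MAAKT"]  # hypothetical sequences
mean_ll, err_ll = bootstrap_error(toy, "LL")
print(f"LL: {mean_ll:.1f} ± {err_ll:.1f} per mille")
```

Resampling proteins rather than single dipeptides keeps the pairs within one protein together, so the spread reflects protein-to-protein variation.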

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski