Amino acid dipepetide frequency for Paenibacillus uliginis N3/975

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.169AlaAla: 7.169 ± 0.095
0.688AlaCys: 0.688 ± 0.019
3.743AlaAsp: 3.743 ± 0.055
5.031AlaGlu: 5.031 ± 0.068
3.103AlaPhe: 3.103 ± 0.047
5.913AlaGly: 5.913 ± 0.066
1.273AlaHis: 1.273 ± 0.029
5.063AlaIle: 5.063 ± 0.055
4.054AlaLys: 4.054 ± 0.059
7.531AlaLeu: 7.531 ± 0.074
2.133AlaMet: 2.133 ± 0.037
2.487AlaAsn: 2.487 ± 0.041
2.239AlaPro: 2.239 ± 0.039
2.401AlaGln: 2.401 ± 0.041
2.984AlaArg: 2.984 ± 0.045
4.621AlaSer: 4.621 ± 0.053
3.226AlaThr: 3.226 ± 0.051
6.044AlaVal: 6.044 ± 0.076
0.81AlaTrp: 0.81 ± 0.02
2.498AlaTyr: 2.498 ± 0.042
0.0AlaXaa: 0.0 ± 0.0
Cys
0.504CysAla: 0.504 ± 0.018
0.108CysCys: 0.108 ± 0.008
0.438CysAsp: 0.438 ± 0.016
0.448CysGlu: 0.448 ± 0.017
0.356CysPhe: 0.356 ± 0.015
0.79CysGly: 0.79 ± 0.024
0.22CysHis: 0.22 ± 0.011
0.557CysIle: 0.557 ± 0.02
0.393CysLys: 0.393 ± 0.016
0.741CysLeu: 0.741 ± 0.019
0.237CysMet: 0.237 ± 0.012
0.308CysAsn: 0.308 ± 0.016
0.367CysPro: 0.367 ± 0.017
0.242CysGln: 0.242 ± 0.012
0.43CysArg: 0.43 ± 0.016
0.601CysSer: 0.601 ± 0.019
0.423CysThr: 0.423 ± 0.017
0.47CysVal: 0.47 ± 0.016
0.098CysTrp: 0.098 ± 0.006
0.293CysTyr: 0.293 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
3.411AspAla: 3.411 ± 0.053
0.422AspCys: 0.422 ± 0.014
2.573AspAsp: 2.573 ± 0.042
3.943AspGlu: 3.943 ± 0.051
2.226AspPhe: 2.226 ± 0.036
3.786AspGly: 3.786 ± 0.057
1.183AspHis: 1.183 ± 0.028
3.974AspIle: 3.974 ± 0.05
3.097AspLys: 3.097 ± 0.046
5.052AspLeu: 5.052 ± 0.059
1.548AspMet: 1.548 ± 0.03
2.047AspAsn: 2.047 ± 0.035
2.286AspPro: 2.286 ± 0.036
2.068AspGln: 2.068 ± 0.039
2.63AspArg: 2.63 ± 0.037
3.016AspSer: 3.016 ± 0.041
2.652AspThr: 2.652 ± 0.037
3.635AspVal: 3.635 ± 0.05
0.782AspTrp: 0.782 ± 0.022
2.135AspTyr: 2.135 ± 0.039
0.0AspXaa: 0.0 ± 0.0
Glu
5.443GluAla: 5.443 ± 0.063
0.45GluCys: 0.45 ± 0.017
3.504GluAsp: 3.504 ± 0.054
5.706GluGlu: 5.706 ± 0.071
2.311GluPhe: 2.311 ± 0.037
4.494GluGly: 4.494 ± 0.054
1.515GluHis: 1.515 ± 0.03
4.762GluIle: 4.762 ± 0.059
4.384GluLys: 4.384 ± 0.064
7.186GluLeu: 7.186 ± 0.073
2.193GluMet: 2.193 ± 0.043
2.805GluAsn: 2.805 ± 0.04
2.189GluPro: 2.189 ± 0.036
3.708GluGln: 3.708 ± 0.052
3.814GluArg: 3.814 ± 0.053
3.855GluSer: 3.855 ± 0.05
3.427GluThr: 3.427 ± 0.05
4.658GluVal: 4.658 ± 0.053
0.924GluTrp: 0.924 ± 0.024
2.231GluTyr: 2.231 ± 0.04
0.0GluXaa: 0.0 ± 0.0
Phe
2.962PheAla: 2.962 ± 0.043
0.4PheCys: 0.4 ± 0.014
2.297PheAsp: 2.297 ± 0.035
2.549PheGlu: 2.549 ± 0.039
1.936PhePhe: 1.936 ± 0.042
3.133PheGly: 3.133 ± 0.05
0.903PheHis: 0.903 ± 0.024
3.297PheIle: 3.297 ± 0.057
2.125PheLys: 2.125 ± 0.035
3.9PheLeu: 3.9 ± 0.061
1.321PheMet: 1.321 ± 0.028
1.798PheAsn: 1.798 ± 0.039
1.548PhePro: 1.548 ± 0.033
1.438PheGln: 1.438 ± 0.032
1.958PheArg: 1.958 ± 0.033
3.025PheSer: 3.025 ± 0.047
2.532PheThr: 2.532 ± 0.038
2.872PheVal: 2.872 ± 0.045
0.523PheTrp: 0.523 ± 0.021
1.557PheTyr: 1.557 ± 0.03
0.0PheXaa: 0.0 ± 0.0
Gly
5.034GlyAla: 5.034 ± 0.071
0.696GlyCys: 0.696 ± 0.021
3.456GlyAsp: 3.456 ± 0.045
4.461GlyGlu: 4.461 ± 0.058
3.268GlyPhe: 3.268 ± 0.046
5.208GlyGly: 5.208 ± 0.068
1.445GlyHis: 1.445 ± 0.031
5.841GlyIle: 5.841 ± 0.064
4.588GlyLys: 4.588 ± 0.054
7.021GlyLeu: 7.021 ± 0.067
2.43GlyMet: 2.43 ± 0.038
2.819GlyAsn: 2.819 ± 0.048
1.826GlyPro: 1.826 ± 0.037
2.574GlyGln: 2.574 ± 0.045
3.264GlyArg: 3.264 ± 0.047
4.699GlySer: 4.699 ± 0.063
4.266GlyThr: 4.266 ± 0.051
5.14GlyVal: 5.14 ± 0.055
0.974GlyTrp: 0.974 ± 0.028
2.995GlyTyr: 2.995 ± 0.045
0.0GlyXaa: 0.0 ± 0.0
His
1.385HisAla: 1.385 ± 0.027
0.2HisCys: 0.2 ± 0.011
1.041HisAsp: 1.041 ± 0.023
1.394HisGlu: 1.394 ± 0.033
1.021HisPhe: 1.021 ± 0.024
1.481HisGly: 1.481 ± 0.033
0.606HisHis: 0.606 ± 0.019
1.529HisIle: 1.529 ± 0.029
0.969HisLys: 0.969 ± 0.027
2.083HisLeu: 2.083 ± 0.041
0.611HisMet: 0.611 ± 0.022
0.756HisAsn: 0.756 ± 0.021
1.125HisPro: 1.125 ± 0.027
0.815HisGln: 0.815 ± 0.021
1.081HisArg: 1.081 ± 0.027
1.257HisSer: 1.257 ± 0.028
1.103HisThr: 1.103 ± 0.025
1.424HisVal: 1.424 ± 0.034
0.281HisTrp: 0.281 ± 0.013
0.858HisTyr: 0.858 ± 0.024
0.0HisXaa: 0.0 ± 0.0
Ile
5.571IleAla: 5.571 ± 0.061
0.647IleCys: 0.647 ± 0.019
3.887IleAsp: 3.887 ± 0.047
4.708IleGlu: 4.708 ± 0.056
2.661IlePhe: 2.661 ± 0.048
5.474IleGly: 5.474 ± 0.064
1.661IleHis: 1.661 ± 0.031
4.956IleIle: 4.956 ± 0.076
3.547IleLys: 3.547 ± 0.05
6.416IleLeu: 6.416 ± 0.072
1.982IleMet: 1.982 ± 0.035
2.719IleAsn: 2.719 ± 0.044
3.325IlePro: 3.325 ± 0.044
2.797IleGln: 2.797 ± 0.04
3.594IleArg: 3.594 ± 0.047
5.157IleSer: 5.157 ± 0.061
4.157IleThr: 4.157 ± 0.059
5.235IleVal: 5.235 ± 0.061
0.739IleTrp: 0.739 ± 0.02
2.233IleTyr: 2.233 ± 0.04
0.0IleXaa: 0.0 ± 0.0
Lys
4.162LysAla: 4.162 ± 0.058
0.306LysCys: 0.306 ± 0.014
3.411LysAsp: 3.411 ± 0.045
4.92LysGlu: 4.92 ± 0.061
1.707LysPhe: 1.707 ± 0.034
3.847LysGly: 3.847 ± 0.054
1.233LysHis: 1.233 ± 0.028
3.6LysIle: 3.6 ± 0.052
3.895LysLys: 3.895 ± 0.057
5.628LysLeu: 5.628 ± 0.068
1.844LysMet: 1.844 ± 0.034
2.442LysAsn: 2.442 ± 0.038
2.343LysPro: 2.343 ± 0.035
2.67LysGln: 2.67 ± 0.038
2.9LysArg: 2.9 ± 0.043
3.435LysSer: 3.435 ± 0.052
3.008LysThr: 3.008 ± 0.04
3.954LysVal: 3.954 ± 0.054
0.729LysTrp: 0.729 ± 0.019
2.045LysTyr: 2.045 ± 0.04
0.0LysXaa: 0.0 ± 0.0
Leu
7.361LeuAla: 7.361 ± 0.078
0.846LeuCys: 0.846 ± 0.026
5.156LeuAsp: 5.156 ± 0.064
6.351LeuGlu: 6.351 ± 0.074
4.619LeuPhe: 4.619 ± 0.066
6.634LeuGly: 6.634 ± 0.065
2.016LeuHis: 2.016 ± 0.036
6.948LeuIle: 6.948 ± 0.088
5.892LeuLys: 5.892 ± 0.064
10.652LeuLeu: 10.652 ± 0.121
2.779LeuMet: 2.779 ± 0.045
4.29LeuAsn: 4.29 ± 0.052
4.241LeuPro: 4.241 ± 0.051
3.824LeuGln: 3.824 ± 0.048
4.576LeuArg: 4.576 ± 0.058
7.393LeuSer: 7.393 ± 0.071
5.618LeuThr: 5.618 ± 0.053
6.183LeuVal: 6.183 ± 0.066
1.024LeuTrp: 1.024 ± 0.027
3.362LeuTyr: 3.362 ± 0.052
0.0LeuXaa: 0.0 ± 0.0
Met
2.139MetAla: 2.139 ± 0.041
0.188MetCys: 0.188 ± 0.011
1.693MetAsp: 1.693 ± 0.03
2.028MetGlu: 2.028 ± 0.031
1.16MetPhe: 1.16 ± 0.027
1.897MetGly: 1.897 ± 0.04
0.469MetHis: 0.469 ± 0.016
2.202MetIle: 2.202 ± 0.036
2.29MetLys: 2.29 ± 0.036
3.08MetLeu: 3.08 ± 0.042
0.995MetMet: 0.995 ± 0.027
1.809MetAsn: 1.809 ± 0.033
1.121MetPro: 1.121 ± 0.029
1.011MetGln: 1.011 ± 0.024
1.235MetArg: 1.235 ± 0.029
1.985MetSer: 1.985 ± 0.029
1.752MetThr: 1.752 ± 0.03
1.898MetVal: 1.898 ± 0.032
0.261MetTrp: 0.261 ± 0.013
0.847MetTyr: 0.847 ± 0.021
0.0MetXaa: 0.0 ± 0.0
Asn
2.687AsnAla: 2.687 ± 0.041
0.292AsnCys: 0.292 ± 0.013
2.093AsnAsp: 2.093 ± 0.038
2.95AsnGlu: 2.95 ± 0.041
1.445AsnPhe: 1.445 ± 0.027
3.31AsnGly: 3.31 ± 0.049
0.974AsnHis: 0.974 ± 0.025
2.854AsnIle: 2.854 ± 0.044
2.393AsnLys: 2.393 ± 0.039
3.57AsnLeu: 3.57 ± 0.044
1.235AsnMet: 1.235 ± 0.027
1.867AsnAsn: 1.867 ± 0.033
2.087AsnPro: 2.087 ± 0.036
1.639AsnGln: 1.639 ± 0.029
2.204AsnArg: 2.204 ± 0.032
2.386AsnSer: 2.386 ± 0.035
2.147AsnThr: 2.147 ± 0.039
2.761AsnVal: 2.761 ± 0.046
0.533AsnTrp: 0.533 ± 0.016
1.493AsnTyr: 1.493 ± 0.031
0.0AsnXaa: 0.0 ± 0.0
Pro
2.827ProAla: 2.827 ± 0.037
0.241ProCys: 0.241 ± 0.011
2.582ProAsp: 2.582 ± 0.04
3.401ProGlu: 3.401 ± 0.047
1.786ProPhe: 1.786 ± 0.035
2.767ProGly: 2.767 ± 0.053
0.822ProHis: 0.822 ± 0.022
2.518ProIle: 2.518 ± 0.04
1.94ProLys: 1.94 ± 0.034
3.805ProLeu: 3.805 ± 0.046
0.959ProMet: 0.959 ± 0.024
1.52ProAsn: 1.52 ± 0.031
1.183ProPro: 1.183 ± 0.031
1.324ProGln: 1.324 ± 0.029
1.34ProArg: 1.34 ± 0.026
2.559ProSer: 2.559 ± 0.045
1.923ProThr: 1.923 ± 0.037
3.132ProVal: 3.132 ± 0.042
0.487ProTrp: 0.487 ± 0.018
1.516ProTyr: 1.516 ± 0.034
0.0ProXaa: 0.0 ± 0.0
Gln
2.92GlnAla: 2.92 ± 0.046
0.233GlnCys: 0.233 ± 0.011
1.905GlnAsp: 1.905 ± 0.031
2.796GlnGlu: 2.796 ± 0.045
1.572GlnPhe: 1.572 ± 0.029
2.549GlnGly: 2.549 ± 0.038
0.827GlnHis: 0.827 ± 0.023
2.564GlnIle: 2.564 ± 0.04
2.239GlnLys: 2.239 ± 0.038
4.119GlnLeu: 4.119 ± 0.052
1.267GlnMet: 1.267 ± 0.026
1.507GlnAsn: 1.507 ± 0.035
1.402GlnPro: 1.402 ± 0.028
1.783GlnGln: 1.783 ± 0.04
1.797GlnArg: 1.797 ± 0.031
2.369GlnSer: 2.369 ± 0.032
1.864GlnThr: 1.864 ± 0.034
2.539GlnVal: 2.539 ± 0.042
0.507GlnTrp: 0.507 ± 0.017
1.423GlnTyr: 1.423 ± 0.03
0.0GlnXaa: 0.0 ± 0.0
Arg
2.816ArgAla: 2.816 ± 0.044
0.379ArgCys: 0.379 ± 0.016
2.368ArgAsp: 2.368 ± 0.042
3.566ArgGlu: 3.566 ± 0.048
2.093ArgPhe: 2.093 ± 0.035
2.849ArgGly: 2.849 ± 0.045
1.038ArgHis: 1.038 ± 0.023
3.61ArgIle: 3.61 ± 0.049
3.23ArgLys: 3.23 ± 0.046
4.909ArgLeu: 4.909 ± 0.053
1.593ArgMet: 1.593 ± 0.03
2.063ArgAsn: 2.063 ± 0.032
1.58ArgPro: 1.58 ± 0.033
1.923ArgGln: 1.923 ± 0.04
2.542ArgArg: 2.542 ± 0.049
3.01ArgSer: 3.01 ± 0.037
2.484ArgThr: 2.484 ± 0.038
2.955ArgVal: 2.955 ± 0.039
0.601ArgTrp: 0.601 ± 0.018
1.824ArgTyr: 1.824 ± 0.034
0.0ArgXaa: 0.0 ± 0.0
Ser
4.434SerAla: 4.434 ± 0.057
0.48SerCys: 0.48 ± 0.017
3.259SerAsp: 3.259 ± 0.048
4.121SerGlu: 4.121 ± 0.055
3.175SerPhe: 3.175 ± 0.047
5.44SerGly: 5.44 ± 0.061
1.308SerHis: 1.308 ± 0.027
4.788SerIle: 4.788 ± 0.056
3.622SerLys: 3.622 ± 0.049
6.651SerLeu: 6.651 ± 0.065
1.937SerMet: 1.937 ± 0.033
2.544SerAsn: 2.544 ± 0.043
2.505SerPro: 2.505 ± 0.04
2.147SerGln: 2.147 ± 0.038
3.147SerArg: 3.147 ± 0.045
4.688SerSer: 4.688 ± 0.057
3.371SerThr: 3.371 ± 0.047
4.681SerVal: 4.681 ± 0.05
0.845SerTrp: 0.845 ± 0.022
2.388SerTyr: 2.388 ± 0.042
0.0SerXaa: 0.0 ± 0.0
Thr
4.341ThrAla: 4.341 ± 0.051
0.394ThrCys: 0.394 ± 0.015
2.841ThrAsp: 2.841 ± 0.045
3.475ThrGlu: 3.475 ± 0.049
2.341ThrPhe: 2.341 ± 0.038
4.521ThrGly: 4.521 ± 0.083
1.035ThrHis: 1.035 ± 0.024
3.8ThrIle: 3.8 ± 0.049
2.652ThrLys: 2.652 ± 0.04
5.404ThrLeu: 5.404 ± 0.062
1.408ThrMet: 1.408 ± 0.029
1.989ThrAsn: 1.989 ± 0.034
2.464ThrPro: 2.464 ± 0.039
1.566ThrGln: 1.566 ± 0.033
2.105ThrArg: 2.105 ± 0.037
3.49ThrSer: 3.49 ± 0.046
2.843ThrThr: 2.843 ± 0.047
4.447ThrVal: 4.447 ± 0.049
0.603ThrTrp: 0.603 ± 0.016
1.889ThrTyr: 1.889 ± 0.033
0.0ThrXaa: 0.0 ± 0.0
Val
4.625ValAla: 4.625 ± 0.058
0.621ValCys: 0.621 ± 0.018
3.606ValAsp: 3.606 ± 0.048
4.455ValGlu: 4.455 ± 0.055
3.013ValPhe: 3.013 ± 0.051
4.438ValGly: 4.438 ± 0.055
1.45ValHis: 1.45 ± 0.032
5.312ValIle: 5.312 ± 0.063
4.137ValLys: 4.137 ± 0.049
7.231ValLeu: 7.231 ± 0.07
2.18ValMet: 2.18 ± 0.037
2.961ValAsn: 2.961 ± 0.038
2.902ValPro: 2.902 ± 0.045
2.56ValGln: 2.56 ± 0.041
3.218ValArg: 3.218 ± 0.05
4.893ValSer: 4.893 ± 0.053
4.304ValThr: 4.304 ± 0.056
4.897ValVal: 4.897 ± 0.061
0.835ValTrp: 0.835 ± 0.025
2.46ValTyr: 2.46 ± 0.04
0.0ValXaa: 0.0 ± 0.0
Trp
0.751TrpAla: 0.751 ± 0.022
0.094TrpCys: 0.094 ± 0.007
0.666TrpAsp: 0.666 ± 0.016
0.762TrpGlu: 0.762 ± 0.023
0.57TrpPhe: 0.57 ± 0.016
0.816TrpGly: 0.816 ± 0.023
0.245TrpHis: 0.245 ± 0.012
0.942TrpIle: 0.942 ± 0.023
0.776TrpLys: 0.776 ± 0.021
1.33TrpLeu: 1.33 ± 0.027
0.418TrpMet: 0.418 ± 0.014
0.698TrpAsn: 0.698 ± 0.021
0.334TrpPro: 0.334 ± 0.015
0.407TrpGln: 0.407 ± 0.015
0.568TrpArg: 0.568 ± 0.021
0.833TrpSer: 0.833 ± 0.023
0.61TrpThr: 0.61 ± 0.022
0.768TrpVal: 0.768 ± 0.023
0.184TrpTrp: 0.184 ± 0.011
0.408TrpTyr: 0.408 ± 0.015
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.47TyrAla: 2.47 ± 0.04
0.337TyrCys: 0.337 ± 0.015
1.944TyrAsp: 1.944 ± 0.035
2.496TyrGlu: 2.496 ± 0.036
1.707TyrPhe: 1.707 ± 0.034
2.659TyrGly: 2.659 ± 0.04
0.799TyrHis: 0.799 ± 0.024
2.35TyrIle: 2.35 ± 0.041
1.856TyrLys: 1.856 ± 0.036
3.458TyrLeu: 3.458 ± 0.051
0.99TyrMet: 0.99 ± 0.027
1.494TyrAsn: 1.494 ± 0.031
1.563TyrPro: 1.563 ± 0.028
1.283TyrGln: 1.283 ± 0.027
2.047TyrArg: 2.047 ± 0.038
2.256TyrSer: 2.256 ± 0.038
1.9TyrThr: 1.9 ± 0.035
2.406TyrVal: 2.406 ± 0.04
0.44TyrTrp: 0.44 ± 0.019
1.435TyrTyr: 1.435 ± 0.035
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5718 proteins (1773498 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski