Amino acid dipeptide frequency for Sphingobium sp. AP49

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs in the proteome.
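As an illustration of how such a frequency can be calculated, here is a minimal Python sketch. It assumes that dipeptides are counted as overlapping pairs of adjacent residues within each protein (never across protein boundaries) and that sequences are given in one-letter code; the function names and the toy sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter


def dipeptide_counts(proteins):
    """Count overlapping adjacent residue pairs within each protein."""
    counts = Counter()
    for seq in proteins:
        # zip(seq, seq[1:]) yields every overlapping pair of neighbours;
        # pairs never span two different proteins.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    return counts


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille (parts per thousand)."""
    counts = dipeptide_counts(proteins)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (hypothetical sequences, one-letter code).
toy_proteins = ["AAG", "AGA"]
toy_freqs = dipeptide_per_mille(toy_proteins)
```

Under this counting scheme a protein of length n contributes n - 1 dipeptides, so the frequencies of all observed dipeptides sum to 1000‰.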

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
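The expected value and the over/under classification described above can be sketched as follows. This is only an illustration: it assumes the comparison is made against the uniform expectation, and the function names are hypothetical. The two example values (AlaAla and CysCys) are taken from the table on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected frequency of one dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(value_per_mille, expected=None):
    """Label a dipeptide as over- or underrepresented versus the expectation."""
    if expected is None:
        expected = uniform_expectation_per_mille()
    if value_per_mille > expected:
        return "over"   # shown in red on the original page
    if value_per_mille < expected:
        return "under"  # shown in blue on the original page
    return "as expected"


expected_20 = uniform_expectation_per_mille(20)  # 1000 / 400
expected_21 = uniform_expectation_per_mille(21)  # 1000 / 441, with Xaa included
ala_ala = classify(18.498)  # AlaAla value from the table
cys_cys = classify(0.079)   # CysCys value from the table
```

With 20 amino acids the expectation is 2.5‰, so AlaAla (18.498‰) comes out as overrepresented and CysCys (0.079‰) as underrepresented.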

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 18.498 ± 0.184
AlaCys: 1.03 ± 0.028
AlaAsp: 8.262 ± 0.085
AlaGlu: 6.691 ± 0.087
AlaPhe: 4.199 ± 0.062
AlaGly: 11.349 ± 0.103
AlaHis: 2.509 ± 0.043
AlaIle: 7.259 ± 0.079
AlaLys: 4.007 ± 0.081
AlaLeu: 14.273 ± 0.13
AlaMet: 4.213 ± 0.064
AlaAsn: 3.182 ± 0.057
AlaPro: 6.598 ± 0.097
AlaGln: 5.19 ± 0.07
AlaArg: 9.586 ± 0.105
AlaSer: 6.585 ± 0.094
AlaThr: 6.763 ± 0.104
AlaVal: 8.211 ± 0.079
AlaTrp: 1.645 ± 0.036
AlaTyr: 2.779 ± 0.053
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.074 ± 0.03
CysCys: 0.079 ± 0.008
CysAsp: 0.464 ± 0.021
CysGlu: 0.341 ± 0.016
CysPhe: 0.288 ± 0.016
CysGly: 0.812 ± 0.023
CysHis: 0.215 ± 0.013
CysIle: 0.37 ± 0.015
CysLys: 0.153 ± 0.011
CysLeu: 0.724 ± 0.023
CysMet: 0.164 ± 0.012
CysAsn: 0.185 ± 0.012
CysPro: 0.432 ± 0.019
CysGln: 0.209 ± 0.012
CysArg: 0.489 ± 0.02
CysSer: 0.41 ± 0.018
CysThr: 0.438 ± 0.016
CysVal: 0.478 ± 0.018
CysTrp: 0.136 ± 0.01
CysTyr: 0.176 ± 0.011
CysXaa: 0.0 ± 0.0
Asp
AspAla: 8.119 ± 0.085
AspCys: 0.474 ± 0.021
AspAsp: 3.428 ± 0.055
AspGlu: 3.074 ± 0.06
AspPhe: 2.246 ± 0.043
AspGly: 6.083 ± 0.083
AspHis: 1.444 ± 0.037
AspIle: 3.396 ± 0.049
AspLys: 1.865 ± 0.039
AspLeu: 5.845 ± 0.072
AspMet: 1.725 ± 0.035
AspAsn: 1.537 ± 0.039
AspPro: 3.893 ± 0.067
AspGln: 2.065 ± 0.044
AspArg: 4.947 ± 0.073
AspSer: 2.545 ± 0.042
AspThr: 2.469 ± 0.046
AspVal: 4.047 ± 0.06
AspTrp: 1.163 ± 0.031
AspTyr: 1.823 ± 0.041
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.005 ± 0.084
GluCys: 0.29 ± 0.014
GluAsp: 2.566 ± 0.059
GluGlu: 2.702 ± 0.061
GluPhe: 1.26 ± 0.031
GluGly: 4.35 ± 0.065
GluHis: 0.973 ± 0.027
GluIle: 2.729 ± 0.05
GluLys: 1.912 ± 0.04
GluLeu: 4.449 ± 0.058
GluMet: 1.396 ± 0.033
GluAsn: 1.236 ± 0.029
GluPro: 2.296 ± 0.047
GluGln: 2.131 ± 0.04
GluArg: 4.153 ± 0.066
GluSer: 2.051 ± 0.043
GluThr: 2.836 ± 0.05
GluVal: 3.02 ± 0.054
GluTrp: 0.676 ± 0.023
GluTyr: 0.957 ± 0.025
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.782 ± 0.056
PheCys: 0.312 ± 0.017
PheAsp: 2.677 ± 0.049
PheGlu: 1.724 ± 0.035
PhePhe: 1.218 ± 0.034
PheGly: 3.518 ± 0.05
PheHis: 0.758 ± 0.025
PheIle: 1.444 ± 0.038
PheLys: 0.872 ± 0.025
PheLeu: 3.141 ± 0.053
PheMet: 0.758 ± 0.021
PheAsn: 0.972 ± 0.028
PhePro: 1.543 ± 0.031
PheGln: 0.921 ± 0.026
PheArg: 2.168 ± 0.042
PheSer: 2.008 ± 0.04
PheThr: 2.043 ± 0.049
PheVal: 2.437 ± 0.049
PheTrp: 0.514 ± 0.019
PheTyr: 0.94 ± 0.028
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.291 ± 0.099
GlyCys: 0.799 ± 0.024
GlyAsp: 5.114 ± 0.065
GlyGlu: 4.499 ± 0.069
GlyPhe: 3.598 ± 0.055
GlyGly: 7.925 ± 0.094
GlyHis: 1.954 ± 0.039
GlyIle: 4.561 ± 0.058
GlyLys: 3.356 ± 0.057
GlyLeu: 8.841 ± 0.096
GlyMet: 2.512 ± 0.047
GlyAsn: 2.381 ± 0.06
GlyPro: 3.801 ± 0.063
GlyGln: 3.315 ± 0.045
GlyArg: 6.325 ± 0.083
GlySer: 4.806 ± 0.067
GlyThr: 4.955 ± 0.094
GlyVal: 6.031 ± 0.068
GlyTrp: 1.75 ± 0.046
GlyTyr: 2.498 ± 0.049
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.468 ± 0.042
HisCys: 0.206 ± 0.013
HisAsp: 1.323 ± 0.035
HisGlu: 0.945 ± 0.031
HisPhe: 0.903 ± 0.023
HisGly: 2.127 ± 0.044
HisHis: 0.584 ± 0.023
HisIle: 1.034 ± 0.028
HisLys: 0.489 ± 0.019
HisLeu: 1.94 ± 0.041
HisMet: 0.543 ± 0.021
HisAsn: 0.492 ± 0.018
HisPro: 1.285 ± 0.03
HisGln: 0.582 ± 0.022
HisArg: 1.431 ± 0.037
HisSer: 0.951 ± 0.025
HisThr: 0.586 ± 0.021
HisVal: 1.6 ± 0.035
HisTrp: 0.416 ± 0.017
HisTyr: 0.643 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 8.133 ± 0.079
IleCys: 0.472 ± 0.02
IleAsp: 4.247 ± 0.063
IleGlu: 3.202 ± 0.056
IlePhe: 1.751 ± 0.036
IleGly: 5.374 ± 0.072
IleHis: 0.993 ± 0.025
IleIle: 2.4 ± 0.052
IleLys: 1.247 ± 0.03
IleLeu: 4.596 ± 0.066
IleMet: 0.984 ± 0.027
IleAsn: 1.42 ± 0.038
IlePro: 2.336 ± 0.04
IleGln: 1.248 ± 0.034
IleArg: 3.188 ± 0.045
IleSer: 2.753 ± 0.046
IleThr: 2.49 ± 0.056
IleVal: 4.034 ± 0.062
IleTrp: 0.71 ± 0.02
IleTyr: 1.064 ± 0.028
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.32 ± 0.072
LysCys: 0.149 ± 0.008
LysAsp: 1.747 ± 0.044
LysGlu: 1.244 ± 0.036
LysPhe: 0.78 ± 0.026
LysGly: 2.824 ± 0.048
LysHis: 0.477 ± 0.02
LysIle: 1.518 ± 0.036
LysLys: 1.126 ± 0.035
LysLeu: 3.086 ± 0.053
LysMet: 0.758 ± 0.025
LysAsn: 0.748 ± 0.027
LysPro: 2.071 ± 0.042
LysGln: 0.965 ± 0.029
LysArg: 2.082 ± 0.042
LysSer: 1.526 ± 0.033
LysThr: 1.61 ± 0.042
LysVal: 2.213 ± 0.049
LysTrp: 0.387 ± 0.02
LysTyr: 0.607 ± 0.02
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 13.865 ± 0.129
LeuCys: 0.847 ± 0.025
LeuAsp: 6.216 ± 0.079
LeuGlu: 4.357 ± 0.07
LeuPhe: 3.788 ± 0.064
LeuGly: 8.096 ± 0.094
LeuHis: 1.866 ± 0.039
LeuIle: 5.096 ± 0.062
LeuLys: 2.977 ± 0.056
LeuLeu: 10.076 ± 0.133
LeuMet: 2.288 ± 0.041
LeuAsn: 2.459 ± 0.045
LeuPro: 5.894 ± 0.075
LeuGln: 2.589 ± 0.04
LeuArg: 6.673 ± 0.076
LeuSer: 6.29 ± 0.078
LeuThr: 5.659 ± 0.075
LeuVal: 6.931 ± 0.093
LeuTrp: 1.353 ± 0.033
LeuTyr: 2.104 ± 0.042
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.678 ± 0.062
MetCys: 0.14 ± 0.009
MetAsp: 1.295 ± 0.029
MetGlu: 1.069 ± 0.031
MetPhe: 0.667 ± 0.024
MetGly: 2.048 ± 0.039
MetHis: 0.409 ± 0.02
MetIle: 1.416 ± 0.034
MetLys: 0.95 ± 0.026
MetLeu: 2.875 ± 0.053
MetMet: 0.706 ± 0.022
MetAsn: 0.704 ± 0.023
MetPro: 1.611 ± 0.037
MetGln: 0.908 ± 0.026
MetArg: 1.843 ± 0.036
MetSer: 1.435 ± 0.033
MetThr: 1.843 ± 0.035
MetVal: 1.75 ± 0.041
MetTrp: 0.233 ± 0.013
MetTyr: 0.268 ± 0.014
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.357 ± 0.07
AsnCys: 0.225 ± 0.014
AsnAsp: 1.439 ± 0.035
AsnGlu: 1.05 ± 0.028
AsnPhe: 0.997 ± 0.028
AsnGly: 2.636 ± 0.052
AsnHis: 0.497 ± 0.018
AsnIle: 1.433 ± 0.035
AsnLys: 0.684 ± 0.025
AsnLeu: 2.408 ± 0.047
AsnMet: 0.626 ± 0.02
AsnAsn: 0.706 ± 0.033
AsnPro: 1.729 ± 0.037
AsnGln: 0.798 ± 0.024
AsnArg: 1.812 ± 0.038
AsnSer: 1.333 ± 0.045
AsnThr: 0.961 ± 0.032
AsnVal: 1.912 ± 0.049
AsnTrp: 0.459 ± 0.019
AsnTyr: 0.769 ± 0.038
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 7.293 ± 0.097
ProCys: 0.301 ± 0.015
ProAsp: 4.064 ± 0.065
ProGlu: 3.128 ± 0.05
ProPhe: 1.997 ± 0.042
ProGly: 4.902 ± 0.073
ProHis: 1.137 ± 0.026
ProIle: 2.626 ± 0.045
ProLys: 1.516 ± 0.035
ProLeu: 5.073 ± 0.072
ProMet: 1.37 ± 0.033
ProAsn: 1.296 ± 0.031
ProPro: 2.839 ± 0.079
ProGln: 1.896 ± 0.039
ProArg: 2.952 ± 0.046
ProSer: 2.742 ± 0.051
ProThr: 2.722 ± 0.051
ProVal: 4.454 ± 0.06
ProTrp: 0.741 ± 0.026
ProTyr: 1.156 ± 0.033
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.837 ± 0.064
GlnCys: 0.242 ± 0.014
GlnAsp: 1.727 ± 0.035
GlnGlu: 1.365 ± 0.032
GlnPhe: 1.115 ± 0.027
GlnGly: 2.849 ± 0.05
GlnHis: 0.603 ± 0.022
GlnIle: 1.896 ± 0.038
GlnLys: 0.995 ± 0.028
GlnLeu: 3.224 ± 0.05
GlnMet: 0.928 ± 0.026
GlnAsn: 0.792 ± 0.026
GlnPro: 2.033 ± 0.047
GlnGln: 1.328 ± 0.037
GlnArg: 2.597 ± 0.047
GlnSer: 1.849 ± 0.037
GlnThr: 1.685 ± 0.036
GlnVal: 2.355 ± 0.049
GlnTrp: 0.552 ± 0.021
GlnTyr: 0.727 ± 0.026
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.572 ± 0.088
ArgCys: 0.463 ± 0.019
ArgAsp: 4.281 ± 0.058
ArgGlu: 3.399 ± 0.058
ArgPhe: 2.949 ± 0.056
ArgGly: 4.828 ± 0.065
ArgHis: 1.738 ± 0.039
ArgIle: 4.281 ± 0.069
ArgLys: 2.025 ± 0.041
ArgLeu: 7.956 ± 0.093
ArgMet: 1.904 ± 0.039
ArgAsn: 1.848 ± 0.041
ArgPro: 3.691 ± 0.063
ArgGln: 2.607 ± 0.048
ArgArg: 5.39 ± 0.075
ArgSer: 3.513 ± 0.052
ArgThr: 3.506 ± 0.048
ArgVal: 4.321 ± 0.06
ArgTrp: 1.227 ± 0.029
ArgTyr: 1.965 ± 0.045
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.517 ± 0.113
SerCys: 0.42 ± 0.022
SerAsp: 3.164 ± 0.047
SerGlu: 2.249 ± 0.045
SerPhe: 2.155 ± 0.035
SerGly: 5.363 ± 0.082
SerHis: 1.036 ± 0.032
SerIle: 2.878 ± 0.05
SerLys: 1.426 ± 0.036
SerLeu: 5.262 ± 0.071
SerMet: 1.223 ± 0.028
SerAsn: 1.403 ± 0.042
SerPro: 2.819 ± 0.045
SerGln: 1.638 ± 0.039
SerArg: 3.399 ± 0.056
SerSer: 2.794 ± 0.057
SerThr: 2.598 ± 0.053
SerVal: 3.65 ± 0.052
SerTrp: 0.841 ± 0.027
SerTyr: 1.482 ± 0.037
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.321 ± 0.093
ThrCys: 0.356 ± 0.015
ThrAsp: 3.095 ± 0.051
ThrGlu: 2.117 ± 0.044
ThrPhe: 1.591 ± 0.033
ThrGly: 5.379 ± 0.083
ThrHis: 1.007 ± 0.026
ThrIle: 2.995 ± 0.054
ThrLys: 1.378 ± 0.039
ThrLeu: 5.851 ± 0.074
ThrMet: 1.197 ± 0.028
ThrAsn: 1.295 ± 0.046
ThrPro: 3.59 ± 0.059
ThrGln: 1.677 ± 0.036
ThrArg: 3.306 ± 0.052
ThrSer: 2.693 ± 0.065
ThrThr: 2.606 ± 0.052
ThrVal: 3.875 ± 0.064
ThrTrp: 0.601 ± 0.02
ThrTyr: 1.198 ± 0.035
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.304 ± 0.093
ValCys: 0.479 ± 0.02
ValAsp: 4.561 ± 0.073
ValGlu: 4.196 ± 0.065
ValPhe: 1.835 ± 0.036
ValGly: 5.492 ± 0.071
ValHis: 1.397 ± 0.033
ValIle: 3.618 ± 0.055
ValLys: 2.133 ± 0.042
ValLeu: 5.844 ± 0.072
ValMet: 1.634 ± 0.034
ValAsn: 1.983 ± 0.049
ValPro: 3.826 ± 0.05
ValGln: 2.201 ± 0.047
ValArg: 4.86 ± 0.068
ValSer: 3.802 ± 0.056
ValThr: 4.439 ± 0.065
ValVal: 4.686 ± 0.07
ValTrp: 0.775 ± 0.025
ValTyr: 1.413 ± 0.034
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.522 ± 0.039
TrpCys: 0.115 ± 0.009
TrpAsp: 0.789 ± 0.027
TrpGlu: 0.568 ± 0.022
TrpPhe: 0.514 ± 0.019
TrpGly: 1.022 ± 0.031
TrpHis: 0.372 ± 0.015
TrpIle: 0.779 ± 0.022
TrpLys: 0.483 ± 0.018
TrpLeu: 1.777 ± 0.037
TrpMet: 0.406 ± 0.016
TrpAsn: 0.487 ± 0.019
TrpPro: 0.776 ± 0.025
TrpGln: 0.62 ± 0.021
TrpArg: 1.288 ± 0.035
TrpSer: 0.939 ± 0.03
TrpThr: 0.933 ± 0.03
TrpVal: 0.863 ± 0.025
TrpTrp: 0.269 ± 0.017
TrpTyr: 0.318 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.873 ± 0.05
TyrCys: 0.237 ± 0.015
TyrAsp: 1.726 ± 0.05
TyrGlu: 1.07 ± 0.03
TyrPhe: 0.903 ± 0.027
TyrGly: 2.335 ± 0.047
TyrHis: 0.579 ± 0.022
TyrIle: 0.913 ± 0.027
TyrLys: 0.645 ± 0.024
TyrLeu: 2.184 ± 0.042
TyrMet: 0.514 ± 0.018
TyrAsn: 0.706 ± 0.024
TyrPro: 1.141 ± 0.029
TyrGln: 0.811 ± 0.027
TyrArg: 1.974 ± 0.042
TyrSer: 1.291 ± 0.038
TyrThr: 0.988 ± 0.032
TyrVal: 1.607 ± 0.036
TyrTrp: 0.392 ± 0.019
TyrTyr: 0.735 ± 0.026
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4212 proteins (1352865 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
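A protein-level bootstrap of this kind might look roughly like the sketch below. It rests on several assumptions that the page does not confirm: whole proteins are resampled with replacement, the frequency is recomputed for each of the 100 replicates, and the reported error is the standard deviation across replicates. The function names and the fixed seed are hypothetical, and sequences are assumed to be in one-letter code.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins):
    """Dipeptide frequencies in per mille, counted within each protein."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of one dipeptide frequency over bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins (not individual residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences).
toy = ["AAGA", "GAGA", "AAAA", "GGGA"]
err_aa = bootstrap_error(toy, "AA")
```

Resampling at the protein level keeps each sequence intact, so the estimate reflects variation between proteins rather than between individual residue pairs.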

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski