Amino acid dipeptide frequency for Planococcus sp. Y42

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred with equal probability in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
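The comparison against the uniform expectation can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: it assumes that dipeptides are counted as overlapping adjacent residue pairs within each protein, that one-letter codes are used, and that any non-standard letter is mapped to X (Xaa). The function names `dipeptide_permille` and `classify` are hypothetical.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes

# Uniform expectation: 1 of 400 possible dipeptides = 0.25% = 2.5 per mille
UNIFORM_EXPECTED_PERMILLE = 1000.0 / 400


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille for a list of protein sequences.

    Assumption: dipeptides are overlapping adjacent pairs, and pairs never
    span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard residues to X (Xaa)
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_permille):
    """Label a frequency relative to the uniform expectation of 2.5 per mille."""
    if freq_permille > UNIFORM_EXPECTED_PERMILLE:
        return "over"   # would be marked red in the table
    if freq_permille < UNIFORM_EXPECTED_PERMILLE:
        return "under"  # would be marked blue in the table
    return "expected"


if __name__ == "__main__":
    # Values taken from the table: AlaAla = 9.343, AlaCys = 0.621
    print(classify(9.343), classify(0.621))
```

Under this rule AlaAla (9.343‰) counts as overrepresented and AlaCys (0.621‰) as underrepresented, which matches the uniform 0.25% baseline described above.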

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
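As a small worked example of the unit conversion, the snippet below turns a per mille value from the table into a fraction and into a percentage. The helper names are hypothetical and only serve the illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_permille / 10.0


if __name__ == "__main__":
    ala_ala = 9.343  # AlaAla value from the table, in per mille
    print(f"fraction: {permille_to_fraction(ala_ala):.6f}")
    print(f"percent:  {permille_to_percent(ala_ala):.4f}")
```

The same conversion shows that the uniform expectation of 0.25% corresponds to 2.5‰ on the scale used in the table.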

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.343AlaAla: 9.343 ± 0.123
0.621AlaCys: 0.621 ± 0.022
5.14AlaAsp: 5.14 ± 0.076
7.178AlaGlu: 7.178 ± 0.089
3.897AlaPhe: 3.897 ± 0.065
7.407AlaGly: 7.407 ± 0.092
1.578AlaHis: 1.578 ± 0.036
6.056AlaIle: 6.056 ± 0.082
4.284AlaLys: 4.284 ± 0.07
8.486AlaLeu: 8.486 ± 0.109
2.323AlaMet: 2.323 ± 0.049
2.666AlaAsn: 2.666 ± 0.051
2.565AlaPro: 2.565 ± 0.055
2.571AlaGln: 2.571 ± 0.048
3.523AlaArg: 3.523 ± 0.052
4.434AlaSer: 4.434 ± 0.064
3.801AlaThr: 3.801 ± 0.066
7.238AlaVal: 7.238 ± 0.098
0.737AlaTrp: 0.737 ± 0.028
2.706AlaTyr: 2.706 ± 0.054
0.0AlaXaa: 0.0 ± 0.0
Cys
0.448CysAla: 0.448 ± 0.024
0.087CysCys: 0.087 ± 0.01
0.32CysAsp: 0.32 ± 0.018
0.412CysGlu: 0.412 ± 0.017
0.26CysPhe: 0.26 ± 0.016
0.642CysGly: 0.642 ± 0.027
0.164CysHis: 0.164 ± 0.012
0.372CysIle: 0.372 ± 0.02
0.217CysLys: 0.217 ± 0.013
0.569CysLeu: 0.569 ± 0.022
0.139CysMet: 0.139 ± 0.01
0.204CysAsn: 0.204 ± 0.013
0.328CysPro: 0.328 ± 0.017
0.201CysGln: 0.201 ± 0.014
0.397CysArg: 0.397 ± 0.018
0.442CysSer: 0.442 ± 0.019
0.384CysThr: 0.384 ± 0.021
0.359CysVal: 0.359 ± 0.018
0.055CysTrp: 0.055 ± 0.007
0.199CysTyr: 0.199 ± 0.014
0.0CysXaa: 0.0 ± 0.0
Asp
4.237AspAla: 4.237 ± 0.065
0.354AspCys: 0.354 ± 0.02
2.463AspAsp: 2.463 ± 0.059
4.734AspGlu: 4.734 ± 0.076
2.579AspPhe: 2.579 ± 0.05
3.838AspGly: 3.838 ± 0.067
1.142AspHis: 1.142 ± 0.03
3.693AspIle: 3.693 ± 0.064
2.473AspLys: 2.473 ± 0.053
5.042AspLeu: 5.042 ± 0.074
1.475AspMet: 1.475 ± 0.037
1.63AspAsn: 1.63 ± 0.038
2.333AspPro: 2.333 ± 0.055
1.886AspGln: 1.886 ± 0.043
3.152AspArg: 3.152 ± 0.056
2.728AspSer: 2.728 ± 0.053
2.573AspThr: 2.573 ± 0.05
3.994AspVal: 3.994 ± 0.06
0.776AspTrp: 0.776 ± 0.029
2.095AspTyr: 2.095 ± 0.046
0.0AspXaa: 0.0 ± 0.0
Glu
7.106GluAla: 7.106 ± 0.084
0.302GluCys: 0.302 ± 0.018
4.134GluAsp: 4.134 ± 0.064
8.382GluGlu: 8.382 ± 0.118
2.805GluPhe: 2.805 ± 0.048
4.893GluGly: 4.893 ± 0.072
1.487GluHis: 1.487 ± 0.036
4.817GluIle: 4.817 ± 0.073
5.04GluLys: 5.04 ± 0.074
7.989GluLeu: 7.989 ± 0.099
2.276GluMet: 2.276 ± 0.047
3.151GluAsn: 3.151 ± 0.055
2.609GluPro: 2.609 ± 0.06
4.157GluGln: 4.157 ± 0.068
4.344GluArg: 4.344 ± 0.079
3.591GluSer: 3.591 ± 0.061
4.82GluThr: 4.82 ± 0.071
5.553GluVal: 5.553 ± 0.071
0.939GluTrp: 0.939 ± 0.03
2.01GluTyr: 2.01 ± 0.046
0.0GluXaa: 0.0 ± 0.0
Phe
3.536PheAla: 3.536 ± 0.065
0.3PheCys: 0.3 ± 0.019
2.524PheAsp: 2.524 ± 0.053
2.883PheGlu: 2.883 ± 0.051
2.319PhePhe: 2.319 ± 0.067
3.498PheGly: 3.498 ± 0.06
0.965PheHis: 0.965 ± 0.029
3.428PheIle: 3.428 ± 0.062
1.894PheLys: 1.894 ± 0.04
4.55PheLeu: 4.55 ± 0.083
1.154PheMet: 1.154 ± 0.034
1.728PheAsn: 1.728 ± 0.04
1.727PhePro: 1.727 ± 0.04
1.473PheGln: 1.473 ± 0.033
2.077PheArg: 2.077 ± 0.052
3.03PheSer: 3.03 ± 0.055
2.645PheThr: 2.645 ± 0.05
3.016PheVal: 3.016 ± 0.056
0.463PheTrp: 0.463 ± 0.022
1.602PheTyr: 1.602 ± 0.039
0.0PheXaa: 0.0 ± 0.0
Gly
5.73GlyAla: 5.73 ± 0.085
0.58GlyCys: 0.58 ± 0.026
3.586GlyAsp: 3.586 ± 0.068
5.185GlyGlu: 5.185 ± 0.086
3.606GlyPhe: 3.606 ± 0.062
5.207GlyGly: 5.207 ± 0.084
1.539GlyHis: 1.539 ± 0.04
5.691GlyIle: 5.691 ± 0.07
4.483GlyLys: 4.483 ± 0.071
7.059GlyLeu: 7.059 ± 0.078
2.255GlyMet: 2.255 ± 0.045
2.58GlyAsn: 2.58 ± 0.046
1.947GlyPro: 1.947 ± 0.041
2.547GlyGln: 2.547 ± 0.048
3.345GlyArg: 3.345 ± 0.061
4.156GlySer: 4.156 ± 0.067
4.475GlyThr: 4.475 ± 0.07
5.084GlyVal: 5.084 ± 0.076
0.883GlyTrp: 0.883 ± 0.03
2.728GlyTyr: 2.728 ± 0.05
0.0GlyXaa: 0.0 ± 0.0
His
1.644HisAla: 1.644 ± 0.046
0.185HisCys: 0.185 ± 0.013
1.062HisAsp: 1.062 ± 0.033
1.49HisGlu: 1.49 ± 0.037
0.961HisPhe: 0.961 ± 0.033
1.499HisGly: 1.499 ± 0.038
0.557HisHis: 0.557 ± 0.023
1.319HisIle: 1.319 ± 0.035
0.892HisLys: 0.892 ± 0.028
2.016HisLeu: 2.016 ± 0.045
0.508HisMet: 0.508 ± 0.019
0.646HisAsn: 0.646 ± 0.024
1.24HisPro: 1.24 ± 0.037
0.765HisGln: 0.765 ± 0.024
1.045HisArg: 1.045 ± 0.027
1.18HisSer: 1.18 ± 0.036
1.075HisThr: 1.075 ± 0.029
1.437HisVal: 1.437 ± 0.037
0.231HisTrp: 0.231 ± 0.013
0.75HisTyr: 0.75 ± 0.026
0.0HisXaa: 0.0 ± 0.0
Ile
6.311IleAla: 6.311 ± 0.09
0.467IleCys: 0.467 ± 0.02
3.957IleAsp: 3.957 ± 0.059
5.114IleGlu: 5.114 ± 0.069
2.644IlePhe: 2.644 ± 0.056
5.75IleGly: 5.75 ± 0.091
1.483IleHis: 1.483 ± 0.041
4.398IleIle: 4.398 ± 0.071
2.809IleLys: 2.809 ± 0.057
5.845IleLeu: 5.845 ± 0.079
1.467IleMet: 1.467 ± 0.035
2.297IleAsn: 2.297 ± 0.051
3.042IlePro: 3.042 ± 0.068
2.563IleGln: 2.563 ± 0.045
3.813IleArg: 3.813 ± 0.058
4.192IleSer: 4.192 ± 0.067
3.742IleThr: 3.742 ± 0.054
4.654IleVal: 4.654 ± 0.071
0.586IleTrp: 0.586 ± 0.024
2.009IleTyr: 2.009 ± 0.044
0.0IleXaa: 0.0 ± 0.0
Lys
4.47LysAla: 4.47 ± 0.074
0.23LysCys: 0.23 ± 0.014
2.68LysAsp: 2.68 ± 0.063
5.234LysGlu: 5.234 ± 0.079
1.724LysPhe: 1.724 ± 0.038
3.594LysGly: 3.594 ± 0.059
1.005LysHis: 1.005 ± 0.031
3.147LysIle: 3.147 ± 0.061
4.307LysLys: 4.307 ± 0.082
4.828LysLeu: 4.828 ± 0.08
1.732LysMet: 1.732 ± 0.035
2.246LysAsn: 2.246 ± 0.055
2.08LysPro: 2.08 ± 0.046
2.417LysGln: 2.417 ± 0.051
3.205LysArg: 3.205 ± 0.052
2.743LysSer: 2.743 ± 0.045
3.237LysThr: 3.237 ± 0.065
3.509LysVal: 3.509 ± 0.057
0.708LysTrp: 0.708 ± 0.025
1.529LysTyr: 1.529 ± 0.038
0.0LysXaa: 0.0 ± 0.0
Leu
8.993LeuAla: 8.993 ± 0.11
0.527LeuCys: 0.527 ± 0.023
4.954LeuAsp: 4.954 ± 0.072
7.03LeuGlu: 7.03 ± 0.093
4.753LeuPhe: 4.753 ± 0.089
6.316LeuGly: 6.316 ± 0.089
1.993LeuHis: 1.993 ± 0.05
6.632LeuIle: 6.632 ± 0.09
5.666LeuLys: 5.666 ± 0.089
10.78LeuLeu: 10.78 ± 0.141
2.473LeuMet: 2.473 ± 0.053
3.82LeuAsn: 3.82 ± 0.058
4.373LeuPro: 4.373 ± 0.07
3.778LeuGln: 3.778 ± 0.066
4.234LeuArg: 4.234 ± 0.066
6.221LeuSer: 6.221 ± 0.071
6.076LeuThr: 6.076 ± 0.079
6.292LeuVal: 6.292 ± 0.093
0.906LeuTrp: 0.906 ± 0.032
3.144LeuTyr: 3.144 ± 0.054
0.0LeuXaa: 0.0 ± 0.0
Met
2.389MetAla: 2.389 ± 0.054
0.123MetCys: 0.123 ± 0.011
1.529MetAsp: 1.529 ± 0.037
2.174MetGlu: 2.174 ± 0.042
0.905MetPhe: 0.905 ± 0.028
1.695MetGly: 1.695 ± 0.047
0.462MetHis: 0.462 ± 0.02
1.7MetIle: 1.7 ± 0.04
2.172MetLys: 2.172 ± 0.043
2.436MetLeu: 2.436 ± 0.046
0.775MetMet: 0.775 ± 0.033
1.374MetAsn: 1.374 ± 0.032
1.113MetPro: 1.113 ± 0.031
1.026MetGln: 1.026 ± 0.031
1.264MetArg: 1.264 ± 0.038
1.416MetSer: 1.416 ± 0.033
1.849MetThr: 1.849 ± 0.045
1.56MetVal: 1.56 ± 0.032
0.189MetTrp: 0.189 ± 0.014
0.673MetTyr: 0.673 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
2.989AsnAla: 2.989 ± 0.051
0.254AsnCys: 0.254 ± 0.015
1.803AsnAsp: 1.803 ± 0.044
2.852AsnGlu: 2.852 ± 0.054
1.517AsnPhe: 1.517 ± 0.035
3.074AsnGly: 3.074 ± 0.057
0.801AsnHis: 0.801 ± 0.025
2.34AsnIle: 2.34 ± 0.045
1.789AsnLys: 1.789 ± 0.043
3.217AsnLeu: 3.217 ± 0.059
0.948AsnMet: 0.948 ± 0.029
1.281AsnAsn: 1.281 ± 0.044
2.035AsnPro: 2.035 ± 0.047
1.381AsnGln: 1.381 ± 0.037
2.188AsnArg: 2.188 ± 0.049
1.917AsnSer: 1.917 ± 0.044
1.878AsnThr: 1.878 ± 0.045
2.596AsnVal: 2.596 ± 0.048
0.461AsnTrp: 0.461 ± 0.019
1.25AsnTyr: 1.25 ± 0.036
0.0AsnXaa: 0.0 ± 0.0
Pro
3.749ProAla: 3.749 ± 0.058
0.186ProCys: 0.186 ± 0.014
2.613ProAsp: 2.613 ± 0.05
3.901ProGlu: 3.901 ± 0.06
2.038ProPhe: 2.038 ± 0.043
2.78ProGly: 2.78 ± 0.054
0.857ProHis: 0.857 ± 0.026
2.406ProIle: 2.406 ± 0.044
1.891ProLys: 1.891 ± 0.04
3.823ProLeu: 3.823 ± 0.057
0.829ProMet: 0.829 ± 0.026
1.381ProAsn: 1.381 ± 0.036
1.203ProPro: 1.203 ± 0.032
1.186ProGln: 1.186 ± 0.035
1.274ProArg: 1.274 ± 0.032
2.091ProSer: 2.091 ± 0.045
1.777ProThr: 1.777 ± 0.039
3.596ProVal: 3.596 ± 0.06
0.37ProTrp: 0.37 ± 0.022
1.381ProTyr: 1.381 ± 0.038
0.0ProXaa: 0.0 ± 0.0
Gln
3.37GlnAla: 3.37 ± 0.067
0.181GlnCys: 0.181 ± 0.012
1.757GlnAsp: 1.757 ± 0.044
3.383GlnGlu: 3.383 ± 0.062
1.547GlnPhe: 1.547 ± 0.038
2.229GlnGly: 2.229 ± 0.047
0.827GlnHis: 0.827 ± 0.027
2.053GlnIle: 2.053 ± 0.037
2.164GlnLys: 2.164 ± 0.05
4.375GlnLeu: 4.375 ± 0.083
1.039GlnMet: 1.039 ± 0.031
1.381GlnAsn: 1.381 ± 0.035
1.485GlnPro: 1.485 ± 0.038
2.049GlnGln: 2.049 ± 0.058
1.821GlnArg: 1.821 ± 0.04
1.977GlnSer: 1.977 ± 0.045
2.023GlnThr: 2.023 ± 0.043
2.499GlnVal: 2.499 ± 0.054
0.41GlnTrp: 0.41 ± 0.019
1.139GlnTyr: 1.139 ± 0.034
0.0GlnXaa: 0.0 ± 0.0
Arg
3.222ArgAla: 3.222 ± 0.048
0.29ArgCys: 0.29 ± 0.016
2.498ArgAsp: 2.498 ± 0.049
4.001ArgGlu: 4.001 ± 0.064
2.322ArgPhe: 2.322 ± 0.049
2.845ArgGly: 2.845 ± 0.053
1.117ArgHis: 1.117 ± 0.033
3.415ArgIle: 3.415 ± 0.059
3.365ArgLys: 3.365 ± 0.063
5.111ArgLeu: 5.111 ± 0.082
1.514ArgMet: 1.514 ± 0.034
2.032ArgAsn: 2.032 ± 0.038
1.728ArgPro: 1.728 ± 0.039
2.258ArgGln: 2.258 ± 0.041
2.58ArgArg: 2.58 ± 0.06
2.624ArgSer: 2.624 ± 0.046
2.712ArgThr: 2.712 ± 0.049
3.017ArgVal: 3.017 ± 0.058
0.523ArgTrp: 0.523 ± 0.022
1.745ArgTyr: 1.745 ± 0.048
0.0ArgXaa: 0.0 ± 0.0
Ser
4.611SerAla: 4.611 ± 0.078
0.334SerCys: 0.334 ± 0.017
2.896SerAsp: 2.896 ± 0.051
3.837SerGlu: 3.837 ± 0.059
2.905SerPhe: 2.905 ± 0.051
4.845SerGly: 4.845 ± 0.071
1.124SerHis: 1.124 ± 0.028
3.873SerIle: 3.873 ± 0.054
2.717SerLys: 2.717 ± 0.052
5.512SerLeu: 5.512 ± 0.067
1.585SerMet: 1.585 ± 0.039
1.936SerAsn: 1.936 ± 0.041
2.255SerPro: 2.255 ± 0.044
1.785SerGln: 1.785 ± 0.041
2.81SerArg: 2.81 ± 0.052
3.4SerSer: 3.4 ± 0.072
2.716SerThr: 2.716 ± 0.057
4.278SerVal: 4.278 ± 0.063
0.609SerTrp: 0.609 ± 0.026
1.953SerTyr: 1.953 ± 0.045
0.0SerXaa: 0.0 ± 0.0
Thr
5.576ThrAla: 5.576 ± 0.075
0.362ThrCys: 0.362 ± 0.021
3.461ThrAsp: 3.461 ± 0.058
4.548ThrGlu: 4.548 ± 0.081
2.477ThrPhe: 2.477 ± 0.052
4.968ThrGly: 4.968 ± 0.061
1.054ThrHis: 1.054 ± 0.032
3.745ThrIle: 3.745 ± 0.058
2.536ThrLys: 2.536 ± 0.045
5.103ThrLeu: 5.103 ± 0.07
1.248ThrMet: 1.248 ± 0.031
1.861ThrAsn: 1.861 ± 0.043
2.353ThrPro: 2.353 ± 0.052
1.451ThrGln: 1.451 ± 0.038
2.117ThrArg: 2.117 ± 0.044
2.861ThrSer: 2.861 ± 0.056
2.635ThrThr: 2.635 ± 0.055
4.779ThrVal: 4.779 ± 0.075
0.538ThrTrp: 0.538 ± 0.025
1.893ThrTyr: 1.893 ± 0.038
0.0ThrXaa: 0.0 ± 0.0
Val
5.623ValAla: 5.623 ± 0.076
0.533ValCys: 0.533 ± 0.024
3.501ValAsp: 3.501 ± 0.061
4.882ValGlu: 4.882 ± 0.074
3.427ValPhe: 3.427 ± 0.06
4.342ValGly: 4.342 ± 0.074
1.467ValHis: 1.467 ± 0.038
5.252ValIle: 5.252 ± 0.077
3.829ValLys: 3.829 ± 0.066
7.718ValLeu: 7.718 ± 0.092
1.909ValMet: 1.909 ± 0.043
2.733ValAsn: 2.733 ± 0.046
3.146ValPro: 3.146 ± 0.054
2.51ValGln: 2.51 ± 0.049
3.353ValArg: 3.353 ± 0.054
4.429ValSer: 4.429 ± 0.063
4.622ValThr: 4.622 ± 0.067
4.667ValVal: 4.667 ± 0.082
0.691ValTrp: 0.691 ± 0.027
2.245ValTyr: 2.245 ± 0.041
0.0ValXaa: 0.0 ± 0.0
Trp
0.745TrpAla: 0.745 ± 0.025
0.052TrpCys: 0.052 ± 0.006
0.473TrpAsp: 0.473 ± 0.02
0.665TrpGlu: 0.665 ± 0.03
0.535TrpPhe: 0.535 ± 0.023
0.684TrpGly: 0.684 ± 0.027
0.225TrpHis: 0.225 ± 0.013
0.799TrpIle: 0.799 ± 0.028
0.602TrpLys: 0.602 ± 0.026
1.292TrpLeu: 1.292 ± 0.039
0.363TrpMet: 0.363 ± 0.018
0.517TrpAsn: 0.517 ± 0.023
0.296TrpPro: 0.296 ± 0.017
0.484TrpGln: 0.484 ± 0.022
0.498TrpArg: 0.498 ± 0.019
0.599TrpSer: 0.599 ± 0.027
0.664TrpThr: 0.664 ± 0.023
0.676TrpVal: 0.676 ± 0.028
0.14TrpTrp: 0.14 ± 0.01
0.311TrpTyr: 0.311 ± 0.016
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.463TyrAla: 2.463 ± 0.05
0.233TyrCys: 0.233 ± 0.016
1.843TyrAsp: 1.843 ± 0.042
2.52TyrGlu: 2.52 ± 0.05
1.592TyrPhe: 1.592 ± 0.041
2.559TyrGly: 2.559 ± 0.042
0.656TyrHis: 0.656 ± 0.024
2.01TyrIle: 2.01 ± 0.043
1.492TyrLys: 1.492 ± 0.041
3.21TyrLeu: 3.21 ± 0.063
0.811TyrMet: 0.811 ± 0.03
1.105TyrAsn: 1.105 ± 0.033
1.475TyrPro: 1.475 ± 0.036
1.22TyrGln: 1.22 ± 0.033
1.877TyrArg: 1.877 ± 0.045
1.949TyrSer: 1.949 ± 0.042
1.861TyrThr: 1.861 ± 0.037
2.077TyrVal: 2.077 ± 0.039
0.408TyrTrp: 0.408 ± 0.021
1.169TyrTyr: 1.169 ± 0.033
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3921 proteins (1117653 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
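A protein-level bootstrap of this kind could look roughly like the sketch below. It is not the database's actual procedure: it assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of the 100 replicates, and that the reported error is the standard deviation across replicates. The function names and the fixed seed are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_replicates=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Assumption: proteins, not residues, are the resampling unit, and the
    error is the sample standard deviation over the replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["AAAC", "ACAA", "CCAA", "AAAA"]  # made-up sequences
    mean, err = bootstrap_error(toy_proteome, "AA")
    print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the error reflects variation between proteins rather than between individual residue pairs.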

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski