Amino acid dipeptide frequency for Paracoccus suum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
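As a minimal sketch of how such frequencies can be obtained (not necessarily the procedure used by Proteome-pI), the Python snippet below counts overlapping dipeptides within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter amino acid codes, and the toy sequences are illustrative assumptions and do not come from the database.

```python
from collections import Counter

def dipeptide_frequencies(sequences):
    """Return dipeptide frequencies in per mille over all given sequences."""
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation for 20 standard amino acids: 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / (20 * 20)

# Made-up sequences, for illustration only.
toy_proteome = ["MAAL", "MALA"]
freqs = dipeptide_frequencies(toy_proteome)

# Dipeptides observed more often than the uniform expectation.
overrepresented = sorted(p for p, f in freqs.items() if f > EXPECTED_PER_MILLE)
```

With only six dipeptides in the toy input, every observed pair is far above 2.5‰; the comparison becomes meaningful only for a full proteome.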

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
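The unit conversion can be illustrated with a short Python sketch. The helper name `per_mille_to_fraction` is hypothetical; the AlaAla value is taken from the table, and 2.5‰ is the uniform expectation of 1/400.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

ala_ala_per_mille = 22.341   # AlaAla value from the table
uniform_per_mille = 2.5      # 1/400 expressed in per mille

ala_ala_fraction = per_mille_to_fraction(ala_ala_per_mille)
uniform_fraction = per_mille_to_fraction(uniform_per_mille)

# Ratio of observed to uniformly expected frequency (unit-independent).
enrichment = ala_ala_per_mille / uniform_per_mille
```

Here the AlaAla fraction is about 0.0223, roughly nine times the uniform expectation of 0.0025.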

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 22.341 ± 0.313
AlaCys: 1.057 ± 0.032
AlaAsp: 7.817 ± 0.117
AlaGlu: 9.186 ± 0.128
AlaPhe: 4.242 ± 0.063
AlaGly: 12.515 ± 0.144
AlaHis: 2.332 ± 0.055
AlaIle: 6.397 ± 0.075
AlaLys: 3.309 ± 0.08
AlaLeu: 14.909 ± 0.163
AlaMet: 4.229 ± 0.064
AlaAsn: 2.784 ± 0.057
AlaPro: 7.704 ± 0.135
AlaGln: 4.634 ± 0.082
AlaArg: 11.02 ± 0.142
AlaSer: 6.315 ± 0.088
AlaThr: 6.71 ± 0.092
AlaVal: 9.378 ± 0.112
AlaTrp: 1.671 ± 0.043
AlaTyr: 2.259 ± 0.055
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.053 ± 0.035
CysCys: 0.103 ± 0.009
CysAsp: 0.539 ± 0.025
CysGlu: 0.395 ± 0.022
CysPhe: 0.247 ± 0.017
CysGly: 0.864 ± 0.035
CysHis: 0.266 ± 0.018
CysIle: 0.377 ± 0.022
CysLys: 0.149 ± 0.012
CysLeu: 0.71 ± 0.031
CysMet: 0.139 ± 0.011
CysAsn: 0.192 ± 0.013
CysPro: 0.487 ± 0.023
CysGln: 0.217 ± 0.015
CysArg: 0.566 ± 0.029
CysSer: 0.346 ± 0.018
CysThr: 0.386 ± 0.023
CysVal: 0.51 ± 0.024
CysTrp: 0.115 ± 0.012
CysTyr: 0.161 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.854 ± 0.095
AspCys: 0.467 ± 0.024
AspAsp: 3.173 ± 0.077
AspGlu: 3.286 ± 0.064
AspPhe: 2.088 ± 0.046
AspGly: 5.683 ± 0.087
AspHis: 1.269 ± 0.039
AspIle: 2.485 ± 0.059
AspLys: 1.421 ± 0.05
AspLeu: 6.433 ± 0.081
AspMet: 1.371 ± 0.035
AspAsn: 1.147 ± 0.037
AspPro: 4.169 ± 0.075
AspGln: 1.794 ± 0.045
AspArg: 4.718 ± 0.081
AspSer: 2.252 ± 0.048
AspThr: 2.59 ± 0.059
AspVal: 3.821 ± 0.065
AspTrp: 1.209 ± 0.035
AspTyr: 1.4 ± 0.039
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.98 ± 0.131
GluCys: 0.312 ± 0.021
GluAsp: 2.757 ± 0.059
GluGlu: 2.386 ± 0.055
GluPhe: 1.554 ± 0.04
GluGly: 5.12 ± 0.078
GluHis: 0.975 ± 0.032
GluIle: 3.011 ± 0.059
GluLys: 1.45 ± 0.047
GluLeu: 4.534 ± 0.072
GluMet: 1.6 ± 0.042
GluAsn: 1.263 ± 0.035
GluPro: 2.644 ± 0.057
GluGln: 1.573 ± 0.039
GluArg: 4.27 ± 0.073
GluSer: 1.974 ± 0.048
GluThr: 2.975 ± 0.057
GluVal: 4.587 ± 0.076
GluTrp: 0.698 ± 0.032
GluTyr: 0.91 ± 0.031
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.483 ± 0.069
PheCys: 0.358 ± 0.02
PheAsp: 2.512 ± 0.049
PheGlu: 1.716 ± 0.051
PhePhe: 1.139 ± 0.039
PheGly: 3.394 ± 0.06
PheHis: 0.678 ± 0.026
PheIle: 1.447 ± 0.041
PheLys: 0.721 ± 0.029
PheLeu: 3.033 ± 0.05
PheMet: 0.733 ± 0.026
PheAsn: 0.89 ± 0.03
PhePro: 1.497 ± 0.04
PheGln: 0.959 ± 0.036
PheArg: 2.132 ± 0.052
PheSer: 1.787 ± 0.048
PheThr: 1.88 ± 0.051
PheVal: 2.476 ± 0.06
PheTrp: 0.567 ± 0.025
PheTyr: 0.793 ± 0.029
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.3 ± 0.146
GlyCys: 0.83 ± 0.028
GlyAsp: 4.786 ± 0.09
GlyGlu: 4.73 ± 0.07
GlyPhe: 3.544 ± 0.063
GlyGly: 8.322 ± 0.131
GlyHis: 1.886 ± 0.047
GlyIle: 4.518 ± 0.087
GlyLys: 2.914 ± 0.067
GlyLeu: 9.771 ± 0.109
GlyMet: 2.6 ± 0.056
GlyAsn: 2.134 ± 0.074
GlyPro: 4.486 ± 0.071
GlyGln: 3.2 ± 0.064
GlyArg: 6.986 ± 0.101
GlySer: 4.337 ± 0.069
GlyThr: 5.088 ± 0.086
GlyVal: 6.486 ± 0.089
GlyTrp: 1.62 ± 0.046
GlyTyr: 2.055 ± 0.047
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.418 ± 0.056
HisCys: 0.196 ± 0.017
HisAsp: 1.297 ± 0.038
HisGlu: 0.946 ± 0.033
HisPhe: 0.71 ± 0.026
HisGly: 1.922 ± 0.049
HisHis: 0.497 ± 0.025
HisIle: 0.822 ± 0.033
HisLys: 0.393 ± 0.021
HisLeu: 2.047 ± 0.049
HisMet: 0.464 ± 0.025
HisAsn: 0.394 ± 0.023
HisPro: 1.439 ± 0.042
HisGln: 0.546 ± 0.027
HisArg: 1.408 ± 0.038
HisSer: 0.779 ± 0.027
HisThr: 0.698 ± 0.032
HisVal: 1.407 ± 0.038
HisTrp: 0.339 ± 0.017
HisTyr: 0.498 ± 0.022
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.734 ± 0.091
IleCys: 0.484 ± 0.024
IleAsp: 3.427 ± 0.059
IleGlu: 3.217 ± 0.058
IlePhe: 1.499 ± 0.044
IleGly: 4.759 ± 0.073
IleHis: 0.852 ± 0.033
IleIle: 2.048 ± 0.055
IleLys: 1.099 ± 0.033
IleLeu: 4.397 ± 0.087
IleMet: 1.04 ± 0.031
IleAsn: 1.183 ± 0.037
IlePro: 2.243 ± 0.053
IleGln: 1.028 ± 0.035
IleArg: 3.133 ± 0.056
IleSer: 2.463 ± 0.05
IleThr: 2.803 ± 0.055
IleVal: 3.565 ± 0.072
IleTrp: 0.653 ± 0.027
IleTyr: 1.043 ± 0.035
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.446 ± 0.084
LysCys: 0.126 ± 0.01
LysAsp: 1.422 ± 0.046
LysGlu: 1.011 ± 0.035
LysPhe: 0.703 ± 0.029
LysGly: 2.324 ± 0.059
LysHis: 0.438 ± 0.02
LysIle: 1.225 ± 0.041
LysLys: 0.782 ± 0.036
LysLeu: 2.43 ± 0.06
LysMet: 0.65 ± 0.025
LysAsn: 0.581 ± 0.029
LysPro: 1.547 ± 0.042
LysGln: 0.678 ± 0.028
LysArg: 1.734 ± 0.048
LysSer: 1.411 ± 0.043
LysThr: 1.541 ± 0.041
LysVal: 1.985 ± 0.048
LysTrp: 0.288 ± 0.02
LysTyr: 0.496 ± 0.028
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.145 ± 0.157
LeuCys: 0.795 ± 0.032
LeuAsp: 6.09 ± 0.078
LeuGlu: 4.433 ± 0.076
LeuPhe: 3.074 ± 0.072
LeuGly: 9.055 ± 0.128
LeuHis: 1.854 ± 0.045
LeuIle: 5.225 ± 0.093
LeuLys: 2.56 ± 0.053
LeuLeu: 8.929 ± 0.137
LeuMet: 2.66 ± 0.059
LeuAsn: 2.314 ± 0.046
LeuPro: 6.254 ± 0.087
LeuGln: 2.618 ± 0.053
LeuArg: 7.778 ± 0.119
LeuSer: 6.108 ± 0.088
LeuThr: 6.386 ± 0.09
LeuVal: 7.069 ± 0.099
LeuTrp: 1.331 ± 0.038
LeuTyr: 1.73 ± 0.044
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.736 ± 0.067
MetCys: 0.138 ± 0.012
MetAsp: 1.355 ± 0.037
MetGlu: 1.112 ± 0.031
MetPhe: 0.745 ± 0.03
MetGly: 2.145 ± 0.055
MetHis: 0.411 ± 0.024
MetIle: 1.452 ± 0.037
MetLys: 0.742 ± 0.031
MetLeu: 2.744 ± 0.059
MetMet: 0.711 ± 0.029
MetAsn: 0.736 ± 0.026
MetPro: 1.602 ± 0.037
MetGln: 0.911 ± 0.027
MetArg: 1.997 ± 0.048
MetSer: 1.632 ± 0.042
MetThr: 2.01 ± 0.051
MetVal: 1.805 ± 0.046
MetTrp: 0.213 ± 0.015
MetTyr: 0.299 ± 0.02
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.917 ± 0.053
AsnCys: 0.218 ± 0.015
AsnAsp: 1.246 ± 0.058
AsnGlu: 0.965 ± 0.032
AsnPhe: 0.807 ± 0.032
AsnGly: 2.083 ± 0.054
AsnHis: 0.408 ± 0.018
AsnIle: 1.167 ± 0.038
AsnLys: 0.498 ± 0.023
AsnLeu: 2.382 ± 0.052
AsnMet: 0.575 ± 0.024
AsnAsn: 0.528 ± 0.027
AsnPro: 1.824 ± 0.041
AsnGln: 0.619 ± 0.025
AsnArg: 1.669 ± 0.041
AsnSer: 0.998 ± 0.037
AsnThr: 1.057 ± 0.039
AsnVal: 1.583 ± 0.043
AsnTrp: 0.419 ± 0.021
AsnTyr: 0.559 ± 0.029
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.388 ± 0.143
ProCys: 0.33 ± 0.02
ProAsp: 4.209 ± 0.069
ProGlu: 4.041 ± 0.067
ProPhe: 1.802 ± 0.049
ProGly: 5.603 ± 0.092
ProHis: 1.105 ± 0.037
ProIle: 2.258 ± 0.05
ProLys: 1.325 ± 0.042
ProLeu: 5.336 ± 0.083
ProMet: 1.354 ± 0.04
ProAsn: 1.19 ± 0.032
ProPro: 3.271 ± 0.078
ProGln: 2.075 ± 0.053
ProArg: 3.68 ± 0.078
ProSer: 2.647 ± 0.061
ProThr: 2.73 ± 0.054
ProVal: 4.624 ± 0.074
ProTrp: 0.775 ± 0.031
ProTyr: 1.122 ± 0.039
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.2 ± 0.073
GlnCys: 0.179 ± 0.015
GlnAsp: 1.508 ± 0.039
GlnGlu: 1.095 ± 0.035
GlnPhe: 1.003 ± 0.03
GlnGly: 2.661 ± 0.053
GlnHis: 0.58 ± 0.025
GlnIle: 1.929 ± 0.043
GlnLys: 0.791 ± 0.031
GlnLeu: 2.911 ± 0.062
GlnMet: 1.008 ± 0.032
GlnAsn: 0.73 ± 0.031
GlnPro: 2.005 ± 0.048
GlnGln: 1.117 ± 0.036
GlnArg: 2.388 ± 0.057
GlnSer: 1.589 ± 0.039
GlnThr: 1.727 ± 0.037
GlnVal: 2.446 ± 0.046
GlnTrp: 0.371 ± 0.02
GlnTyr: 0.551 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.886 ± 0.102
ArgCys: 0.485 ± 0.026
ArgAsp: 4.5 ± 0.068
ArgGlu: 4.085 ± 0.064
ArgPhe: 2.658 ± 0.052
ArgGly: 5.611 ± 0.077
ArgHis: 1.643 ± 0.042
ArgIle: 4.052 ± 0.063
ArgLys: 1.85 ± 0.04
ArgLeu: 8.596 ± 0.137
ArgMet: 2.144 ± 0.048
ArgAsn: 1.626 ± 0.036
ArgPro: 4.356 ± 0.078
ArgGln: 2.499 ± 0.056
ArgArg: 6.116 ± 0.09
ArgSer: 3.221 ± 0.062
ArgThr: 3.327 ± 0.059
ArgVal: 5.19 ± 0.082
ArgTrp: 1.08 ± 0.035
ArgTyr: 1.472 ± 0.045
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.207 ± 0.1
SerCys: 0.377 ± 0.023
SerAsp: 2.81 ± 0.061
SerGlu: 2.356 ± 0.048
SerPhe: 1.921 ± 0.048
SerGly: 5.467 ± 0.082
SerHis: 0.977 ± 0.034
SerIle: 2.284 ± 0.054
SerLys: 1.028 ± 0.041
SerLeu: 4.857 ± 0.081
SerMet: 1.156 ± 0.038
SerAsn: 1.131 ± 0.036
SerPro: 2.657 ± 0.064
SerGln: 1.503 ± 0.042
SerArg: 3.454 ± 0.069
SerSer: 2.15 ± 0.049
SerThr: 2.32 ± 0.049
SerVal: 3.545 ± 0.062
SerTrp: 0.695 ± 0.029
SerTyr: 1.134 ± 0.037
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.13 ± 0.093
ThrCys: 0.458 ± 0.025
ThrAsp: 3.051 ± 0.06
ThrGlu: 2.876 ± 0.058
ThrPhe: 1.737 ± 0.044
ThrGly: 5.522 ± 0.081
ThrHis: 1.037 ± 0.034
ThrIle: 2.5 ± 0.053
ThrLys: 1.114 ± 0.035
ThrLeu: 5.894 ± 0.082
ThrMet: 1.158 ± 0.036
ThrAsn: 1.13 ± 0.036
ThrPro: 3.692 ± 0.064
ThrGln: 1.423 ± 0.038
ThrArg: 3.794 ± 0.059
ThrSer: 2.435 ± 0.049
ThrThr: 2.898 ± 0.07
ThrVal: 4.224 ± 0.067
ThrTrp: 0.646 ± 0.029
ThrTyr: 1.043 ± 0.035
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.671 ± 0.108
ValCys: 0.557 ± 0.025
ValAsp: 3.954 ± 0.07
ValGlu: 4.214 ± 0.066
ValPhe: 2.454 ± 0.057
ValGly: 5.57 ± 0.083
ValHis: 1.25 ± 0.043
ValIle: 4.097 ± 0.072
ValLys: 1.839 ± 0.054
ValLeu: 7.605 ± 0.106
ValMet: 2.138 ± 0.053
ValAsn: 1.699 ± 0.043
ValPro: 4.18 ± 0.066
ValGln: 2.097 ± 0.047
ValArg: 4.737 ± 0.06
ValSer: 3.813 ± 0.062
ValThr: 4.838 ± 0.083
ValVal: 5.596 ± 0.105
ValTrp: 1.011 ± 0.035
ValTyr: 1.289 ± 0.04
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.471 ± 0.044
TrpCys: 0.141 ± 0.012
TrpAsp: 0.813 ± 0.033
TrpGlu: 0.525 ± 0.026
TrpPhe: 0.516 ± 0.024
TrpGly: 1.078 ± 0.039
TrpHis: 0.341 ± 0.019
TrpIle: 0.632 ± 0.026
TrpLys: 0.366 ± 0.021
TrpLeu: 1.866 ± 0.05
TrpMet: 0.364 ± 0.02
TrpAsn: 0.391 ± 0.022
TrpPro: 0.769 ± 0.031
TrpGln: 0.652 ± 0.025
TrpArg: 1.327 ± 0.039
TrpSer: 0.783 ± 0.028
TrpThr: 0.822 ± 0.031
TrpVal: 0.89 ± 0.036
TrpTrp: 0.261 ± 0.018
TrpTyr: 0.25 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.451 ± 0.05
TyrCys: 0.21 ± 0.014
TyrAsp: 1.362 ± 0.04
TyrGlu: 0.996 ± 0.038
TyrPhe: 0.752 ± 0.027
TyrGly: 1.894 ± 0.049
TyrHis: 0.421 ± 0.022
TyrIle: 0.744 ± 0.029
TyrLys: 0.467 ± 0.025
TyrLeu: 2.023 ± 0.044
TyrMet: 0.402 ± 0.022
TyrAsn: 0.483 ± 0.024
TyrPro: 1.039 ± 0.035
TyrGln: 0.645 ± 0.027
TyrArg: 1.489 ± 0.049
TyrSer: 0.989 ± 0.034
TyrThr: 0.975 ± 0.035
TyrVal: 1.424 ± 0.038
TyrTrp: 0.295 ± 0.017
TyrTyr: 0.476 ± 0.022
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3039 proteins (964569 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
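The page does not describe the bootstrap beyond the note above, so the following Python sketch shows only one plausible reading: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the toy sequences are assumptions made for illustration.

```python
import random
import statistics

def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over the given sequences."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Made-up sequences, for illustration only.
toy_proteome = ["MAAL", "MALA", "MKAA", "MLLA"]
point_estimate = dipeptide_per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so dipeptides that co-occur within one protein stay together in every resample.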

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski