Amino acid dipeptide frequency for Arthrobacter sp. NCCP-1664

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency: how often each ordered pair of adjacent amino acids occurs in the proteome.
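As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues and reports them in per mille (‰), the unit used in the table. It assumes one-letter sequences, pairs that never span two proteins, and normalization by the total number of dipeptides; the function name `dipeptide_frequencies` is hypothetical, and the exact procedure used by the database is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2, counted within each protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real proteome data): "MAAG" gives MA, AA, AG; "AAL" gives AA, AL.
freqs = dipeptide_frequencies(["MAAG", "AAL"])
print(freqs["AA"])  # AA is 2 of 5 dipeptides -> 400.0
```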

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random and all 20 amino acids were equally frequent, each dipeptide would be present with a frequency of 1/400 = 0.25%. This is not the case in nature, so for better visibility dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions. On this scale, the 0.25% expectation corresponds to 2.5‰.
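The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and treating the uniform 1/400 baseline as the threshold for over- and underrepresentation is an assumption based on the text above, not a documented rule of the database. The two example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / (N_STANDARD ** 2)


def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def representation(observed_permille, expected_permille=EXPECTED_PERMILLE):
    """Classify a dipeptide relative to the assumed uniform baseline."""
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"


for name, value in [("AlaAla", 24.227), ("CysCys", 0.055)]:
    print(name, permille_to_fraction(value), representation(value))
```

With this baseline, AlaAla (24.227‰) is classified as overrepresented and CysCys (0.055‰) as underrepresented.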

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 24.227 ± 0.264
AlaCys: 0.835 ± 0.034
AlaAsp: 7.088 ± 0.1
AlaGlu: 8.908 ± 0.102
AlaPhe: 3.833 ± 0.065
AlaGly: 15.882 ± 0.166
AlaHis: 2.592 ± 0.051
AlaIle: 4.47 ± 0.076
AlaLys: 3.018 ± 0.068
AlaLeu: 14.551 ± 0.138
AlaMet: 2.75 ± 0.059
AlaAsn: 2.53 ± 0.053
AlaPro: 7.302 ± 0.13
AlaGln: 3.971 ± 0.071
AlaArg: 9.943 ± 0.119
AlaSer: 7.041 ± 0.085
AlaThr: 6.965 ± 0.097
AlaVal: 12.049 ± 0.125
AlaTrp: 1.902 ± 0.05
AlaTyr: 2.494 ± 0.044
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.864 ± 0.031
CysCys: 0.055 ± 0.009
CysAsp: 0.304 ± 0.017
CysGlu: 0.293 ± 0.014
CysPhe: 0.19 ± 0.012
CysGly: 0.751 ± 0.029
CysHis: 0.174 ± 0.015
CysIle: 0.226 ± 0.013
CysLys: 0.088 ± 0.009
CysLeu: 0.582 ± 0.025
CysMet: 0.116 ± 0.011
CysAsn: 0.124 ± 0.011
CysPro: 0.41 ± 0.024
CysGln: 0.169 ± 0.013
CysArg: 0.492 ± 0.025
CysSer: 0.39 ± 0.019
CysThr: 0.386 ± 0.021
CysVal: 0.466 ± 0.023
CysTrp: 0.098 ± 0.01
CysTyr: 0.115 ± 0.01
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.366 ± 0.092
AspCys: 0.297 ± 0.019
AspAsp: 2.363 ± 0.062
AspGlu: 3.113 ± 0.068
AspPhe: 1.761 ± 0.051
AspGly: 5.493 ± 0.084
AspHis: 1.134 ± 0.038
AspIle: 2.043 ± 0.052
AspLys: 1.059 ± 0.035
AspLeu: 6.005 ± 0.079
AspMet: 0.864 ± 0.032
AspAsn: 0.841 ± 0.031
AspPro: 4.176 ± 0.075
AspGln: 1.413 ± 0.044
AspArg: 3.943 ± 0.071
AspSer: 2.448 ± 0.049
AspThr: 2.677 ± 0.055
AspVal: 4.234 ± 0.064
AspTrp: 0.79 ± 0.028
AspTyr: 1.145 ± 0.038
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.615 ± 0.119
GluCys: 0.31 ± 0.02
GluAsp: 3.421 ± 0.057
GluGlu: 3.757 ± 0.07
GluPhe: 1.653 ± 0.046
GluGly: 4.813 ± 0.073
GluHis: 1.578 ± 0.04
GluIle: 2.433 ± 0.063
GluLys: 1.533 ± 0.048
GluLeu: 6.399 ± 0.087
GluMet: 0.969 ± 0.031
GluAsn: 1.297 ± 0.035
GluPro: 3.177 ± 0.061
GluGln: 2.123 ± 0.052
GluArg: 4.817 ± 0.077
GluSer: 2.654 ± 0.056
GluThr: 2.852 ± 0.059
GluVal: 4.585 ± 0.069
GluTrp: 0.76 ± 0.029
GluTyr: 1.114 ± 0.033
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.117 ± 0.064
PheCys: 0.239 ± 0.017
PheAsp: 1.914 ± 0.048
PheGlu: 1.73 ± 0.042
PhePhe: 1.032 ± 0.038
PheGly: 3.501 ± 0.071
PheHis: 0.597 ± 0.023
PheIle: 1.138 ± 0.038
PheLys: 0.597 ± 0.027
PheLeu: 2.873 ± 0.062
PheMet: 0.523 ± 0.025
PheAsn: 0.709 ± 0.029
PhePro: 1.466 ± 0.042
PheGln: 0.746 ± 0.034
PheArg: 1.88 ± 0.047
PheSer: 1.723 ± 0.044
PheThr: 1.96 ± 0.045
PheVal: 2.369 ± 0.052
PheTrp: 0.443 ± 0.022
PheTyr: 0.645 ± 0.031
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.763 ± 0.139
GlyCys: 0.674 ± 0.028
GlyAsp: 4.427 ± 0.077
GlyGlu: 5.112 ± 0.085
GlyPhe: 3.31 ± 0.079
GlyGly: 8.853 ± 0.151
GlyHis: 2.313 ± 0.051
GlyIle: 4.644 ± 0.073
GlyLys: 2.566 ± 0.064
GlyLeu: 10.12 ± 0.107
GlyMet: 2.098 ± 0.039
GlyAsn: 2.202 ± 0.061
GlyPro: 5.38 ± 0.089
GlyGln: 3.195 ± 0.065
GlyArg: 7.375 ± 0.106
GlySer: 5.595 ± 0.081
GlyThr: 6.85 ± 0.094
GlyVal: 7.278 ± 0.089
GlyTrp: 1.69 ± 0.039
GlyTyr: 2.218 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.595 ± 0.058
HisCys: 0.159 ± 0.013
HisAsp: 1.091 ± 0.036
HisGlu: 1.144 ± 0.041
HisPhe: 0.651 ± 0.026
HisGly: 2.339 ± 0.048
HisHis: 0.645 ± 0.029
HisIle: 0.697 ± 0.026
HisLys: 0.404 ± 0.021
HisLeu: 2.233 ± 0.058
HisMet: 0.39 ± 0.019
HisAsn: 0.435 ± 0.022
HisPro: 1.673 ± 0.049
HisGln: 0.615 ± 0.026
HisArg: 1.782 ± 0.043
HisSer: 1.061 ± 0.035
HisThr: 1.093 ± 0.037
HisVal: 1.639 ± 0.039
HisTrp: 0.344 ± 0.019
HisTyr: 0.496 ± 0.026
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.28 ± 0.076
IleCys: 0.283 ± 0.017
IleAsp: 2.518 ± 0.061
IleGlu: 2.344 ± 0.057
IlePhe: 1.076 ± 0.036
IleGly: 3.885 ± 0.073
IleHis: 0.748 ± 0.028
IleIle: 1.415 ± 0.044
IleLys: 0.896 ± 0.037
IleLeu: 3.316 ± 0.062
IleMet: 0.583 ± 0.025
IleAsn: 0.973 ± 0.037
IlePro: 2.166 ± 0.051
IleGln: 0.907 ± 0.032
IleArg: 2.465 ± 0.05
IleSer: 2.043 ± 0.046
IleThr: 2.208 ± 0.059
IleVal: 3.285 ± 0.07
IleTrp: 0.379 ± 0.019
IleTyr: 0.685 ± 0.032
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.095 ± 0.062
LysCys: 0.096 ± 0.01
LysAsp: 1.342 ± 0.04
LysGlu: 1.185 ± 0.046
LysPhe: 0.583 ± 0.029
LysGly: 1.759 ± 0.052
LysHis: 0.492 ± 0.021
LysIle: 1.024 ± 0.039
LysLys: 0.809 ± 0.034
LysLeu: 2.106 ± 0.053
LysMet: 0.496 ± 0.024
LysAsn: 0.545 ± 0.025
LysPro: 1.296 ± 0.038
LysGln: 0.633 ± 0.033
LysArg: 1.518 ± 0.042
LysSer: 1.217 ± 0.043
LysThr: 1.291 ± 0.042
LysVal: 2.085 ± 0.057
LysTrp: 0.25 ± 0.015
LysTyr: 0.558 ± 0.026
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 16.247 ± 0.159
LeuCys: 0.687 ± 0.029
LeuAsp: 6.103 ± 0.077
LeuGlu: 6.446 ± 0.091
LeuPhe: 2.851 ± 0.058
LeuGly: 9.934 ± 0.105
LeuHis: 2.076 ± 0.043
LeuIle: 3.264 ± 0.074
LeuLys: 2.22 ± 0.049
LeuLeu: 10.919 ± 0.137
LeuMet: 1.829 ± 0.044
LeuAsn: 2.094 ± 0.049
LeuPro: 6.261 ± 0.099
LeuGln: 2.63 ± 0.053
LeuArg: 7.843 ± 0.122
LeuSer: 5.646 ± 0.087
LeuThr: 5.135 ± 0.084
LeuVal: 8.974 ± 0.113
LeuTrp: 1.257 ± 0.039
LeuTyr: 1.65 ± 0.04
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.735 ± 0.058
MetCys: 0.131 ± 0.012
MetAsp: 1.025 ± 0.037
MetGlu: 0.995 ± 0.032
MetPhe: 0.549 ± 0.023
MetGly: 1.672 ± 0.051
MetHis: 0.383 ± 0.019
MetIle: 0.671 ± 0.029
MetLys: 0.515 ± 0.025
MetLeu: 1.912 ± 0.047
MetMet: 0.363 ± 0.023
MetAsn: 0.518 ± 0.021
MetPro: 1.145 ± 0.033
MetGln: 0.468 ± 0.022
MetArg: 1.249 ± 0.036
MetSer: 1.322 ± 0.039
MetThr: 1.299 ± 0.036
MetVal: 1.593 ± 0.043
MetTrp: 0.178 ± 0.013
MetTyr: 0.281 ± 0.016
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.685 ± 0.054
AsnCys: 0.152 ± 0.013
AsnAsp: 0.995 ± 0.035
AsnGlu: 0.981 ± 0.034
AsnPhe: 0.69 ± 0.033
AsnGly: 2.1 ± 0.054
AsnHis: 0.482 ± 0.023
AsnIle: 0.889 ± 0.033
AsnLys: 0.471 ± 0.025
AsnLeu: 2.102 ± 0.046
AsnMet: 0.413 ± 0.019
AsnAsn: 0.548 ± 0.026
AsnPro: 1.734 ± 0.043
AsnGln: 0.614 ± 0.03
AsnArg: 1.525 ± 0.043
AsnSer: 1.011 ± 0.029
AsnThr: 1.247 ± 0.044
AsnVal: 1.704 ± 0.046
AsnTrp: 0.315 ± 0.019
AsnTyr: 0.461 ± 0.021
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.475 ± 0.149
ProCys: 0.264 ± 0.018
ProAsp: 3.644 ± 0.069
ProGlu: 4.706 ± 0.071
ProPhe: 1.654 ± 0.049
ProGly: 6.673 ± 0.101
ProHis: 1.185 ± 0.038
ProIle: 1.476 ± 0.044
ProLys: 1.159 ± 0.039
ProLeu: 5.374 ± 0.084
ProMet: 1.074 ± 0.029
ProAsn: 1.079 ± 0.038
ProPro: 2.71 ± 0.068
ProGln: 1.865 ± 0.046
ProArg: 3.579 ± 0.081
ProSer: 3.057 ± 0.066
ProThr: 2.695 ± 0.052
ProVal: 5.2 ± 0.073
ProTrp: 0.837 ± 0.032
ProTyr: 1.071 ± 0.032
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.275 ± 0.071
GlnCys: 0.13 ± 0.013
GlnAsp: 1.501 ± 0.039
GlnGlu: 1.686 ± 0.046
GlnPhe: 0.748 ± 0.029
GlnGly: 2.372 ± 0.046
GlnHis: 0.627 ± 0.024
GlnIle: 1.141 ± 0.037
GlnLys: 0.68 ± 0.029
GlnLeu: 3.271 ± 0.06
GlnMet: 0.536 ± 0.024
GlnAsn: 0.599 ± 0.026
GlnPro: 1.717 ± 0.043
GlnGln: 1.095 ± 0.041
GlnArg: 2.325 ± 0.048
GlnSer: 1.261 ± 0.038
GlnThr: 1.351 ± 0.041
GlnVal: 2.459 ± 0.05
GlnTrp: 0.449 ± 0.023
GlnTyr: 0.574 ± 0.027
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 8.959 ± 0.117
ArgCys: 0.449 ± 0.024
ArgAsp: 3.69 ± 0.08
ArgGlu: 4.452 ± 0.085
ArgPhe: 2.438 ± 0.05
ArgGly: 5.681 ± 0.082
ArgHis: 1.858 ± 0.047
ArgIle: 3.545 ± 0.06
ArgLys: 1.698 ± 0.043
ArgLeu: 7.924 ± 0.11
ArgMet: 1.707 ± 0.041
ArgAsn: 1.659 ± 0.041
ArgPro: 4.218 ± 0.078
ArgGln: 2.422 ± 0.055
ArgArg: 6.74 ± 0.107
ArgSer: 3.738 ± 0.066
ArgThr: 4.5 ± 0.069
ArgVal: 5.27 ± 0.078
ArgTrp: 1.165 ± 0.04
ArgTyr: 1.544 ± 0.038
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.146 ± 0.102
SerCys: 0.354 ± 0.021
SerAsp: 2.178 ± 0.048
SerGlu: 2.465 ± 0.059
SerPhe: 1.695 ± 0.04
SerGly: 6.031 ± 0.093
SerHis: 1.038 ± 0.032
SerIle: 2.011 ± 0.051
SerLys: 1.098 ± 0.041
SerLeu: 5.265 ± 0.076
SerMet: 1.199 ± 0.034
SerAsn: 1.174 ± 0.033
SerPro: 3.216 ± 0.054
SerGln: 1.347 ± 0.041
SerArg: 3.76 ± 0.057
SerSer: 3.037 ± 0.07
SerThr: 3.201 ± 0.062
SerVal: 4.216 ± 0.072
SerTrp: 0.75 ± 0.028
SerTyr: 1.216 ± 0.035
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 7.767 ± 0.089
ThrCys: 0.304 ± 0.017
ThrAsp: 2.891 ± 0.066
ThrGlu: 2.852 ± 0.06
ThrPhe: 1.627 ± 0.043
ThrGly: 6.166 ± 0.074
ThrHis: 1.133 ± 0.032
ThrIle: 2.112 ± 0.055
ThrLys: 1.063 ± 0.038
ThrLeu: 5.538 ± 0.072
ThrMet: 0.928 ± 0.034
ThrAsn: 1.126 ± 0.037
ThrPro: 3.541 ± 0.068
ThrGln: 1.32 ± 0.038
ThrArg: 3.344 ± 0.065
ThrSer: 2.845 ± 0.068
ThrThr: 3.196 ± 0.067
ThrVal: 5.49 ± 0.079
ThrTrp: 0.651 ± 0.03
ThrTyr: 1.196 ± 0.038
ThrXaa: 0.0 ± 0.0
Val
ValAla: 11.031 ± 0.131
ValCys: 0.595 ± 0.025
ValAsp: 4.802 ± 0.078
ValGlu: 4.987 ± 0.079
ValPhe: 2.656 ± 0.052
ValGly: 6.686 ± 0.104
ValHis: 1.751 ± 0.048
ValIle: 2.968 ± 0.067
ValLys: 1.734 ± 0.05
ValLeu: 9.801 ± 0.129
ValMet: 1.529 ± 0.04
ValAsn: 1.83 ± 0.044
ValPro: 5.474 ± 0.082
ValGln: 2.189 ± 0.042
ValArg: 6.243 ± 0.085
ValSer: 4.546 ± 0.07
ValThr: 4.28 ± 0.083
ValVal: 8.239 ± 0.113
ValTrp: 1.008 ± 0.031
ValTyr: 1.424 ± 0.042
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.592 ± 0.045
TrpCys: 0.099 ± 0.01
TrpAsp: 0.713 ± 0.028
TrpGlu: 0.712 ± 0.027
TrpPhe: 0.517 ± 0.026
TrpGly: 1.026 ± 0.039
TrpHis: 0.323 ± 0.019
TrpIle: 0.639 ± 0.031
TrpLys: 0.361 ± 0.02
TrpLeu: 1.668 ± 0.051
TrpMet: 0.349 ± 0.02
TrpAsn: 0.374 ± 0.021
TrpPro: 0.664 ± 0.026
TrpGln: 0.483 ± 0.026
TrpArg: 1.113 ± 0.039
TrpSer: 0.7 ± 0.03
TrpThr: 0.873 ± 0.031
TrpVal: 1.09 ± 0.035
TrpTrp: 0.279 ± 0.017
TrpTyr: 0.272 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.518 ± 0.055
TyrCys: 0.184 ± 0.013
TyrAsp: 1.153 ± 0.037
TyrGlu: 0.993 ± 0.035
TyrPhe: 0.689 ± 0.028
TyrGly: 2.047 ± 0.049
TyrHis: 0.355 ± 0.021
TyrIle: 0.652 ± 0.031
TyrLys: 0.408 ± 0.023
TyrLeu: 2.108 ± 0.05
TyrMet: 0.283 ± 0.015
TyrAsn: 0.461 ± 0.022
TyrPro: 1.05 ± 0.033
TyrGln: 0.593 ± 0.024
TyrArg: 1.719 ± 0.046
TyrSer: 1.063 ± 0.031
TyrThr: 1.032 ± 0.036
TyrVal: 1.548 ± 0.036
TyrTrp: 0.303 ± 0.017
TyrTyr: 0.46 ± 0.023
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2962 proteins (963181 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
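A protein-level bootstrap can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicate values serves as the error. This is a minimal sketch under stated assumptions. The page does not say which spread measure is reported, so the use of the sample standard deviation is an assumption, and the function names, the fixed seed and the toy sequences are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping dipeptides within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread (in per mille) of one dipeptide's frequency over replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(c[pair] for c in sample)
        total = sum(sum(c.values()) for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the sample standard deviation is used as the error.
    return stdev(estimates)


# Toy sequences, not real proteome data.
print(bootstrap_error(["MAAG", "AAL", "GGAA", "MKL"], "AA"))
```

Resampling whole proteins rather than single dipeptides keeps residues from the same protein together, so the estimate reflects variation between proteins.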

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski