Amino acid dipeptide frequency for Azospirillum sp. CAG:260

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
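The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's own pipeline: it assumes one-letter residue codes, overlapping pairs counted within each protein (never across protein boundaries), and any non-standard letter folded into X (Xaa). The names `dipeptide_permille`, `representation` and `UNIFORM_EXPECTATION` are hypothetical.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = AMINO_ACIDS + "X"          # X stands for Xaa (assumption: all other letters)

# Uniform expectation for the 400 standard dipeptides: 0.25% = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs inside one protein (assumed convention).
        for a, b in zip(seq, seq[1:]):
            a = a if a in AMINO_ACIDS else "X"
            b = b if b in AMINO_ACIDS else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def representation(value_permille, expected=UNIFORM_EXPECTATION):
    """Label a value relative to the uniform expectation (red/blue in the table)."""
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


# Toy input, not real proteome data.
freqs = dipeptide_permille(["MKAAL", "AAGK"])
```

With the values from the table, `representation(9.607)` (AlaAla) gives "over" and `representation(0.224)` (CysCys) gives "under".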

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
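As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical and used only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille / 1000.0


def permille_to_percent(value_permille):
    """Convert a per mille value to percent (1 percent = 10 per mille)."""
    return value_permille / 10.0


# AlaAla is listed as 9.607 per mille in the table.
ala_ala_fraction = permille_to_fraction(9.607)
ala_ala_percent = permille_to_percent(9.607)
```

On the same scale, the uniform expectation of 0.25% corresponds to 2.5‰.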

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
9.607AlaAla: 9.607 ± 0.199
1.025AlaCys: 1.025 ± 0.054
5.481AlaAsp: 5.481 ± 0.104
6.903AlaGlu: 6.903 ± 0.151
3.348AlaPhe: 3.348 ± 0.077
6.178AlaGly: 6.178 ± 0.124
1.205AlaHis: 1.205 ± 0.042
4.456AlaIle: 4.456 ± 0.108
5.604AlaLys: 5.604 ± 0.118
8.117AlaLeu: 8.117 ± 0.143
2.09AlaMet: 2.09 ± 0.064
3.281AlaAsn: 3.281 ± 0.103
2.999AlaPro: 2.999 ± 0.083
3.118AlaGln: 3.118 ± 0.088
3.955AlaArg: 3.955 ± 0.081
4.397AlaSer: 4.397 ± 0.105
3.226AlaThr: 3.226 ± 0.093
6.931AlaVal: 6.931 ± 0.138
0.805AlaTrp: 0.805 ± 0.04
2.978AlaTyr: 2.978 ± 0.087
0.0AlaXaa: 0.0 ± 0.0
Cys
0.865CysAla: 0.865 ± 0.041
0.224CysCys: 0.224 ± 0.019
0.717CysAsp: 0.717 ± 0.035
0.714CysGlu: 0.714 ± 0.039
0.609CysPhe: 0.609 ± 0.034
1.201CysGly: 1.201 ± 0.05
0.241CysHis: 0.241 ± 0.021
0.687CysIle: 0.687 ± 0.042
0.645CysLys: 0.645 ± 0.039
1.228CysLeu: 1.228 ± 0.051
0.182CysMet: 0.182 ± 0.019
0.419CysAsn: 0.419 ± 0.03
0.7CysPro: 0.7 ± 0.046
0.328CysGln: 0.328 ± 0.027
0.911CysArg: 0.911 ± 0.045
0.791CysSer: 0.791 ± 0.038
0.516CysThr: 0.516 ± 0.041
0.734CysVal: 0.734 ± 0.04
0.161CysTrp: 0.161 ± 0.019
0.471CysTyr: 0.471 ± 0.031
0.0CysXaa: 0.0 ± 0.0
Asp
4.058AspAla: 4.058 ± 0.102
0.831AspCys: 0.831 ± 0.044
3.175AspAsp: 3.175 ± 0.095
4.339AspGlu: 4.339 ± 0.112
3.095AspPhe: 3.095 ± 0.087
4.223AspGly: 4.223 ± 0.113
0.685AspHis: 0.685 ± 0.035
4.479AspIle: 4.479 ± 0.098
4.416AspLys: 4.416 ± 0.101
4.895AspLeu: 4.895 ± 0.103
1.645AspMet: 1.645 ± 0.059
3.204AspAsn: 3.204 ± 0.083
1.645AspPro: 1.645 ± 0.049
1.087AspGln: 1.087 ± 0.048
2.429AspArg: 2.429 ± 0.068
2.758AspSer: 2.758 ± 0.072
2.441AspThr: 2.441 ± 0.056
3.676AspVal: 3.676 ± 0.098
0.807AspTrp: 0.807 ± 0.035
2.555AspTyr: 2.555 ± 0.074
0.0AspXaa: 0.0 ± 0.0
Glu
5.965GluAla: 5.965 ± 0.131
0.548GluCys: 0.548 ± 0.029
3.879GluAsp: 3.879 ± 0.093
5.305GluGlu: 5.305 ± 0.16
2.591GluPhe: 2.591 ± 0.071
3.517GluGly: 3.517 ± 0.093
1.171GluHis: 1.171 ± 0.05
5.259GluIle: 5.259 ± 0.102
5.908GluLys: 5.908 ± 0.128
6.176GluLeu: 6.176 ± 0.127
1.888GluMet: 1.888 ± 0.067
4.225GluAsn: 4.225 ± 0.089
2.048GluPro: 2.048 ± 0.074
2.909GluGln: 2.909 ± 0.097
3.338GluArg: 3.338 ± 0.091
2.617GluSer: 2.617 ± 0.075
3.917GluThr: 3.917 ± 0.104
4.134GluVal: 4.134 ± 0.097
0.547GluTrp: 0.547 ± 0.037
2.209GluTyr: 2.209 ± 0.068
0.0GluXaa: 0.0 ± 0.0
Phe
3.64PheAla: 3.64 ± 0.097
0.678PheCys: 0.678 ± 0.038
2.849PheAsp: 2.849 ± 0.082
2.676PheGlu: 2.676 ± 0.08
1.943PhePhe: 1.943 ± 0.077
3.069PheGly: 3.069 ± 0.08
0.623PheHis: 0.623 ± 0.037
2.885PheIle: 2.885 ± 0.086
2.513PheLys: 2.513 ± 0.07
3.896PheLeu: 3.896 ± 0.092
1.245PheMet: 1.245 ± 0.055
2.249PheAsn: 2.249 ± 0.072
1.533PhePro: 1.533 ± 0.056
1.137PheGln: 1.137 ± 0.048
1.877PheArg: 1.877 ± 0.059
3.05PheSer: 3.05 ± 0.074
2.054PheThr: 2.054 ± 0.069
2.843PheVal: 2.843 ± 0.084
0.615PheTrp: 0.615 ± 0.038
1.883PheTyr: 1.883 ± 0.062
0.0PheXaa: 0.0 ± 0.0
Gly
4.955GlyAla: 4.955 ± 0.125
1.019GlyCys: 1.019 ± 0.048
3.555GlyAsp: 3.555 ± 0.091
4.316GlyGlu: 4.316 ± 0.095
2.991GlyPhe: 2.991 ± 0.09
4.796GlyGly: 4.796 ± 0.137
1.228GlyHis: 1.228 ± 0.053
4.845GlyIle: 4.845 ± 0.1
5.149GlyLys: 5.149 ± 0.102
6.278GlyLeu: 6.278 ± 0.13
1.871GlyMet: 1.871 ± 0.085
3.12GlyAsn: 3.12 ± 0.12
1.349GlyPro: 1.349 ± 0.062
2.215GlyGln: 2.215 ± 0.066
3.469GlyArg: 3.469 ± 0.094
4.024GlySer: 4.024 ± 0.126
3.576GlyThr: 3.576 ± 0.152
4.299GlyVal: 4.299 ± 0.109
0.871GlyTrp: 0.871 ± 0.047
2.756GlyTyr: 2.756 ± 0.091
0.0GlyXaa: 0.0 ± 0.0
His
1.127HisAla: 1.127 ± 0.05
0.26HisCys: 0.26 ± 0.022
0.86HisAsp: 0.86 ± 0.042
0.979HisGlu: 0.979 ± 0.045
0.81HisPhe: 0.81 ± 0.04
1.15HisGly: 1.15 ± 0.047
0.374HisHis: 0.374 ± 0.03
1.182HisIle: 1.182 ± 0.049
1.114HisLys: 1.114 ± 0.048
1.716HisLeu: 1.716 ± 0.065
0.266HisMet: 0.266 ± 0.022
0.845HisAsn: 0.845 ± 0.044
0.977HisPro: 0.977 ± 0.048
0.607HisGln: 0.607 ± 0.037
0.827HisArg: 0.827 ± 0.04
0.826HisSer: 0.826 ± 0.041
0.672HisThr: 0.672 ± 0.038
0.801HisVal: 0.801 ± 0.039
0.218HisTrp: 0.218 ± 0.019
0.662HisTyr: 0.662 ± 0.032
0.0HisXaa: 0.0 ± 0.0
Ile
5.904IleAla: 5.904 ± 0.093
0.934IleCys: 0.934 ± 0.041
4.094IleAsp: 4.094 ± 0.1
4.371IleGlu: 4.371 ± 0.097
2.811IlePhe: 2.811 ± 0.072
4.475IleGly: 4.475 ± 0.158
1.012IleHis: 1.012 ± 0.04
4.807IleIle: 4.807 ± 0.12
4.699IleLys: 4.699 ± 0.101
5.975IleLeu: 5.975 ± 0.115
1.649IleMet: 1.649 ± 0.054
3.443IleAsn: 3.443 ± 0.108
2.794IlePro: 2.794 ± 0.071
1.767IleGln: 1.767 ± 0.06
3.441IleArg: 3.441 ± 0.084
4.591IleSer: 4.591 ± 0.098
3.498IleThr: 3.498 ± 0.114
4.263IleVal: 4.263 ± 0.1
0.672IleTrp: 0.672 ± 0.035
2.353IleTyr: 2.353 ± 0.073
0.0IleXaa: 0.0 ± 0.0
Lys
5.893LysAla: 5.893 ± 0.118
0.583LysCys: 0.583 ± 0.035
3.855LysAsp: 3.855 ± 0.076
5.119LysGlu: 5.119 ± 0.121
2.553LysPhe: 2.553 ± 0.063
3.794LysGly: 3.794 ± 0.099
1.103LysHis: 1.103 ± 0.053
5.648LysIle: 5.648 ± 0.109
5.625LysLys: 5.625 ± 0.123
6.612LysLeu: 6.612 ± 0.131
2.008LysMet: 2.008 ± 0.058
3.938LysAsn: 3.938 ± 0.105
2.49LysPro: 2.49 ± 0.081
2.488LysGln: 2.488 ± 0.071
3.179LysArg: 3.179 ± 0.074
3.642LysSer: 3.642 ± 0.089
3.997LysThr: 3.997 ± 0.078
4.496LysVal: 4.496 ± 0.091
0.579LysTrp: 0.579 ± 0.034
2.5LysTyr: 2.5 ± 0.074
0.0LysXaa: 0.0 ± 0.0
Leu
8.326LeuAla: 8.326 ± 0.134
1.156LeuCys: 1.156 ± 0.054
5.109LeuAsp: 5.109 ± 0.102
5.705LeuGlu: 5.705 ± 0.127
4.12LeuPhe: 4.12 ± 0.095
6.157LeuGly: 6.157 ± 0.113
1.558LeuHis: 1.558 ± 0.066
5.728LeuIle: 5.728 ± 0.115
6.682LeuLys: 6.682 ± 0.126
9.359LeuLeu: 9.359 ± 0.176
2.232LeuMet: 2.232 ± 0.07
4.815LeuAsn: 4.815 ± 0.093
4.589LeuPro: 4.589 ± 0.104
3.46LeuGln: 3.46 ± 0.088
4.722LeuArg: 4.722 ± 0.109
6.694LeuSer: 6.694 ± 0.133
5.28LeuThr: 5.28 ± 0.1
5.373LeuVal: 5.373 ± 0.114
1.046LeuTrp: 1.046 ± 0.053
2.946LeuTyr: 2.946 ± 0.07
0.0LeuXaa: 0.0 ± 0.0
Met
2.251MetAla: 2.251 ± 0.061
0.213MetCys: 0.213 ± 0.02
1.296MetAsp: 1.296 ± 0.048
1.568MetGlu: 1.568 ± 0.06
1.051MetPhe: 1.051 ± 0.047
1.484MetGly: 1.484 ± 0.061
0.345MetHis: 0.345 ± 0.027
1.609MetIle: 1.609 ± 0.057
1.962MetLys: 1.962 ± 0.068
2.56MetLeu: 2.56 ± 0.071
0.697MetMet: 0.697 ± 0.035
1.251MetAsn: 1.251 ± 0.047
1.287MetPro: 1.287 ± 0.051
1.008MetGln: 1.008 ± 0.045
1.182MetArg: 1.182 ± 0.046
1.75MetSer: 1.75 ± 0.05
1.484MetThr: 1.484 ± 0.056
1.573MetVal: 1.573 ± 0.058
0.148MetTrp: 0.148 ± 0.016
0.683MetTyr: 0.683 ± 0.037
0.0MetXaa: 0.0 ± 0.0
Asn
3.951AsnAla: 3.951 ± 0.091
0.604AsnCys: 0.604 ± 0.037
2.5AsnAsp: 2.5 ± 0.066
2.856AsnGlu: 2.856 ± 0.077
2.26AsnPhe: 2.26 ± 0.068
3.633AsnGly: 3.633 ± 0.127
0.81AsnHis: 0.81 ± 0.04
3.583AsnIle: 3.583 ± 0.098
3.445AsnLys: 3.445 ± 0.08
4.796AsnLeu: 4.796 ± 0.098
1.266AsnMet: 1.266 ± 0.046
2.765AsnAsn: 2.765 ± 0.111
2.513AsnPro: 2.513 ± 0.073
1.771AsnGln: 1.771 ± 0.061
2.731AsnArg: 2.731 ± 0.076
2.731AsnSer: 2.731 ± 0.086
2.069AsnThr: 2.069 ± 0.078
3.023AsnVal: 3.023 ± 0.099
0.562AsnTrp: 0.562 ± 0.028
1.942AsnTyr: 1.942 ± 0.068
0.0AsnXaa: 0.0 ± 0.0
Pro
3.722ProAla: 3.722 ± 0.111
0.393ProCys: 0.393 ± 0.026
2.581ProAsp: 2.581 ± 0.077
3.902ProGlu: 3.902 ± 0.1
1.625ProPhe: 1.625 ± 0.055
2.317ProGly: 2.317 ± 0.072
0.714ProHis: 0.714 ± 0.036
1.868ProIle: 1.868 ± 0.06
2.139ProLys: 2.139 ± 0.069
3.61ProLeu: 3.61 ± 0.085
0.772ProMet: 0.772 ± 0.039
1.528ProAsn: 1.528 ± 0.052
1.201ProPro: 1.201 ± 0.052
1.978ProGln: 1.978 ± 0.071
1.552ProArg: 1.552 ± 0.061
2.097ProSer: 2.097 ± 0.069
1.594ProThr: 1.594 ± 0.055
3.209ProVal: 3.209 ± 0.082
0.391ProTrp: 0.391 ± 0.028
1.497ProTyr: 1.497 ± 0.049
0.0ProXaa: 0.0 ± 0.0
Gln
3.179GlnAla: 3.179 ± 0.089
0.298GlnCys: 0.298 ± 0.025
1.742GlnAsp: 1.742 ± 0.052
2.412GlnGlu: 2.412 ± 0.071
1.137GlnPhe: 1.137 ± 0.053
2.057GlnGly: 2.057 ± 0.068
0.537GlnHis: 0.537 ± 0.029
2.537GlnIle: 2.537 ± 0.067
2.919GlnLys: 2.919 ± 0.079
2.818GlnLeu: 2.818 ± 0.086
1.063GlnMet: 1.063 ± 0.047
2.044GlnAsn: 2.044 ± 0.064
1.433GlnPro: 1.433 ± 0.06
1.592GlnGln: 1.592 ± 0.068
1.636GlnArg: 1.636 ± 0.058
1.9GlnSer: 1.9 ± 0.069
2.11GlnThr: 2.11 ± 0.062
2.078GlnVal: 2.078 ± 0.069
0.249GlnTrp: 0.249 ± 0.021
1.122GlnTyr: 1.122 ± 0.051
0.0GlnXaa: 0.0 ± 0.0
Arg
3.502ArgAla: 3.502 ± 0.089
0.596ArgCys: 0.596 ± 0.035
2.507ArgAsp: 2.507 ± 0.074
3.653ArgGlu: 3.653 ± 0.093
2.281ArgPhe: 2.281 ± 0.058
2.742ArgGly: 2.742 ± 0.075
1.029ArgHis: 1.029 ± 0.047
3.534ArgIle: 3.534 ± 0.083
3.578ArgLys: 3.578 ± 0.089
5.193ArgLeu: 5.193 ± 0.107
1.329ArgMet: 1.329 ± 0.047
2.367ArgAsn: 2.367 ± 0.076
1.902ArgPro: 1.902 ± 0.065
2.393ArgGln: 2.393 ± 0.075
3.521ArgArg: 3.521 ± 0.093
2.545ArgSer: 2.545 ± 0.069
2.109ArgThr: 2.109 ± 0.057
2.733ArgVal: 2.733 ± 0.082
0.493ArgTrp: 0.493 ± 0.032
1.864ArgTyr: 1.864 ± 0.057
0.0ArgXaa: 0.0 ± 0.0
Ser
4.917SerAla: 4.917 ± 0.1
0.809SerCys: 0.809 ± 0.046
3.251SerAsp: 3.251 ± 0.079
3.557SerGlu: 3.557 ± 0.084
2.744SerPhe: 2.744 ± 0.08
4.857SerGly: 4.857 ± 0.123
0.892SerHis: 0.892 ± 0.044
3.452SerIle: 3.452 ± 0.103
3.405SerLys: 3.405 ± 0.085
6.181SerLeu: 6.181 ± 0.13
1.334SerMet: 1.334 ± 0.057
2.441SerAsn: 2.441 ± 0.09
2.24SerPro: 2.24 ± 0.07
1.852SerGln: 1.852 ± 0.058
3.28SerArg: 3.28 ± 0.079
3.86SerSer: 3.86 ± 0.108
2.477SerThr: 2.477 ± 0.085
4.054SerVal: 4.054 ± 0.101
0.623SerTrp: 0.623 ± 0.036
2.198SerTyr: 2.198 ± 0.077
0.0SerXaa: 0.0 ± 0.0
Thr
5.024ThrAla: 5.024 ± 0.105
0.571ThrCys: 0.571 ± 0.036
2.742ThrAsp: 2.742 ± 0.068
2.904ThrGlu: 2.904 ± 0.085
2.15ThrPhe: 2.15 ± 0.069
3.826ThrGly: 3.826 ± 0.113
0.748ThrHis: 0.748 ± 0.04
3.202ThrIle: 3.202 ± 0.112
2.536ThrLys: 2.536 ± 0.073
4.614ThrLeu: 4.614 ± 0.094
1.055ThrMet: 1.055 ± 0.046
2.044ThrAsn: 2.044 ± 0.085
2.851ThrPro: 2.851 ± 0.074
1.372ThrGln: 1.372 ± 0.046
2.057ThrArg: 2.057 ± 0.066
2.765ThrSer: 2.765 ± 0.075
2.374ThrThr: 2.374 ± 0.076
3.927ThrVal: 3.927 ± 0.089
0.418ThrTrp: 0.418 ± 0.027
1.626ThrTyr: 1.626 ± 0.053
0.0ThrXaa: 0.0 ± 0.0
Val
5.103ValAla: 5.103 ± 0.106
0.92ValCys: 0.92 ± 0.041
3.513ValAsp: 3.513 ± 0.088
4.149ValGlu: 4.149 ± 0.115
2.961ValPhe: 2.961 ± 0.082
3.976ValGly: 3.976 ± 0.101
0.989ValHis: 0.989 ± 0.047
4.857ValIle: 4.857 ± 0.118
4.771ValLys: 4.771 ± 0.101
6.174ValLeu: 6.174 ± 0.117
1.649ValMet: 1.649 ± 0.06
3.283ValAsn: 3.283 ± 0.086
2.49ValPro: 2.49 ± 0.074
1.763ValGln: 1.763 ± 0.064
3.202ValArg: 3.202 ± 0.072
4.745ValSer: 4.745 ± 0.11
3.242ValThr: 3.242 ± 0.079
4.551ValVal: 4.551 ± 0.126
0.772ValTrp: 0.772 ± 0.046
2.386ValTyr: 2.386 ± 0.074
0.0ValXaa: 0.0 ± 0.0
Trp
0.742TrpAla: 0.742 ± 0.039
0.146TrpCys: 0.146 ± 0.016
0.52TrpAsp: 0.52 ± 0.029
0.594TrpGlu: 0.594 ± 0.039
0.52TrpPhe: 0.52 ± 0.033
0.757TrpGly: 0.757 ± 0.042
0.252TrpHis: 0.252 ± 0.024
0.645TrpIle: 0.645 ± 0.032
0.651TrpLys: 0.651 ± 0.038
1.279TrpLeu: 1.279 ± 0.059
0.277TrpMet: 0.277 ± 0.025
0.541TrpAsn: 0.541 ± 0.032
0.294TrpPro: 0.294 ± 0.022
0.562TrpGln: 0.562 ± 0.037
0.566TrpArg: 0.566 ± 0.035
0.662TrpSer: 0.662 ± 0.032
0.446TrpThr: 0.446 ± 0.034
0.596TrpVal: 0.596 ± 0.03
0.137TrpTrp: 0.137 ± 0.013
0.376TrpTyr: 0.376 ± 0.026
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.972TyrAla: 2.972 ± 0.076
0.541TyrCys: 0.541 ± 0.039
2.217TyrAsp: 2.217 ± 0.057
2.131TyrGlu: 2.131 ± 0.064
1.655TyrPhe: 1.655 ± 0.059
2.473TyrGly: 2.473 ± 0.073
0.759TyrHis: 0.759 ± 0.044
2.287TyrIle: 2.287 ± 0.068
2.338TyrLys: 2.338 ± 0.068
3.581TyrLeu: 3.581 ± 0.091
0.883TyrMet: 0.883 ± 0.042
2.008TyrAsn: 2.008 ± 0.069
1.355TyrPro: 1.355 ± 0.056
1.456TyrGln: 1.456 ± 0.057
1.993TyrArg: 1.993 ± 0.065
1.978TyrSer: 1.978 ± 0.073
1.725TyrThr: 1.725 ± 0.063
2.202TyrVal: 2.202 ± 0.065
0.452TyrTrp: 0.452 ± 0.033
1.499TyrTyr: 1.499 ± 0.061
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1762 proteins (526901 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
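A protein-level bootstrap of this kind might look like the sketch below. It rests on several assumptions that the page does not confirm: whole proteins are resampled with replacement, 100 replicates are drawn, and the reported error is the standard deviation of the replicate frequencies. The function names `pair_counts` and `bootstrap_error` are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs within one protein."""
    return Counter(a + b for a, b in zip(seq, seq[1:]))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so residues
    from the same protein always stay together.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input, not real proteome data.
mean_aa, sd_aa = bootstrap_error(["MKAAL", "AAGK", "GAAK"], "AA")
```

Resampling proteins rather than individual dipeptides keeps the dependence between neighbouring residues of one protein intact, which is presumably why the estimate is made at the protein level.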

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski