Amino acid dipeptide frequency for Anaerorhabdus furcosa

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
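The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's actual pipeline: the function names `dipeptide_permille` and `mark`, the use of overlapping pairs within each protein, the mapping of every non-standard letter to `X`, and the toy sequences are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# The 20 standard residues in one-letter code; anything else becomes "X" (Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform baseline from the text: 400 dipeptides, each at 1/400 = 0.25% = 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein (assumption); pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}


def mark(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency relative to the uniform expectation (red/blue in the table)."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


toy_proteins = ["MKLLA", "MKAX"]  # hypothetical toy input, not real proteome data
freqs = dipeptide_permille(toy_proteins)
```

With real data, `proteins` would be the list of sequences from the proteome, and `mark(freqs["LL"])` would indicate whether LeuLeu lies above or below the 2.5‰ baseline.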

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
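As a small worked example of the unit conversion, the sketch below turns two values from the table (LeuLeu and TrpTrp) into fractions and percentages. The helper names are hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value_permille):
    """Per mille -> plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille -> percent (divide by 10)."""
    return value_permille / 10.0


# Values copied from the table on this page.
leu_leu = 9.152  # LeuLeu, per mille
trp_trp = 0.072  # TrpTrp, per mille

leu_leu_fraction = permille_to_fraction(leu_leu)  # about 0.009152
leu_leu_percent = permille_to_percent(leu_leu)    # about 0.9152 %
trp_trp_percent = permille_to_percent(trp_trp)    # about 0.0072 %
```

On the percent scale, these numbers can be compared directly with the 0.25% expected for a uniformly random distribution of the 400 standard dipeptides.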

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰) ± estimated error, grouped by first residue. Within each group the second residue follows the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
  AlaAla: 3.493 ± 0.092
  AlaCys: 1.022 ± 0.043
  AlaAsp: 2.555 ± 0.071
  AlaGlu: 3.015 ± 0.077
  AlaPhe: 2.624 ± 0.063
  AlaGly: 3.616 ± 0.091
  AlaHis: 0.993 ± 0.043
  AlaIle: 6.009 ± 0.097
  AlaLys: 4.694 ± 0.097
  AlaLeu: 5.927 ± 0.093
  AlaMet: 1.896 ± 0.054
  AlaAsn: 2.994 ± 0.072
  AlaPro: 1.501 ± 0.05
  AlaGln: 1.741 ± 0.049
  AlaArg: 1.981 ± 0.056
  AlaSer: 3.714 ± 0.082
  AlaThr: 3.342 ± 0.087
  AlaVal: 3.776 ± 0.092
  AlaTrp: 0.456 ± 0.026
  AlaTyr: 2.348 ± 0.053
  AlaXaa: 0.0 ± 0.0
Cys
  CysAla: 0.866 ± 0.038
  CysCys: 0.234 ± 0.018
  CysAsp: 0.782 ± 0.037
  CysGlu: 0.815 ± 0.036
  CysPhe: 0.72 ± 0.033
  CysGly: 0.969 ± 0.037
  CysHis: 0.274 ± 0.02
  CysIle: 1.46 ± 0.049
  CysLys: 1.057 ± 0.044
  CysLeu: 1.349 ± 0.049
  CysMet: 0.388 ± 0.024
  CysAsn: 0.68 ± 0.034
  CysPro: 0.495 ± 0.028
  CysGln: 0.343 ± 0.022
  CysArg: 0.41 ± 0.025
  CysSer: 0.977 ± 0.039
  CysThr: 0.821 ± 0.038
  CysVal: 0.909 ± 0.038
  CysTrp: 0.093 ± 0.012
  CysTyr: 0.492 ± 0.024
  CysXaa: 0.0 ± 0.0
Asp
  AspAla: 3.304 ± 0.067
  AspCys: 0.821 ± 0.04
  AspAsp: 3.065 ± 0.069
  AspGlu: 4.941 ± 0.094
  AspPhe: 2.926 ± 0.075
  AspGly: 3.409 ± 0.08
  AspHis: 0.946 ± 0.037
  AspIle: 4.765 ± 0.089
  AspLys: 4.088 ± 0.079
  AspLeu: 5.242 ± 0.089
  AspMet: 1.472 ± 0.043
  AspAsn: 2.705 ± 0.071
  AspPro: 1.547 ± 0.048
  AspGln: 1.581 ± 0.042
  AspArg: 1.691 ± 0.053
  AspSer: 3.174 ± 0.071
  AspThr: 2.727 ± 0.059
  AspVal: 3.739 ± 0.079
  AspTrp: 0.419 ± 0.021
  AspTyr: 2.878 ± 0.063
  AspXaa: 0.0 ± 0.0
Glu
  GluAla: 4.07 ± 0.09
  GluCys: 0.909 ± 0.035
  GluAsp: 3.802 ± 0.077
  GluGlu: 6.039 ± 0.134
  GluPhe: 3.096 ± 0.072
  GluGly: 3.796 ± 0.082
  GluHis: 1.053 ± 0.043
  GluIle: 6.853 ± 0.114
  GluLys: 6.218 ± 0.13
  GluLeu: 6.812 ± 0.127
  GluMet: 2.24 ± 0.055
  GluAsn: 4.645 ± 0.106
  GluPro: 1.474 ± 0.042
  GluGln: 2.259 ± 0.063
  GluArg: 2.44 ± 0.071
  GluSer: 3.686 ± 0.08
  GluThr: 3.494 ± 0.081
  GluVal: 4.933 ± 0.076
  GluTrp: 0.535 ± 0.028
  GluTyr: 3.048 ± 0.076
  GluXaa: 0.0 ± 0.0
Phe
  PheAla: 2.653 ± 0.067
  PheCys: 0.586 ± 0.029
  PheAsp: 3.098 ± 0.062
  PheGlu: 3.196 ± 0.068
  PhePhe: 2.263 ± 0.082
  PheGly: 3.077 ± 0.085
  PheHis: 0.723 ± 0.033
  PheIle: 4.438 ± 0.107
  PheLys: 3.333 ± 0.07
  PheLeu: 4.129 ± 0.096
  PheMet: 1.31 ± 0.042
  PheAsn: 2.774 ± 0.064
  PhePro: 1.289 ± 0.048
  PheGln: 1.155 ± 0.038
  PheArg: 1.239 ± 0.035
  PheSer: 3.246 ± 0.079
  PheThr: 2.712 ± 0.062
  PheVal: 3.486 ± 0.078
  PheTrp: 0.332 ± 0.023
  PheTyr: 2.071 ± 0.065
  PheXaa: 0.0 ± 0.0
Gly
  GlyAla: 3.78 ± 0.094
  GlyCys: 1.064 ± 0.039
  GlyAsp: 2.915 ± 0.072
  GlyGlu: 3.478 ± 0.075
  GlyPhe: 3.211 ± 0.071
  GlyGly: 3.78 ± 0.089
  GlyHis: 1.028 ± 0.04
  GlyIle: 6.079 ± 0.106
  GlyLys: 4.471 ± 0.094
  GlyLeu: 5.68 ± 0.111
  GlyMet: 1.717 ± 0.054
  GlyAsn: 3.07 ± 0.095
  GlyPro: 1.114 ± 0.042
  GlyGln: 1.677 ± 0.052
  GlyArg: 1.836 ± 0.055
  GlySer: 3.541 ± 0.07
  GlyThr: 3.475 ± 0.082
  GlyVal: 4.413 ± 0.09
  GlyTrp: 0.565 ± 0.031
  GlyTyr: 3.163 ± 0.071
  GlyXaa: 0.0 ± 0.0
His
  HisAla: 0.963 ± 0.039
  HisCys: 0.332 ± 0.02
  HisAsp: 1.086 ± 0.04
  HisGlu: 1.185 ± 0.045
  HisPhe: 0.917 ± 0.038
  HisGly: 1.177 ± 0.045
  HisHis: 0.511 ± 0.026
  HisIle: 1.351 ± 0.041
  HisLys: 0.992 ± 0.04
  HisLeu: 1.664 ± 0.052
  HisMet: 0.398 ± 0.022
  HisAsn: 0.786 ± 0.031
  HisPro: 0.739 ± 0.033
  HisGln: 0.662 ± 0.033
  HisArg: 0.713 ± 0.033
  HisSer: 0.937 ± 0.038
  HisThr: 0.872 ± 0.036
  HisVal: 1.125 ± 0.037
  HisTrp: 0.138 ± 0.014
  HisTyr: 0.814 ± 0.034
  HisXaa: 0.0 ± 0.0
Ile
  IleAla: 5.925 ± 0.106
  IleCys: 1.278 ± 0.041
  IleAsp: 5.535 ± 0.088
  IleGlu: 7.218 ± 0.122
  IlePhe: 3.932 ± 0.086
  IleGly: 5.386 ± 0.095
  IleHis: 1.774 ± 0.054
  IleIle: 7.556 ± 0.142
  IleLys: 6.285 ± 0.104
  IleLeu: 8.73 ± 0.137
  IleMet: 2.132 ± 0.063
  IleAsn: 4.866 ± 0.099
  IlePro: 3.193 ± 0.08
  IleGln: 3.666 ± 0.071
  IleArg: 2.926 ± 0.067
  IleSer: 5.867 ± 0.108
  IleThr: 5.143 ± 0.094
  IleVal: 6.353 ± 0.105
  IleTrp: 0.521 ± 0.031
  IleTyr: 3.464 ± 0.081
  IleXaa: 0.0 ± 0.0
Lys
  LysAla: 3.898 ± 0.085
  LysCys: 0.811 ± 0.04
  LysAsp: 4.583 ± 0.082
  LysGlu: 7.315 ± 0.131
  LysPhe: 2.548 ± 0.06
  LysGly: 3.981 ± 0.067
  LysHis: 1.217 ± 0.044
  LysIle: 6.849 ± 0.097
  LysLys: 7.121 ± 0.14
  LysLeu: 6.234 ± 0.095
  LysMet: 2.733 ± 0.063
  LysAsn: 4.79 ± 0.091
  LysPro: 2.151 ± 0.058
  LysGln: 2.891 ± 0.073
  LysArg: 2.66 ± 0.063
  LysSer: 4.146 ± 0.069
  LysThr: 4.151 ± 0.072
  LysVal: 4.94 ± 0.089
  LysTrp: 0.569 ± 0.03
  LysTyr: 3.13 ± 0.073
  LysXaa: 0.0 ± 0.0
Leu
  LeuAla: 5.774 ± 0.102
  LeuCys: 1.357 ± 0.049
  LeuAsp: 5.387 ± 0.107
  LeuGlu: 6.277 ± 0.106
  LeuPhe: 4.886 ± 0.118
  LeuGly: 5.822 ± 0.098
  LeuHis: 1.437 ± 0.054
  LeuIle: 8.329 ± 0.122
  LeuLys: 6.671 ± 0.103
  LeuLeu: 9.152 ± 0.139
  LeuMet: 2.477 ± 0.055
  LeuAsn: 5.517 ± 0.081
  LeuPro: 3.141 ± 0.068
  LeuGln: 2.924 ± 0.066
  LeuArg: 3.096 ± 0.063
  LeuSer: 6.794 ± 0.106
  LeuThr: 4.967 ± 0.098
  LeuVal: 6.595 ± 0.114
  LeuTrp: 0.64 ± 0.03
  LeuTyr: 3.229 ± 0.071
  LeuXaa: 0.0 ± 0.0
Met
  MetAla: 1.597 ± 0.045
  MetCys: 0.272 ± 0.017
  MetAsp: 1.581 ± 0.056
  MetGlu: 1.741 ± 0.056
  MetPhe: 1.129 ± 0.04
  MetGly: 1.655 ± 0.054
  MetHis: 0.437 ± 0.023
  MetIle: 2.839 ± 0.07
  MetLys: 2.975 ± 0.072
  MetLeu: 2.338 ± 0.064
  MetMet: 1.06 ± 0.04
  MetAsn: 2.031 ± 0.054
  MetPro: 0.833 ± 0.034
  MetGln: 0.882 ± 0.033
  MetArg: 0.883 ± 0.032
  MetSer: 1.8 ± 0.054
  MetThr: 1.491 ± 0.041
  MetVal: 1.655 ± 0.048
  MetTrp: 0.16 ± 0.014
  MetTyr: 0.883 ± 0.032
  MetXaa: 0.0 ± 0.0
Asn
  AsnAla: 3.067 ± 0.071
  AsnCys: 0.73 ± 0.035
  AsnAsp: 3.229 ± 0.063
  AsnGlu: 4.567 ± 0.1
  AsnPhe: 2.383 ± 0.056
  AsnGly: 3.272 ± 0.078
  AsnHis: 1.275 ± 0.046
  AsnIle: 4.644 ± 0.081
  AsnLys: 4.485 ± 0.081
  AsnLeu: 4.866 ± 0.083
  AsnMet: 1.43 ± 0.041
  AsnAsn: 3.088 ± 0.087
  AsnPro: 2.211 ± 0.057
  AsnGln: 2.696 ± 0.075
  AsnArg: 2.009 ± 0.051
  AsnSer: 3.145 ± 0.077
  AsnThr: 2.835 ± 0.064
  AsnVal: 3.34 ± 0.078
  AsnTrp: 0.455 ± 0.028
  AsnTyr: 2.614 ± 0.073
  AsnXaa: 0.0 ± 0.0
Pro
  ProAla: 1.572 ± 0.053
  ProCys: 0.39 ± 0.024
  ProAsp: 1.368 ± 0.046
  ProGlu: 2.081 ± 0.053
  ProPhe: 1.521 ± 0.049
  ProGly: 1.472 ± 0.043
  ProHis: 0.539 ± 0.026
  ProIle: 2.794 ± 0.067
  ProLys: 1.966 ± 0.056
  ProLeu: 2.776 ± 0.066
  ProMet: 0.839 ± 0.034
  ProAsn: 1.8 ± 0.048
  ProPro: 0.473 ± 0.031
  ProGln: 0.89 ± 0.034
  ProArg: 0.768 ± 0.032
  ProSer: 1.916 ± 0.051
  ProThr: 2.168 ± 0.072
  ProVal: 2.187 ± 0.059
  ProTrp: 0.251 ± 0.018
  ProTyr: 1.375 ± 0.042
  ProXaa: 0.0 ± 0.0
Gln
  GlnAla: 1.905 ± 0.052
  GlnCys: 0.513 ± 0.027
  GlnAsp: 1.816 ± 0.061
  GlnGlu: 2.603 ± 0.072
  GlnPhe: 1.492 ± 0.047
  GlnGly: 2.01 ± 0.059
  GlnHis: 0.547 ± 0.027
  GlnIle: 2.893 ± 0.07
  GlnLys: 2.397 ± 0.071
  GlnLeu: 3.189 ± 0.08
  GlnMet: 0.98 ± 0.035
  GlnAsn: 1.799 ± 0.056
  GlnPro: 0.847 ± 0.034
  GlnGln: 1.098 ± 0.044
  GlnArg: 1.199 ± 0.042
  GlnSer: 1.963 ± 0.049
  GlnThr: 1.686 ± 0.057
  GlnVal: 2.142 ± 0.053
  GlnTrp: 0.278 ± 0.02
  GlnTyr: 1.472 ± 0.045
  GlnXaa: 0.0 ± 0.0
Arg
  ArgAla: 1.621 ± 0.045
  ArgCys: 0.419 ± 0.023
  ArgAsp: 1.705 ± 0.051
  ArgGlu: 2.303 ± 0.054
  ArgPhe: 1.565 ± 0.046
  ArgGly: 1.737 ± 0.055
  ArgHis: 0.568 ± 0.026
  ArgIle: 3.24 ± 0.073
  ArgLys: 2.852 ± 0.065
  ArgLeu: 3.114 ± 0.079
  ArgMet: 0.993 ± 0.036
  ArgAsn: 1.95 ± 0.056
  ArgPro: 0.832 ± 0.03
  ArgGln: 1.054 ± 0.04
  ArgArg: 1.273 ± 0.048
  ArgSer: 1.665 ± 0.049
  ArgThr: 1.577 ± 0.048
  ArgVal: 2.287 ± 0.055
  ArgTrp: 0.253 ± 0.017
  ArgTyr: 1.583 ± 0.049
  ArgXaa: 0.0 ± 0.0
Ser
  SerAla: 3.15 ± 0.069
  SerCys: 0.843 ± 0.04
  SerAsp: 3.156 ± 0.069
  SerGlu: 3.659 ± 0.086
  SerPhe: 3.431 ± 0.092
  SerGly: 3.912 ± 0.094
  SerHis: 1.027 ± 0.034
  SerIle: 6.073 ± 0.103
  SerLys: 5.35 ± 0.082
  SerLeu: 6.085 ± 0.09
  SerMet: 1.762 ± 0.052
  SerAsn: 3.666 ± 0.092
  SerPro: 1.541 ± 0.051
  SerGln: 1.839 ± 0.053
  SerArg: 1.871 ± 0.048
  SerSer: 4.209 ± 0.087
  SerThr: 3.617 ± 0.085
  SerVal: 3.76 ± 0.076
  SerTrp: 0.478 ± 0.026
  SerTyr: 2.711 ± 0.067
  SerXaa: 0.0 ± 0.0
Thr
  ThrAla: 2.892 ± 0.081
  ThrCys: 0.783 ± 0.037
  ThrAsp: 2.651 ± 0.063
  ThrGlu: 3.052 ± 0.067
  ThrPhe: 2.486 ± 0.062
  ThrGly: 3.54 ± 0.08
  ThrHis: 0.971 ± 0.038
  ThrIle: 5.502 ± 0.099
  ThrLys: 3.881 ± 0.068
  ThrLeu: 5.418 ± 0.086
  ThrMet: 1.48 ± 0.047
  ThrAsn: 2.943 ± 0.061
  ThrPro: 2.178 ± 0.076
  ThrGln: 1.675 ± 0.05
  ThrArg: 1.603 ± 0.052
  ThrSer: 3.675 ± 0.088
  ThrThr: 3.335 ± 0.1
  ThrVal: 3.945 ± 0.105
  ThrTrp: 0.423 ± 0.022
  ThrTyr: 2.359 ± 0.075
  ThrXaa: 0.0 ± 0.0
Val
  ValAla: 4.278 ± 0.079
  ValCys: 1.021 ± 0.038
  ValAsp: 4.156 ± 0.076
  ValGlu: 4.694 ± 0.082
  ValPhe: 3.145 ± 0.07
  ValGly: 4.391 ± 0.088
  ValHis: 0.982 ± 0.038
  ValIle: 6.019 ± 0.098
  ValLys: 4.616 ± 0.081
  ValLeu: 6.685 ± 0.103
  ValMet: 1.742 ± 0.05
  ValAsn: 3.53 ± 0.075
  ValPro: 1.896 ± 0.055
  ValGln: 1.998 ± 0.05
  ValArg: 2.079 ± 0.056
  ValSer: 4.525 ± 0.079
  ValThr: 3.684 ± 0.082
  ValVal: 5.215 ± 0.097
  ValTrp: 0.479 ± 0.026
  ValTyr: 2.611 ± 0.057
  ValXaa: 0.0 ± 0.0
Trp
  TrpAla: 0.419 ± 0.027
  TrpCys: 0.105 ± 0.011
  TrpAsp: 0.397 ± 0.023
  TrpGlu: 0.402 ± 0.022
  TrpPhe: 0.43 ± 0.026
  TrpGly: 0.492 ± 0.027
  TrpHis: 0.167 ± 0.016
  TrpIle: 0.848 ± 0.034
  TrpLys: 0.486 ± 0.027
  TrpLeu: 0.788 ± 0.038
  TrpMet: 0.261 ± 0.018
  TrpAsn: 0.448 ± 0.027
  TrpPro: 0.152 ± 0.015
  TrpGln: 0.242 ± 0.018
  TrpArg: 0.184 ± 0.016
  TrpSer: 0.408 ± 0.023
  TrpThr: 0.356 ± 0.021
  TrpVal: 0.506 ± 0.026
  TrpTrp: 0.072 ± 0.01
  TrpTyr: 0.312 ± 0.022
  TrpXaa: 0.0 ± 0.0
Tyr
  TyrAla: 2.469 ± 0.053
  TyrCys: 0.644 ± 0.031
  TyrAsp: 2.574 ± 0.086
  TyrGlu: 2.823 ± 0.065
  TyrPhe: 2.403 ± 0.063
  TyrGly: 2.569 ± 0.062
  TyrHis: 0.893 ± 0.037
  TyrIle: 3.207 ± 0.074
  TyrLys: 2.779 ± 0.066
  TyrLeu: 4.296 ± 0.087
  TyrMet: 0.931 ± 0.04
  TyrAsn: 2.256 ± 0.072
  TyrPro: 1.541 ± 0.047
  TyrGln: 1.559 ± 0.039
  TyrArg: 1.727 ± 0.053
  TyrSer: 2.748 ± 0.07
  TyrThr: 2.338 ± 0.066
  TyrVal: 2.443 ± 0.058
  TyrTrp: 0.358 ± 0.022
  TyrTyr: 2.008 ± 0.059
  TyrXaa: 0.0 ± 0.0
Xaa
  XaaAla: 0.0 ± 0.0
  XaaCys: 0.0 ± 0.0
  XaaAsp: 0.0 ± 0.0
  XaaGlu: 0.0 ± 0.0
  XaaPhe: 0.0 ± 0.0
  XaaGly: 0.0 ± 0.0
  XaaHis: 0.0 ± 0.0
  XaaIle: 0.0 ± 0.0
  XaaLys: 0.0 ± 0.0
  XaaLeu: 0.0 ± 0.0
  XaaMet: 0.0 ± 0.0
  XaaAsn: 0.0 ± 0.0
  XaaPro: 0.0 ± 0.0
  XaaGln: 0.0 ± 0.0
  XaaArg: 0.0 ± 0.0
  XaaSer: 0.0 ± 0.0
  XaaThr: 0.0 ± 0.0
  XaaVal: 0.0 ± 0.0
  XaaTrp: 0.0 ± 0.0
  XaaTyr: 0.0 ± 0.0
  XaaXaa: 0.0 ± 0.0

Statistics based on 2330 proteins (723749 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
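A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the database's own code: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 rounds, and the reported error is taken to be the standard deviation over the rounds. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round resamples whole proteins with replacement, so residues from
    one protein are always kept together (bootstrap at the protein level).
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


toy_proteins = ["MKLLA", "MKAX", "LLLL"]  # hypothetical toy input
mean_ll, err_ll = bootstrap_error(toy_proteins, "LL")
```

Resampling proteins rather than individual residues keeps adjacent residues together, which matters because dipeptide counts within one protein are not independent of each other.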

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski