Amino acid dipeptide frequency for Sphingomonas sp. AAP5

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each would be expected at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
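As a rough illustration of how such frequencies can be computed, the sketch below counts adjacent residue pairs and converts the counts to per mille. It is not the Proteome-pI implementation: the function names, the toy sequences, the use of overlapping windows within each protein, and the mapping of non-standard one-letter codes to X (Xaa) are all assumptions made for this example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Expected per-mille frequency if all 400 dipeptides were equally likely.
UNIFORM_EXPECTED = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%


def dipeptide_counts(proteins):
    """Count adjacent residue pairs (overlapping windows) within each protein.

    Assumption: any non-standard residue is mapped to 'X' (Xaa).
    Pairs never span two different proteins.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    return counts


def per_mille(counts):
    """Convert raw pair counts to frequencies in per mille."""
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq):
    """Label a per-mille frequency relative to the uniform expectation."""
    if freq > UNIFORM_EXPECTED:
        return "over"
    if freq < UNIFORM_EXPECTED:
        return "under"
    return "expected"


# Hypothetical toy input, not real proteome data.
proteins = ["MAAL", "AAGA"]
freqs = per_mille(dipeptide_counts(proteins))
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With this convention, a table value such as 20.919‰ for AlaAla is classified as overrepresented and 0.096‰ for CysCys as underrepresented.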

For more information, see the sequence space article on Wikipedia.

Second residue (column order): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Values are grouped by first residue and listed as dipeptide: frequency ± error (‰).

Ala
AlaAla: 20.919 ± 0.207
AlaCys: 1.096 ± 0.033
AlaAsp: 7.655 ± 0.074
AlaGlu: 7.201 ± 0.09
AlaPhe: 4.579 ± 0.068
AlaGly: 11.785 ± 0.113
AlaHis: 2.558 ± 0.044
AlaIle: 7.298 ± 0.073
AlaLys: 4.36 ± 0.068
AlaLeu: 14.891 ± 0.147
AlaMet: 3.954 ± 0.067
AlaAsn: 3.212 ± 0.059
AlaPro: 6.963 ± 0.084
AlaGln: 4.617 ± 0.065
AlaArg: 10.029 ± 0.098
AlaSer: 6.78 ± 0.082
AlaThr: 7.932 ± 0.081
AlaVal: 9.221 ± 0.101
AlaTrp: 1.807 ± 0.044
AlaTyr: 2.668 ± 0.043
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.938 ± 0.028
CysCys: 0.096 ± 0.009
CysAsp: 0.501 ± 0.019
CysGlu: 0.358 ± 0.016
CysPhe: 0.254 ± 0.014
CysGly: 0.854 ± 0.024
CysHis: 0.199 ± 0.012
CysIle: 0.313 ± 0.019
CysLys: 0.162 ± 0.01
CysLeu: 0.652 ± 0.021
CysMet: 0.123 ± 0.01
CysAsn: 0.185 ± 0.012
CysPro: 0.426 ± 0.021
CysGln: 0.17 ± 0.012
CysArg: 0.519 ± 0.021
CysSer: 0.425 ± 0.019
CysThr: 0.403 ± 0.016
CysVal: 0.558 ± 0.02
CysTrp: 0.113 ± 0.01
CysTyr: 0.161 ± 0.012
CysXaa: 0.0 ± 0.0

Asp
AspAla: 8.329 ± 0.084
AspCys: 0.438 ± 0.018
AspAsp: 3.29 ± 0.058
AspGlu: 2.931 ± 0.055
AspPhe: 2.166 ± 0.047
AspGly: 5.527 ± 0.069
AspHis: 1.302 ± 0.033
AspIle: 2.643 ± 0.054
AspLys: 1.533 ± 0.039
AspLeu: 5.769 ± 0.074
AspMet: 1.215 ± 0.031
AspAsn: 1.103 ± 0.029
AspPro: 3.871 ± 0.065
AspGln: 1.898 ± 0.039
AspArg: 4.926 ± 0.077
AspSer: 2.338 ± 0.038
AspThr: 3.005 ± 0.052
AspVal: 4.264 ± 0.058
AspTrp: 1.129 ± 0.031
AspTyr: 1.531 ± 0.034
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.962 ± 0.093
GluCys: 0.271 ± 0.015
GluAsp: 2.486 ± 0.049
GluGlu: 2.248 ± 0.061
GluPhe: 1.341 ± 0.034
GluGly: 3.859 ± 0.065
GluHis: 1.076 ± 0.032
GluIle: 2.711 ± 0.046
GluLys: 1.574 ± 0.037
GluLeu: 4.593 ± 0.067
GluMet: 1.252 ± 0.032
GluAsn: 1.131 ± 0.032
GluPro: 2.373 ± 0.047
GluGln: 1.857 ± 0.038
GluArg: 4.714 ± 0.075
GluSer: 2.108 ± 0.036
GluThr: 3.268 ± 0.049
GluVal: 3.14 ± 0.054
GluTrp: 0.673 ± 0.021
GluTyr: 0.914 ± 0.027
GluXaa: 0.0 ± 0.0

Phe
PheAla: 5.096 ± 0.073
PheCys: 0.295 ± 0.014
PheAsp: 2.661 ± 0.049
PheGlu: 1.869 ± 0.038
PhePhe: 1.216 ± 0.031
PheGly: 3.698 ± 0.056
PheHis: 0.7 ± 0.024
PheIle: 1.316 ± 0.034
PheLys: 0.9 ± 0.029
PheLeu: 2.983 ± 0.055
PheMet: 0.615 ± 0.021
PheAsn: 0.965 ± 0.038
PhePro: 1.482 ± 0.038
PheGln: 0.889 ± 0.026
PheArg: 2.1 ± 0.042
PheSer: 1.952 ± 0.039
PheThr: 1.973 ± 0.044
PheVal: 2.747 ± 0.047
PheTrp: 0.504 ± 0.02
PheTyr: 0.891 ± 0.03
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 10.775 ± 0.104
GlyCys: 0.844 ± 0.027
GlyAsp: 4.874 ± 0.065
GlyGlu: 4.287 ± 0.061
GlyPhe: 3.688 ± 0.066
GlyGly: 8.258 ± 0.113
GlyHis: 1.866 ± 0.043
GlyIle: 4.494 ± 0.057
GlyLys: 3.171 ± 0.055
GlyLeu: 8.627 ± 0.086
GlyMet: 2.24 ± 0.046
GlyAsn: 2.205 ± 0.047
GlyPro: 3.52 ± 0.049
GlyGln: 2.853 ± 0.051
GlyArg: 6.166 ± 0.08
GlySer: 4.813 ± 0.071
GlyThr: 5.181 ± 0.079
GlyVal: 6.886 ± 0.079
GlyTrp: 1.656 ± 0.032
GlyTyr: 2.438 ± 0.05
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.568 ± 0.051
HisCys: 0.18 ± 0.013
HisAsp: 1.299 ± 0.032
HisGlu: 0.865 ± 0.026
HisPhe: 0.751 ± 0.022
HisGly: 2.075 ± 0.048
HisHis: 0.554 ± 0.021
HisIle: 0.921 ± 0.027
HisLys: 0.428 ± 0.016
HisLeu: 1.905 ± 0.032
HisMet: 0.392 ± 0.016
HisAsn: 0.442 ± 0.02
HisPro: 1.296 ± 0.035
HisGln: 0.523 ± 0.021
HisArg: 1.446 ± 0.032
HisSer: 0.914 ± 0.03
HisThr: 0.816 ± 0.027
HisVal: 1.585 ± 0.036
HisTrp: 0.346 ± 0.015
HisTyr: 0.582 ± 0.022
HisXaa: 0.0 ± 0.0

Ile
IleAla: 8.383 ± 0.086
IleCys: 0.386 ± 0.015
IleAsp: 3.996 ± 0.06
IleGlu: 3.401 ± 0.058
IlePhe: 1.384 ± 0.034
IleGly: 5.302 ± 0.076
IleHis: 0.78 ± 0.024
IleIle: 1.821 ± 0.042
IleLys: 1.225 ± 0.032
IleLeu: 3.946 ± 0.059
IleMet: 0.735 ± 0.025
IleAsn: 1.235 ± 0.034
IlePro: 2.241 ± 0.04
IleGln: 1.072 ± 0.032
IleArg: 2.949 ± 0.052
IleSer: 2.349 ± 0.047
IleThr: 2.68 ± 0.046
IleVal: 4.591 ± 0.061
IleTrp: 0.533 ± 0.021
IleTyr: 0.908 ± 0.026
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.945 ± 0.065
LysCys: 0.145 ± 0.011
LysAsp: 1.477 ± 0.038
LysGlu: 0.988 ± 0.029
LysPhe: 0.785 ± 0.026
LysGly: 2.456 ± 0.051
LysHis: 0.516 ± 0.021
LysIle: 1.491 ± 0.033
LysLys: 0.899 ± 0.03
LysLeu: 3.203 ± 0.059
LysMet: 0.724 ± 0.023
LysAsn: 0.655 ± 0.025
LysPro: 1.988 ± 0.044
LysGln: 0.869 ± 0.028
LysArg: 2.325 ± 0.042
LysSer: 1.483 ± 0.035
LysThr: 1.857 ± 0.038
LysVal: 2.099 ± 0.045
LysTrp: 0.326 ± 0.016
LysTyr: 0.589 ± 0.024
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 14.715 ± 0.15
LeuCys: 0.725 ± 0.024
LeuAsp: 6.311 ± 0.084
LeuGlu: 4.528 ± 0.072
LeuPhe: 3.593 ± 0.049
LeuGly: 8.723 ± 0.103
LeuHis: 1.906 ± 0.039
LeuIle: 4.715 ± 0.068
LeuLys: 2.94 ± 0.058
LeuLeu: 9.53 ± 0.122
LeuMet: 1.918 ± 0.042
LeuAsn: 2.271 ± 0.054
LeuPro: 5.673 ± 0.075
LeuGln: 2.563 ± 0.042
LeuArg: 6.831 ± 0.084
LeuSer: 5.732 ± 0.067
LeuThr: 5.726 ± 0.073
LeuVal: 7.522 ± 0.082
LeuTrp: 1.247 ± 0.033
LeuTyr: 2.036 ± 0.045
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.051 ± 0.057
MetCys: 0.142 ± 0.012
MetAsp: 0.942 ± 0.031
MetGlu: 0.818 ± 0.031
MetPhe: 0.621 ± 0.021
MetGly: 1.716 ± 0.043
MetHis: 0.409 ± 0.018
MetIle: 1.34 ± 0.032
MetLys: 0.807 ± 0.026
MetLeu: 2.609 ± 0.047
MetMet: 0.586 ± 0.022
MetAsn: 0.575 ± 0.02
MetPro: 1.424 ± 0.033
MetGln: 0.666 ± 0.024
MetArg: 1.74 ± 0.035
MetSer: 1.343 ± 0.029
MetThr: 1.834 ± 0.036
MetVal: 1.572 ± 0.034
MetTrp: 0.202 ± 0.013
MetTyr: 0.264 ± 0.015
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.3 ± 0.06
AsnCys: 0.198 ± 0.013
AsnAsp: 1.358 ± 0.036
AsnGlu: 0.94 ± 0.027
AsnPhe: 0.866 ± 0.037
AsnGly: 2.394 ± 0.056
AsnHis: 0.456 ± 0.022
AsnIle: 1.149 ± 0.028
AsnLys: 0.575 ± 0.021
AsnLeu: 2.351 ± 0.046
AsnMet: 0.446 ± 0.02
AsnAsn: 0.622 ± 0.03
AsnPro: 1.737 ± 0.036
AsnGln: 0.755 ± 0.026
AsnArg: 1.731 ± 0.038
AsnSer: 1.12 ± 0.037
AsnThr: 1.279 ± 0.04
AsnVal: 1.78 ± 0.039
AsnTrp: 0.362 ± 0.018
AsnTyr: 0.698 ± 0.03
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 7.43 ± 0.086
ProCys: 0.32 ± 0.015
ProAsp: 3.745 ± 0.059
ProGlu: 3.198 ± 0.057
ProPhe: 1.862 ± 0.039
ProGly: 4.809 ± 0.069
ProHis: 1.011 ± 0.029
ProIle: 2.65 ± 0.042
ProLys: 1.616 ± 0.035
ProLeu: 5.074 ± 0.062
ProMet: 1.152 ± 0.032
ProAsn: 1.345 ± 0.033
ProPro: 2.81 ± 0.064
ProGln: 1.637 ± 0.032
ProArg: 3.132 ± 0.051
ProSer: 2.895 ± 0.048
ProThr: 3.015 ± 0.048
ProVal: 4.4 ± 0.065
ProTrp: 0.67 ± 0.022
ProTyr: 1.121 ± 0.032
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.182 ± 0.067
GlnCys: 0.195 ± 0.012
GlnAsp: 1.343 ± 0.032
GlnGlu: 1.037 ± 0.028
GlnPhe: 1.026 ± 0.032
GlnGly: 2.317 ± 0.042
GlnHis: 0.636 ± 0.022
GlnIle: 1.738 ± 0.038
GlnLys: 0.803 ± 0.025
GlnLeu: 2.915 ± 0.045
GlnMet: 0.791 ± 0.023
GlnAsn: 0.792 ± 0.029
GlnPro: 1.791 ± 0.038
GlnGln: 1.123 ± 0.033
GlnArg: 2.594 ± 0.049
GlnSer: 1.729 ± 0.039
GlnThr: 1.905 ± 0.048
GlnVal: 2.21 ± 0.042
GlnTrp: 0.426 ± 0.019
GlnTyr: 0.637 ± 0.022
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 9.337 ± 0.1
ArgCys: 0.479 ± 0.021
ArgAsp: 4.561 ± 0.078
ArgGlu: 3.573 ± 0.062
ArgPhe: 2.966 ± 0.047
ArgGly: 5.335 ± 0.074
ArgHis: 1.677 ± 0.041
ArgIle: 4.055 ± 0.054
ArgLys: 1.891 ± 0.037
ArgLeu: 7.679 ± 0.086
ArgMet: 1.826 ± 0.039
ArgAsn: 1.716 ± 0.034
ArgPro: 3.591 ± 0.054
ArgGln: 2.246 ± 0.041
ArgArg: 5.581 ± 0.087
ArgSer: 3.84 ± 0.048
ArgThr: 3.788 ± 0.058
ArgVal: 5.071 ± 0.061
ArgTrp: 1.225 ± 0.035
ArgTyr: 1.933 ± 0.045
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.754 ± 0.077
SerCys: 0.365 ± 0.016
SerAsp: 3.116 ± 0.053
SerGlu: 2.329 ± 0.044
SerPhe: 2.112 ± 0.039
SerGly: 5.314 ± 0.062
SerHis: 0.953 ± 0.028
SerIle: 2.685 ± 0.048
SerLys: 1.378 ± 0.035
SerLeu: 4.895 ± 0.07
SerMet: 1.131 ± 0.031
SerAsn: 1.441 ± 0.038
SerPro: 2.815 ± 0.053
SerGln: 1.486 ± 0.038
SerArg: 3.247 ± 0.055
SerSer: 2.712 ± 0.058
SerThr: 2.857 ± 0.054
SerVal: 4.073 ± 0.058
SerTrp: 0.708 ± 0.023
SerTyr: 1.393 ± 0.03
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 7.39 ± 0.086
ThrCys: 0.398 ± 0.019
ThrAsp: 2.977 ± 0.048
ThrGlu: 2.287 ± 0.048
ThrPhe: 1.89 ± 0.045
ThrGly: 5.526 ± 0.076
ThrHis: 1.08 ± 0.03
ThrIle: 3.4 ± 0.057
ThrLys: 1.571 ± 0.039
ThrLeu: 6.721 ± 0.082
ThrMet: 1.264 ± 0.031
ThrAsn: 1.381 ± 0.038
ThrPro: 3.927 ± 0.053
ThrGln: 1.631 ± 0.041
ThrArg: 3.731 ± 0.06
ThrSer: 2.852 ± 0.051
ThrThr: 3.338 ± 0.055
ThrVal: 4.495 ± 0.068
ThrTrp: 0.638 ± 0.022
ThrTyr: 1.269 ± 0.033
ThrXaa: 0.0 ± 0.0

Val
ValAla: 11.092 ± 0.097
ValCys: 0.531 ± 0.022
ValAsp: 4.317 ± 0.064
ValGlu: 4.148 ± 0.061
ValPhe: 2.35 ± 0.04
ValGly: 5.862 ± 0.072
ValHis: 1.323 ± 0.032
ValIle: 3.644 ± 0.054
ValLys: 2.005 ± 0.046
ValLeu: 7.089 ± 0.08
ValMet: 1.593 ± 0.037
ValAsn: 1.808 ± 0.045
ValPro: 4.155 ± 0.062
ValGln: 1.988 ± 0.038
ValArg: 5.259 ± 0.069
ValSer: 4.274 ± 0.061
ValThr: 4.925 ± 0.058
ValVal: 5.857 ± 0.078
ValTrp: 0.931 ± 0.032
ValTyr: 1.392 ± 0.033
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.396 ± 0.035
TrpCys: 0.118 ± 0.01
TrpAsp: 0.707 ± 0.026
TrpGlu: 0.494 ± 0.018
TrpPhe: 0.575 ± 0.024
TrpGly: 0.963 ± 0.031
TrpHis: 0.4 ± 0.016
TrpIle: 0.712 ± 0.023
TrpLys: 0.429 ± 0.02
TrpLeu: 1.668 ± 0.039
TrpMet: 0.33 ± 0.015
TrpAsn: 0.406 ± 0.016
TrpPro: 0.731 ± 0.024
TrpGln: 0.621 ± 0.025
TrpArg: 1.358 ± 0.038
TrpSer: 0.93 ± 0.031
TrpThr: 0.854 ± 0.024
TrpVal: 0.858 ± 0.031
TrpTrp: 0.26 ± 0.015
TrpTyr: 0.3 ± 0.013
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.963 ± 0.055
TyrCys: 0.185 ± 0.012
TyrAsp: 1.593 ± 0.039
TyrGlu: 1.051 ± 0.03
TyrPhe: 0.828 ± 0.026
TyrGly: 2.067 ± 0.042
TyrHis: 0.489 ± 0.017
TyrIle: 0.82 ± 0.027
TyrLys: 0.551 ± 0.023
TyrLeu: 2.107 ± 0.043
TyrMet: 0.374 ± 0.018
TyrAsn: 0.625 ± 0.03
TyrPro: 1.076 ± 0.034
TyrGln: 0.729 ± 0.023
TyrArg: 1.991 ± 0.037
TyrSer: 1.177 ± 0.033
TyrThr: 1.13 ± 0.036
TyrVal: 1.614 ± 0.036
TyrTrp: 0.353 ± 0.015
TyrTyr: 0.651 ± 0.025
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3972 proteins (1285990 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
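A minimal sketch of what a protein-level bootstrap could look like is given below. It is an illustration under assumptions, not the procedure used by Proteome-pI: the function names and toy sequences are hypothetical, whole proteins are resampled with replacement, and the reported error is taken to be the standard deviation across replicates, which the source does not specify.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Estimate the per-mille frequency of `pair` and its bootstrap error.

    Assumptions: proteins (not individual residues) are resampled with
    replacement, and the error is the standard deviation of the replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as the proteome contains, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy input, not real proteome data.
proteins = ["MAAL", "AAGA", "GAAV", "MKLA"]
mean, err = bootstrap_error(proteins, "AA")
print(f"AA: {mean:.3f} ± {err:.3f} per mille")
```

Resampling at the protein level keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every replicate. A dipeptide that never occurs, such as any pair containing Xaa in this proteome, yields 0.0 ± 0.0.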

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski