Amino acid dipepetide frequency for Caldiarchaeum subterraneum

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.023AlaAla: 10.023 ± 0.168
0.763AlaCys: 0.763 ± 0.038
3.695AlaAsp: 3.695 ± 0.083
6.662AlaGlu: 6.662 ± 0.125
3.105AlaPhe: 3.105 ± 0.078
6.643AlaGly: 6.643 ± 0.106
1.256AlaHis: 1.256 ± 0.041
4.155AlaIle: 4.155 ± 0.087
4.5AlaLys: 4.5 ± 0.087
8.881AlaLeu: 8.881 ± 0.141
1.866AlaMet: 1.866 ± 0.055
1.923AlaAsn: 1.923 ± 0.056
2.651AlaPro: 2.651 ± 0.076
1.931AlaGln: 1.931 ± 0.059
4.589AlaArg: 4.589 ± 0.086
5.325AlaSer: 5.325 ± 0.086
4.049AlaThr: 4.049 ± 0.084
9.042AlaVal: 9.042 ± 0.132
0.798AlaTrp: 0.798 ± 0.039
2.793AlaTyr: 2.793 ± 0.068
0.0AlaXaa: 0.0 ± 0.0
Cys
0.437CysAla: 0.437 ± 0.031
0.089CysCys: 0.089 ± 0.011
0.373CysAsp: 0.373 ± 0.022
0.47CysGlu: 0.47 ± 0.025
0.35CysPhe: 0.35 ± 0.025
1.117CysGly: 1.117 ± 0.051
0.18CysHis: 0.18 ± 0.017
0.568CysIle: 0.568 ± 0.028
0.288CysLys: 0.288 ± 0.021
0.785CysLeu: 0.785 ± 0.035
0.222CysMet: 0.222 ± 0.019
0.225CysAsn: 0.225 ± 0.017
0.549CysPro: 0.549 ± 0.032
0.171CysGln: 0.171 ± 0.016
0.711CysArg: 0.711 ± 0.036
0.568CysSer: 0.568 ± 0.03
0.404CysThr: 0.404 ± 0.026
0.722CysVal: 0.722 ± 0.036
0.114CysTrp: 0.114 ± 0.014
0.258CysTyr: 0.258 ± 0.018
0.0CysXaa: 0.0 ± 0.0
Asp
3.663AspAla: 3.663 ± 0.077
0.345AspCys: 0.345 ± 0.024
2.027AspAsp: 2.027 ± 0.064
3.917AspGlu: 3.917 ± 0.086
1.986AspPhe: 1.986 ± 0.062
3.678AspGly: 3.678 ± 0.085
0.66AspHis: 0.66 ± 0.033
3.136AspIle: 3.136 ± 0.067
2.356AspLys: 2.356 ± 0.065
4.056AspLeu: 4.056 ± 0.084
1.138AspMet: 1.138 ± 0.047
1.22AspAsn: 1.22 ± 0.045
2.518AspPro: 2.518 ± 0.071
0.756AspGln: 0.756 ± 0.034
2.413AspArg: 2.413 ± 0.061
2.049AspSer: 2.049 ± 0.056
2.09AspThr: 2.09 ± 0.062
4.928AspVal: 4.928 ± 0.096
0.548AspTrp: 0.548 ± 0.029
1.654AspTyr: 1.654 ± 0.051
0.0AspXaa: 0.0 ± 0.0
Glu
6.736GluAla: 6.736 ± 0.126
0.413GluCys: 0.413 ± 0.023
2.929GluAsp: 2.929 ± 0.069
5.948GluGlu: 5.948 ± 0.141
2.328GluPhe: 2.328 ± 0.065
4.119GluGly: 4.119 ± 0.075
1.125GluHis: 1.125 ± 0.045
4.428GluIle: 4.428 ± 0.092
6.703GluLys: 6.703 ± 0.105
6.879GluLeu: 6.879 ± 0.134
1.899GluMet: 1.899 ± 0.06
2.736GluAsn: 2.736 ± 0.068
2.677GluPro: 2.677 ± 0.068
1.946GluGln: 1.946 ± 0.055
4.214GluArg: 4.214 ± 0.076
2.951GluSer: 2.951 ± 0.076
4.011GluThr: 4.011 ± 0.089
5.853GluVal: 5.853 ± 0.106
0.711GluTrp: 0.711 ± 0.037
2.065GluTyr: 2.065 ± 0.061
0.0GluXaa: 0.0 ± 0.0
Phe
2.918PheAla: 2.918 ± 0.068
0.345PheCys: 0.345 ± 0.023
2.002PheAsp: 2.002 ± 0.055
2.47PheGlu: 2.47 ± 0.062
1.975PhePhe: 1.975 ± 0.068
3.149PheGly: 3.149 ± 0.084
0.666PheHis: 0.666 ± 0.033
2.559PheIle: 2.559 ± 0.072
1.742PheLys: 1.742 ± 0.056
4.331PheLeu: 4.331 ± 0.108
0.877PheMet: 0.877 ± 0.04
1.256PheAsn: 1.256 ± 0.044
1.612PhePro: 1.612 ± 0.05
0.999PheGln: 0.999 ± 0.039
2.465PheArg: 2.465 ± 0.063
2.731PheSer: 2.731 ± 0.067
2.402PheThr: 2.402 ± 0.069
3.045PheVal: 3.045 ± 0.07
0.517PheTrp: 0.517 ± 0.03
1.567PheTyr: 1.567 ± 0.052
0.0PheXaa: 0.0 ± 0.0
Gly
5.561GlyAla: 5.561 ± 0.103
0.836GlyCys: 0.836 ± 0.037
3.532GlyAsp: 3.532 ± 0.079
5.561GlyGlu: 5.561 ± 0.08
3.728GlyPhe: 3.728 ± 0.073
6.046GlyGly: 6.046 ± 0.125
1.426GlyHis: 1.426 ± 0.048
4.081GlyIle: 4.081 ± 0.082
4.382GlyLys: 4.382 ± 0.074
8.055GlyLeu: 8.055 ± 0.134
1.926GlyMet: 1.926 ± 0.051
1.828GlyAsn: 1.828 ± 0.049
2.772GlyPro: 2.772 ± 0.066
1.731GlyGln: 1.731 ± 0.055
4.94GlyArg: 4.94 ± 0.098
4.03GlySer: 4.03 ± 0.079
3.189GlyThr: 3.189 ± 0.073
7.836GlyVal: 7.836 ± 0.119
1.086GlyTrp: 1.086 ± 0.043
3.062GlyTyr: 3.062 ± 0.069
0.0GlyXaa: 0.0 ± 0.0
His
1.264HisAla: 1.264 ± 0.042
0.153HisCys: 0.153 ± 0.017
0.722HisAsp: 0.722 ± 0.032
1.087HisGlu: 1.087 ± 0.044
0.65HisPhe: 0.65 ± 0.037
1.856HisGly: 1.856 ± 0.048
0.464HisHis: 0.464 ± 0.03
1.106HisIle: 1.106 ± 0.04
0.633HisLys: 0.633 ± 0.04
2.018HisLeu: 2.018 ± 0.059
0.435HisMet: 0.435 ± 0.029
0.465HisAsn: 0.465 ± 0.027
1.168HisPro: 1.168 ± 0.042
0.432HisGln: 0.432 ± 0.03
1.244HisArg: 1.244 ± 0.048
1.002HisSer: 1.002 ± 0.037
0.812HisThr: 0.812 ± 0.035
1.82HisVal: 1.82 ± 0.052
0.161HisTrp: 0.161 ± 0.015
0.57HisTyr: 0.57 ± 0.027
0.0HisXaa: 0.0 ± 0.0
Ile
5.132IleAla: 5.132 ± 0.097
0.491IleCys: 0.491 ± 0.032
2.918IleAsp: 2.918 ± 0.066
3.856IleGlu: 3.856 ± 0.078
2.287IlePhe: 2.287 ± 0.067
4.489IleGly: 4.489 ± 0.09
1.348IleHis: 1.348 ± 0.043
3.755IleIle: 3.755 ± 0.089
2.902IleLys: 2.902 ± 0.063
5.901IleLeu: 5.901 ± 0.107
1.339IleMet: 1.339 ± 0.038
1.983IleAsn: 1.983 ± 0.055
3.265IlePro: 3.265 ± 0.075
1.442IleGln: 1.442 ± 0.051
3.516IleArg: 3.516 ± 0.073
3.543IleSer: 3.543 ± 0.086
3.284IleThr: 3.284 ± 0.077
5.078IleVal: 5.078 ± 0.089
0.503IleTrp: 0.503 ± 0.026
2.071IleTyr: 2.071 ± 0.059
0.0IleXaa: 0.0 ± 0.0
Lys
4.633LysAla: 4.633 ± 0.088
0.411LysCys: 0.411 ± 0.027
1.872LysAsp: 1.872 ± 0.059
3.062LysGlu: 3.062 ± 0.078
1.486LysPhe: 1.486 ± 0.049
3.356LysGly: 3.356 ± 0.073
1.169LysHis: 1.169 ± 0.044
4.318LysIle: 4.318 ± 0.08
3.355LysLys: 3.355 ± 0.078
5.8LysLeu: 5.8 ± 0.101
1.508LysMet: 1.508 ± 0.054
1.997LysAsn: 1.997 ± 0.053
3.409LysPro: 3.409 ± 0.073
1.73LysGln: 1.73 ± 0.057
3.393LysArg: 3.393 ± 0.073
2.741LysSer: 2.741 ± 0.06
3.95LysThr: 3.95 ± 0.078
4.255LysVal: 4.255 ± 0.094
0.608LysTrp: 0.608 ± 0.038
1.575LysTyr: 1.575 ± 0.048
0.0LysXaa: 0.0 ± 0.0
Leu
9.62LeuAla: 9.62 ± 0.134
0.826LeuCys: 0.826 ± 0.037
4.939LeuAsp: 4.939 ± 0.087
7.662LeuGlu: 7.662 ± 0.128
3.825LeuPhe: 3.825 ± 0.093
8.278LeuGly: 8.278 ± 0.124
1.794LeuHis: 1.794 ± 0.052
5.412LeuIle: 5.412 ± 0.116
5.641LeuLys: 5.641 ± 0.109
11.341LeuLeu: 11.341 ± 0.184
2.603LeuMet: 2.603 ± 0.059
3.2LeuAsn: 3.2 ± 0.079
4.504LeuPro: 4.504 ± 0.087
2.613LeuGln: 2.613 ± 0.066
7.553LeuArg: 7.553 ± 0.114
7.219LeuSer: 7.219 ± 0.136
5.646LeuThr: 5.646 ± 0.105
8.841LeuVal: 8.841 ± 0.138
0.964LeuTrp: 0.964 ± 0.044
3.279LeuTyr: 3.279 ± 0.075
0.0LeuXaa: 0.0 ± 0.0
Met
2.162MetAla: 2.162 ± 0.061
0.169MetCys: 0.169 ± 0.015
1.201MetAsp: 1.201 ± 0.04
1.446MetGlu: 1.446 ± 0.043
0.9MetPhe: 0.9 ± 0.037
1.934MetGly: 1.934 ± 0.057
0.497MetHis: 0.497 ± 0.026
1.222MetIle: 1.222 ± 0.048
1.41MetLys: 1.41 ± 0.045
2.847MetLeu: 2.847 ± 0.073
0.741MetMet: 0.741 ± 0.036
0.701MetAsn: 0.701 ± 0.032
1.125MetPro: 1.125 ± 0.043
0.592MetGln: 0.592 ± 0.031
1.807MetArg: 1.807 ± 0.053
1.571MetSer: 1.571 ± 0.044
1.218MetThr: 1.218 ± 0.042
2.247MetVal: 2.247 ± 0.063
0.228MetTrp: 0.228 ± 0.02
0.551MetTyr: 0.551 ± 0.027
0.0MetXaa: 0.0 ± 0.0
Asn
2.007AsnAla: 2.007 ± 0.057
0.301AsnCys: 0.301 ± 0.024
1.014AsnAsp: 1.014 ± 0.039
1.589AsnGlu: 1.589 ± 0.057
1.112AsnPhe: 1.112 ± 0.044
2.124AsnGly: 2.124 ± 0.06
0.53AsnHis: 0.53 ± 0.033
2.677AsnIle: 2.677 ± 0.07
1.364AsnLys: 1.364 ± 0.051
3.348AsnLeu: 3.348 ± 0.07
0.881AsnMet: 0.881 ± 0.041
0.972AsnAsn: 0.972 ± 0.041
2.266AsnPro: 2.266 ± 0.062
0.709AsnGln: 0.709 ± 0.038
1.81AsnArg: 1.81 ± 0.06
1.502AsnSer: 1.502 ± 0.049
1.654AsnThr: 1.654 ± 0.059
2.571AsnVal: 2.571 ± 0.064
0.323AsnTrp: 0.323 ± 0.027
1.059AsnTyr: 1.059 ± 0.04
0.0AsnXaa: 0.0 ± 0.0
Pro
3.415ProAla: 3.415 ± 0.074
0.358ProCys: 0.358 ± 0.024
2.369ProAsp: 2.369 ± 0.061
3.45ProGlu: 3.45 ± 0.074
1.961ProPhe: 1.961 ± 0.05
3.097ProGly: 3.097 ± 0.078
1.084ProHis: 1.084 ± 0.048
2.367ProIle: 2.367 ± 0.064
1.85ProLys: 1.85 ± 0.051
4.852ProLeu: 4.852 ± 0.103
0.997ProMet: 0.997 ± 0.039
1.388ProAsn: 1.388 ± 0.055
2.676ProPro: 2.676 ± 0.079
1.299ProGln: 1.299 ± 0.046
3.053ProArg: 3.053 ± 0.065
3.098ProSer: 3.098 ± 0.07
2.369ProThr: 2.369 ± 0.061
4.271ProVal: 4.271 ± 0.089
0.585ProTrp: 0.585 ± 0.034
1.772ProTyr: 1.772 ± 0.054
0.0ProXaa: 0.0 ± 0.0
Gln
2.181GlnAla: 2.181 ± 0.054
0.123GlnCys: 0.123 ± 0.014
0.799GlnAsp: 0.799 ± 0.041
1.388GlnGlu: 1.388 ± 0.052
0.805GlnPhe: 0.805 ± 0.039
1.462GlnGly: 1.462 ± 0.052
0.56GlnHis: 0.56 ± 0.031
1.495GlnIle: 1.495 ± 0.046
1.518GlnLys: 1.518 ± 0.049
3.119GlnLeu: 3.119 ± 0.082
0.644GlnMet: 0.644 ± 0.038
0.885GlnAsn: 0.885 ± 0.041
1.22GlnPro: 1.22 ± 0.05
1.036GlnGln: 1.036 ± 0.041
1.942GlnArg: 1.942 ± 0.053
1.299GlnSer: 1.299 ± 0.054
1.557GlnThr: 1.557 ± 0.056
1.964GlnVal: 1.964 ± 0.054
0.22GlnTrp: 0.22 ± 0.016
0.747GlnTyr: 0.747 ± 0.031
0.0GlnXaa: 0.0 ± 0.0
Arg
4.405ArgAla: 4.405 ± 0.085
0.687ArgCys: 0.687 ± 0.032
2.927ArgAsp: 2.927 ± 0.067
5.065ArgGlu: 5.065 ± 0.104
2.638ArgPhe: 2.638 ± 0.064
4.793ArgGly: 4.793 ± 0.084
1.109ArgHis: 1.109 ± 0.045
3.92ArgIle: 3.92 ± 0.085
3.689ArgLys: 3.689 ± 0.081
7.368ArgLeu: 7.368 ± 0.114
1.736ArgMet: 1.736 ± 0.045
1.967ArgAsn: 1.967 ± 0.064
2.636ArgPro: 2.636 ± 0.069
1.87ArgGln: 1.87 ± 0.05
5.418ArgArg: 5.418 ± 0.114
2.948ArgSer: 2.948 ± 0.068
2.562ArgThr: 2.562 ± 0.059
5.651ArgVal: 5.651 ± 0.096
0.769ArgTrp: 0.769 ± 0.038
2.252ArgTyr: 2.252 ± 0.056
0.0ArgXaa: 0.0 ± 0.0
Ser
4.239SerAla: 4.239 ± 0.095
0.487SerCys: 0.487 ± 0.028
2.537SerAsp: 2.537 ± 0.063
3.657SerGlu: 3.657 ± 0.086
2.673SerPhe: 2.673 ± 0.069
4.679SerGly: 4.679 ± 0.088
1.103SerHis: 1.103 ± 0.041
3.616SerIle: 3.616 ± 0.079
2.866SerLys: 2.866 ± 0.072
6.641SerLeu: 6.641 ± 0.111
1.567SerMet: 1.567 ± 0.05
1.468SerAsn: 1.468 ± 0.051
3.073SerPro: 3.073 ± 0.075
1.631SerGln: 1.631 ± 0.056
3.898SerArg: 3.898 ± 0.079
4.038SerSer: 4.038 ± 0.08
3.133SerThr: 3.133 ± 0.082
4.575SerVal: 4.575 ± 0.076
0.817SerTrp: 0.817 ± 0.03
2.092SerTyr: 2.092 ± 0.06
0.0SerXaa: 0.0 ± 0.0
Thr
4.649ThrAla: 4.649 ± 0.083
0.475ThrCys: 0.475 ± 0.027
1.938ThrAsp: 1.938 ± 0.054
2.799ThrGlu: 2.799 ± 0.079
1.834ThrPhe: 1.834 ± 0.063
4.69ThrGly: 4.69 ± 0.093
1.01ThrHis: 1.01 ± 0.037
3.154ThrIle: 3.154 ± 0.073
1.847ThrLys: 1.847 ± 0.052
5.793ThrLeu: 5.793 ± 0.105
1.177ThrMet: 1.177 ± 0.041
1.378ThrAsn: 1.378 ± 0.04
3.078ThrPro: 3.078 ± 0.069
1.236ThrGln: 1.236 ± 0.048
2.945ThrArg: 2.945 ± 0.065
3.143ThrSer: 3.143 ± 0.078
3.111ThrThr: 3.111 ± 0.123
5.592ThrVal: 5.592 ± 0.111
0.608ThrTrp: 0.608 ± 0.033
1.79ThrTyr: 1.79 ± 0.067
0.0ThrXaa: 0.0 ± 0.0
Val
8.107ValAla: 8.107 ± 0.129
0.874ValCys: 0.874 ± 0.04
5.26ValAsp: 5.26 ± 0.097
7.975ValGlu: 7.975 ± 0.116
4.125ValPhe: 4.125 ± 0.085
6.464ValGly: 6.464 ± 0.106
1.291ValHis: 1.291 ± 0.045
4.51ValIle: 4.51 ± 0.086
5.852ValLys: 5.852 ± 0.108
8.825ValLeu: 8.825 ± 0.118
1.875ValMet: 1.875 ± 0.049
2.769ValAsn: 2.769 ± 0.072
2.962ValPro: 2.962 ± 0.063
1.734ValGln: 1.734 ± 0.052
4.814ValArg: 4.814 ± 0.102
6.092ValSer: 6.092 ± 0.091
4.385ValThr: 4.385 ± 0.09
9.452ValVal: 9.452 ± 0.158
0.921ValTrp: 0.921 ± 0.036
3.592ValTyr: 3.592 ± 0.074
0.0ValXaa: 0.0 ± 0.0
Trp
0.737TrpAla: 0.737 ± 0.034
0.117TrpCys: 0.117 ± 0.013
0.462TrpAsp: 0.462 ± 0.026
0.584TrpGlu: 0.584 ± 0.03
0.479TrpPhe: 0.479 ± 0.03
0.763TrpGly: 0.763 ± 0.036
0.217TrpHis: 0.217 ± 0.018
0.603TrpIle: 0.603 ± 0.031
0.472TrpLys: 0.472 ± 0.027
1.32TrpLeu: 1.32 ± 0.044
0.326TrpMet: 0.326 ± 0.023
0.421TrpAsn: 0.421 ± 0.023
0.437TrpPro: 0.437 ± 0.032
0.313TrpGln: 0.313 ± 0.023
1.093TrpArg: 1.093 ± 0.048
0.798TrpSer: 0.798 ± 0.039
0.505TrpThr: 0.505 ± 0.032
0.911TrpVal: 0.911 ± 0.038
0.165TrpTrp: 0.165 ± 0.017
0.335TrpTyr: 0.335 ± 0.024
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.76TyrAla: 2.76 ± 0.067
0.337TyrCys: 0.337 ± 0.024
1.62TyrAsp: 1.62 ± 0.05
2.024TyrGlu: 2.024 ± 0.048
1.381TyrPhe: 1.381 ± 0.046
2.871TyrGly: 2.871 ± 0.066
0.571TyrHis: 0.571 ± 0.031
2.019TyrIle: 2.019 ± 0.06
1.258TyrLys: 1.258 ± 0.046
3.6TyrLeu: 3.6 ± 0.081
0.812TyrMet: 0.812 ± 0.039
1.032TyrAsn: 1.032 ± 0.043
1.712TyrPro: 1.712 ± 0.052
0.793TyrGln: 0.793 ± 0.031
2.594TyrArg: 2.594 ± 0.069
2.141TyrSer: 2.141 ± 0.055
1.916TyrThr: 1.916 ± 0.064
3.227TyrVal: 3.227 ± 0.065
0.415TyrTrp: 0.415 ± 0.026
1.03TyrTyr: 1.03 ± 0.05
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2154 proteins (631940 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski