Amino acid dipeptide frequency for Arthrobacter psychrolactophilus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
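As an illustration of the idea, here is a minimal Python sketch of how such frequencies could be counted. It is not the code used by Proteome-pI. The function name dipeptide_per_mille, the toy sequences, the use of one-letter residue codes, and the choice of overlapping windows that never span two proteins are all assumptions made for this example.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille}, pooled over all sequences.

    Assumption: dipeptides are counted as overlapping windows of length 2,
    and a pair never spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" in one-letter code (not real data).
toy_proteome = ["MAAL", "MAL"]
freqs = dipeptide_per_mille(toy_proteome)
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3))
```

The toy input contains five dipeptides in total (MA twice, AL twice, AA once), so the frequencies sum to 1000‰.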

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins at random with equal probability, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.
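The unit conversion and the comparison with the uniform expectation can be sketched as follows. This is only an illustration: the helper names are hypothetical, the 2.5‰ baseline is the 0.25% value stated above, and the exact threshold used for the red and blue marking on the page is not given in the text, so the strict comparison below (which also ignores the error estimates) is an assumption.

```python
# Uniform expectation in per mille: 1/400 of all dipeptides.
UNIFORM_400 = 1000.0 / 400   # 2.5 per mille, i.e. 0.25%
# Alternative baseline if Xaa is counted as a 21st residue.
UNIFORM_441 = 1000.0 / 441   # roughly 2.27 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, baseline=UNIFORM_400):
    """Label a per mille frequency relative to the uniform baseline.

    Assumption: a strict comparison with the baseline, ignoring the error.
    """
    if value > baseline:
        return "over"
    if value < baseline:
        return "under"
    return "as expected"


# Two values taken from the table: AlaAla and CysCys.
examples = {"AlaAla": 18.966, "CysCys": 0.066}
for pair, value in examples.items():
    print(pair, per_mille_to_fraction(value), representation(value))
```

With this baseline, AlaAla (18.966‰) falls in the overrepresented group and CysCys (0.066‰) in the underrepresented group.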

For more information, see the sequence space article on Wikipedia.

Rows: first residue of the dipeptide; columns: second residue. Each cell gives frequency ± error in ‰.

| First \ Second | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 18.966 ± 0.192 | 0.702 ± 0.025 | 6.527 ± 0.084 | 7.524 ± 0.102 | 3.681 ± 0.065 | 11.408 ± 0.112 | 2.478 ± 0.054 | 5.644 ± 0.076 | 4.124 ± 0.067 | 13.478 ± 0.125 | 2.999 ± 0.05 | 3.103 ± 0.052 | 5.926 ± 0.111 | 4.412 ± 0.068 | 6.688 ± 0.087 | 7.19 ± 0.083 | 7.525 ± 0.094 | 10.729 ± 0.115 | 1.594 ± 0.042 | 2.262 ± 0.043 | 0.0 ± 0.0 |
| Cys | 0.757 ± 0.03 | 0.066 ± 0.008 | 0.319 ± 0.016 | 0.318 ± 0.017 | 0.205 ± 0.013 | 0.668 ± 0.024 | 0.14 ± 0.012 | 0.287 ± 0.014 | 0.132 ± 0.011 | 0.517 ± 0.021 | 0.097 ± 0.009 | 0.156 ± 0.012 | 0.313 ± 0.017 | 0.193 ± 0.014 | 0.335 ± 0.016 | 0.429 ± 0.022 | 0.404 ± 0.021 | 0.434 ± 0.019 | 0.092 ± 0.011 | 0.124 ± 0.011 | 0.0 ± 0.0 |
| Asp | 6.978 ± 0.089 | 0.298 ± 0.019 | 2.893 ± 0.051 | 3.258 ± 0.065 | 1.989 ± 0.039 | 5.229 ± 0.075 | 1.153 ± 0.031 | 2.479 ± 0.047 | 1.534 ± 0.042 | 5.315 ± 0.078 | 0.97 ± 0.029 | 1.213 ± 0.032 | 3.405 ± 0.057 | 1.546 ± 0.037 | 2.783 ± 0.052 | 3.101 ± 0.057 | 2.749 ± 0.053 | 4.554 ± 0.071 | 0.835 ± 0.031 | 1.326 ± 0.035 | 0.0 ± 0.0 |
| Glu | 6.676 ± 0.107 | 0.284 ± 0.017 | 2.767 ± 0.054 | 3.09 ± 0.064 | 1.875 ± 0.046 | 3.768 ± 0.06 | 1.434 ± 0.035 | 2.964 ± 0.054 | 1.957 ± 0.051 | 7.242 ± 0.101 | 1.085 ± 0.036 | 1.75 ± 0.038 | 2.599 ± 0.055 | 2.383 ± 0.051 | 3.736 ± 0.073 | 3.482 ± 0.057 | 2.852 ± 0.06 | 4.218 ± 0.064 | 0.764 ± 0.026 | 1.092 ± 0.029 | 0.0 ± 0.0 |
| Phe | 4.122 ± 0.063 | 0.228 ± 0.015 | 2.12 ± 0.043 | 1.742 ± 0.038 | 1.269 ± 0.04 | 3.175 ± 0.062 | 0.668 ± 0.024 | 1.528 ± 0.045 | 0.909 ± 0.03 | 3.178 ± 0.061 | 0.703 ± 0.028 | 1.008 ± 0.035 | 1.509 ± 0.035 | 0.908 ± 0.027 | 1.651 ± 0.038 | 2.434 ± 0.049 | 2.27 ± 0.054 | 2.605 ± 0.05 | 0.509 ± 0.021 | 0.727 ± 0.027 | 0.0 ± 0.0 |
| Gly | 9.641 ± 0.088 | 0.582 ± 0.024 | 3.93 ± 0.068 | 4.495 ± 0.06 | 3.108 ± 0.058 | 6.997 ± 0.09 | 1.922 ± 0.043 | 4.76 ± 0.073 | 3.175 ± 0.058 | 8.815 ± 0.096 | 2.136 ± 0.044 | 2.349 ± 0.046 | 3.488 ± 0.055 | 2.927 ± 0.053 | 4.876 ± 0.072 | 5.702 ± 0.077 | 5.962 ± 0.086 | 7.157 ± 0.099 | 1.574 ± 0.039 | 2.225 ± 0.052 | 0.0 ± 0.0 |
| His | 2.221 ± 0.05 | 0.163 ± 0.01 | 1.249 ± 0.034 | 1.252 ± 0.033 | 0.714 ± 0.025 | 2.022 ± 0.048 | 0.627 ± 0.026 | 0.909 ± 0.026 | 0.497 ± 0.023 | 2.085 ± 0.05 | 0.413 ± 0.02 | 0.495 ± 0.021 | 1.415 ± 0.04 | 0.714 ± 0.025 | 1.415 ± 0.037 | 1.216 ± 0.035 | 1.077 ± 0.027 | 1.65 ± 0.037 | 0.337 ± 0.02 | 0.491 ± 0.023 | 0.0 ± 0.0 |
| Ile | 6.372 ± 0.102 | 0.351 ± 0.017 | 2.955 ± 0.055 | 2.611 ± 0.055 | 1.707 ± 0.047 | 4.25 ± 0.069 | 0.921 ± 0.033 | 2.352 ± 0.057 | 1.462 ± 0.039 | 4.504 ± 0.072 | 1.042 ± 0.033 | 1.469 ± 0.042 | 2.647 ± 0.054 | 1.216 ± 0.036 | 2.487 ± 0.049 | 3.267 ± 0.058 | 3.102 ± 0.048 | 4.184 ± 0.07 | 0.569 ± 0.023 | 0.949 ± 0.032 | 0.0 ± 0.0 |
| Lys | 3.835 ± 0.068 | 0.118 ± 0.01 | 1.83 ± 0.048 | 1.598 ± 0.044 | 1.015 ± 0.033 | 2.132 ± 0.049 | 0.625 ± 0.026 | 1.643 ± 0.036 | 1.256 ± 0.037 | 3.003 ± 0.058 | 0.741 ± 0.023 | 1.046 ± 0.032 | 1.588 ± 0.037 | 0.885 ± 0.028 | 1.751 ± 0.042 | 2.014 ± 0.05 | 1.936 ± 0.051 | 2.658 ± 0.055 | 0.358 ± 0.019 | 0.706 ± 0.025 | 0.0 ± 0.0 |
| Leu | 14.351 ± 0.135 | 0.707 ± 0.028 | 5.825 ± 0.09 | 5.607 ± 0.07 | 3.032 ± 0.061 | 9.155 ± 0.104 | 1.975 ± 0.042 | 4.994 ± 0.088 | 2.836 ± 0.06 | 10.763 ± 0.13 | 2.179 ± 0.049 | 2.917 ± 0.058 | 5.758 ± 0.082 | 2.878 ± 0.05 | 6.439 ± 0.083 | 6.963 ± 0.086 | 6.542 ± 0.076 | 8.218 ± 0.107 | 1.333 ± 0.037 | 1.614 ± 0.041 | 0.0 ± 0.0 |
| Met | 2.866 ± 0.056 | 0.151 ± 0.011 | 1.105 ± 0.035 | 0.997 ± 0.029 | 0.677 ± 0.026 | 1.762 ± 0.045 | 0.386 ± 0.019 | 1.109 ± 0.038 | 0.705 ± 0.025 | 2.054 ± 0.043 | 0.476 ± 0.02 | 0.707 ± 0.025 | 1.119 ± 0.031 | 0.583 ± 0.023 | 1.243 ± 0.033 | 1.782 ± 0.036 | 1.722 ± 0.041 | 1.83 ± 0.039 | 0.25 ± 0.014 | 0.357 ± 0.016 | 0.0 ± 0.0 |
| Asn | 3.156 ± 0.055 | 0.172 ± 0.013 | 1.573 ± 0.04 | 1.359 ± 0.038 | 1.011 ± 0.037 | 2.508 ± 0.049 | 0.576 ± 0.022 | 1.402 ± 0.033 | 0.859 ± 0.032 | 2.495 ± 0.049 | 0.553 ± 0.025 | 0.923 ± 0.034 | 1.993 ± 0.044 | 0.881 ± 0.028 | 1.477 ± 0.037 | 1.763 ± 0.044 | 1.675 ± 0.043 | 2.083 ± 0.05 | 0.428 ± 0.02 | 0.733 ± 0.029 | 0.0 ± 0.0 |
| Pro | 6.802 ± 0.096 | 0.191 ± 0.012 | 3.035 ± 0.056 | 3.897 ± 0.071 | 1.666 ± 0.045 | 4.639 ± 0.064 | 1.101 ± 0.033 | 1.955 ± 0.039 | 1.384 ± 0.04 | 4.955 ± 0.074 | 1.057 ± 0.029 | 1.328 ± 0.033 | 1.842 ± 0.048 | 1.659 ± 0.041 | 2.564 ± 0.058 | 3.364 ± 0.057 | 3.228 ± 0.081 | 4.545 ± 0.072 | 0.834 ± 0.028 | 1.05 ± 0.036 | 0.001 ± 0.001 |
| Gln | 3.96 ± 0.068 | 0.163 ± 0.013 | 1.555 ± 0.04 | 1.808 ± 0.04 | 0.929 ± 0.031 | 2.602 ± 0.052 | 0.696 ± 0.026 | 1.608 ± 0.041 | 1.042 ± 0.028 | 3.871 ± 0.068 | 0.687 ± 0.026 | 0.802 ± 0.025 | 1.479 ± 0.037 | 1.386 ± 0.041 | 2.36 ± 0.049 | 1.813 ± 0.038 | 1.622 ± 0.035 | 2.359 ± 0.041 | 0.616 ± 0.025 | 0.704 ± 0.02 | 0.0 ± 0.0 |
| Arg | 6.185 ± 0.087 | 0.328 ± 0.02 | 2.984 ± 0.059 | 3.496 ± 0.067 | 1.961 ± 0.039 | 4.322 ± 0.071 | 1.395 ± 0.04 | 3.068 ± 0.052 | 1.916 ± 0.043 | 5.858 ± 0.089 | 1.365 ± 0.035 | 1.677 ± 0.044 | 2.735 ± 0.058 | 1.982 ± 0.045 | 4.39 ± 0.08 | 3.759 ± 0.058 | 3.61 ± 0.058 | 4.237 ± 0.072 | 1.012 ± 0.03 | 1.413 ± 0.045 | 0.0 ± 0.0 |
| Ser | 7.661 ± 0.098 | 0.41 ± 0.023 | 3.314 ± 0.061 | 3.455 ± 0.06 | 2.23 ± 0.044 | 6.127 ± 0.088 | 1.349 ± 0.03 | 2.918 ± 0.048 | 1.917 ± 0.045 | 6.482 ± 0.082 | 1.598 ± 0.037 | 1.851 ± 0.041 | 3.184 ± 0.06 | 1.926 ± 0.043 | 3.583 ± 0.06 | 4.409 ± 0.082 | 4.013 ± 0.074 | 5.041 ± 0.069 | 1.053 ± 0.033 | 1.738 ± 0.041 | 0.0 ± 0.0 |
| Thr | 8.14 ± 0.102 | 0.259 ± 0.016 | 2.97 ± 0.057 | 3.227 ± 0.059 | 2.089 ± 0.05 | 5.557 ± 0.084 | 1.187 ± 0.034 | 2.988 ± 0.054 | 1.709 ± 0.046 | 6.596 ± 0.085 | 1.253 ± 0.038 | 1.484 ± 0.041 | 3.915 ± 0.08 | 1.823 ± 0.042 | 2.996 ± 0.05 | 3.869 ± 0.056 | 3.938 ± 0.084 | 5.704 ± 0.078 | 0.799 ± 0.026 | 1.045 ± 0.035 | 0.0 ± 0.0 |
| Val | 10.493 ± 0.106 | 0.541 ± 0.022 | 4.705 ± 0.062 | 4.503 ± 0.073 | 2.655 ± 0.05 | 6.505 ± 0.081 | 1.646 ± 0.044 | 4.248 ± 0.067 | 2.351 ± 0.047 | 8.852 ± 0.107 | 1.817 ± 0.044 | 2.231 ± 0.046 | 4.409 ± 0.06 | 2.352 ± 0.045 | 4.561 ± 0.066 | 5.375 ± 0.079 | 5.224 ± 0.078 | 7.748 ± 0.093 | 0.983 ± 0.03 | 1.4 ± 0.039 | 0.0 ± 0.0 |
| Trp | 1.502 ± 0.041 | 0.116 ± 0.012 | 0.757 ± 0.027 | 0.702 ± 0.028 | 0.579 ± 0.022 | 1.069 ± 0.032 | 0.365 ± 0.02 | 0.77 ± 0.029 | 0.393 ± 0.02 | 1.777 ± 0.04 | 0.346 ± 0.016 | 0.525 ± 0.024 | 0.658 ± 0.024 | 0.648 ± 0.025 | 0.993 ± 0.032 | 0.886 ± 0.027 | 0.863 ± 0.028 | 1.036 ± 0.029 | 0.324 ± 0.02 | 0.305 ± 0.015 | 0.0 ± 0.0 |
| Tyr | 2.276 ± 0.046 | 0.158 ± 0.012 | 1.192 ± 0.034 | 1.079 ± 0.032 | 0.873 ± 0.027 | 1.929 ± 0.039 | 0.318 ± 0.018 | 0.792 ± 0.026 | 0.581 ± 0.026 | 2.246 ± 0.051 | 0.36 ± 0.017 | 0.587 ± 0.025 | 1.113 ± 0.034 | 0.76 ± 0.027 | 1.363 ± 0.039 | 1.439 ± 0.043 | 1.232 ± 0.041 | 1.608 ± 0.038 | 0.354 ± 0.018 | 0.501 ± 0.025 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.001 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.001 ± 0.001 |

Statistics based on 3412 proteins (1124869 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
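A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the Proteome-pI implementation: the function name bootstrap_error, the toy sequences, the fixed seed, and in particular the choice to report the standard deviation across resamples as the error are assumptions, since the note above does not say which statistic is used.

```python
import random
import statistics
from collections import Counter


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate a dipeptide frequency (per mille) and its bootstrap error.

    Whole proteins are resampled with replacement, so all dipeptides of a
    protein enter or leave a resample together. Assumption: the error is
    the standard deviation of the frequency across resamples.
    """
    rng = random.Random(seed)
    # Count overlapping dipeptides once per protein.
    per_protein = [
        Counter(seq[i:i + 2] for i in range(len(seq) - 1))
        for seq in sequences
    ]
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code (not real data).
toy_proteome = ["MAAL", "MAL", "AAAA", "MKLA"]
mean, err = bootstrap_error(toy_proteome, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual dipeptides keeps the dependence between neighbouring residues within a protein intact, which is presumably why the note specifies the protein level.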

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski