Amino acid dipeptide frequency for Marinifilum breve

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
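As a minimal sketch of how such frequencies can be computed, the Python snippet below counts overlapping adjacent residue pairs within each protein and normalizes by the total number of pairs. The function name and the toy sequences are hypothetical, one-letter codes are used for brevity instead of the three-letter codes in the table, and whether Proteome-pI counts pairs in exactly this way is an assumption.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy sequences, not real Marinifilum breve data.
toy_proteome = ["MKLLA", "MALK"]
freqs = dipeptide_frequencies(toy_proteome)
```

In this toy input there are seven pairs in total and each occurs once, so every observed dipeptide gets the same frequency and the frequencies sum to 1000 ‰.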

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
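The arithmetic behind the expected value, and the over/under-representation rule, can be sketched as follows. This assumes the colouring threshold is exactly the uniform expectation of 1/400; the function name `classify` is hypothetical, and the three observed values are taken from the table on this page.

```python
N_STANDARD = 20  # standard amino acids

# Uniform expectation per dipeptide, in per mille.
expected_permille = 1000.0 / N_STANDARD ** 2          # 2.5 per mille = 0.25 %
expected_with_xaa = 1000.0 / (N_STANDARD + 1) ** 2    # about 2.27 per mille

def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "red"    # more frequent than expected
    if observed_permille < expected:
        return "blue"   # underrepresented
    return "neutral"

# Observed values (per mille) copied from the table.
examples = {"LeuLeu": 8.936, "TrpTrp": 0.141, "AlaAla": 3.946}
labels = {name: classify(value) for name, value in examples.items()}
```

Under this assumed rule, LeuLeu and AlaAla fall above the 2.5 ‰ expectation and TrpTrp falls below it.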

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
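As a short worked example of this conversion (the helper name is hypothetical), the AlaAla value of 3.946 ‰ from the table corresponds to a fraction of about 0.003946, or roughly 0.39%:

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction."""
    return value_permille * 1e-3

ala_ala_fraction = permille_to_fraction(3.946)  # AlaAla value from the table
ala_ala_percent = ala_ala_fraction * 100.0      # same value as a percentage
```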

For more information, see the sequence space article on Wikipedia.

Residue order (rows and columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Each entry reads FirstSecond: frequency ± error (‰), grouped by the first residue.
Ala
AlaAla: 3.946 ± 0.063
AlaCys: 0.749 ± 0.022
AlaAsp: 3.561 ± 0.063
AlaGlu: 4.251 ± 0.069
AlaPhe: 3.066 ± 0.043
AlaGly: 4.067 ± 0.069
AlaHis: 1.06 ± 0.027
AlaIle: 4.908 ± 0.061
AlaLys: 4.923 ± 0.058
AlaLeu: 5.666 ± 0.074
AlaMet: 1.637 ± 0.038
AlaAsn: 3.473 ± 0.057
AlaPro: 1.733 ± 0.041
AlaGln: 2.218 ± 0.037
AlaArg: 2.052 ± 0.042
AlaSer: 3.968 ± 0.064
AlaThr: 2.956 ± 0.054
AlaVal: 3.748 ± 0.055
AlaTrp: 0.624 ± 0.021
AlaTyr: 2.528 ± 0.048
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.525 ± 0.019
CysCys: 0.138 ± 0.01
CysAsp: 0.535 ± 0.018
CysGlu: 0.641 ± 0.023
CysPhe: 0.507 ± 0.019
CysGly: 0.733 ± 0.022
CysHis: 0.236 ± 0.017
CysIle: 0.727 ± 0.021
CysLys: 0.733 ± 0.02
CysLeu: 0.817 ± 0.025
CysMet: 0.226 ± 0.011
CysAsn: 0.527 ± 0.019
CysPro: 0.415 ± 0.018
CysGln: 0.27 ± 0.012
CysArg: 0.347 ± 0.014
CysSer: 0.686 ± 0.023
CysThr: 0.48 ± 0.018
CysVal: 0.541 ± 0.02
CysTrp: 0.098 ± 0.008
CysTyr: 0.381 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.411 ± 0.054
AspCys: 0.496 ± 0.016
AspAsp: 3.0 ± 0.054
AspGlu: 4.145 ± 0.049
AspPhe: 3.495 ± 0.044
AspGly: 3.641 ± 0.06
AspHis: 1.027 ± 0.027
AspIle: 4.344 ± 0.058
AspLys: 4.32 ± 0.047
AspLeu: 5.621 ± 0.058
AspMet: 1.298 ± 0.033
AspAsn: 2.963 ± 0.054
AspPro: 1.898 ± 0.031
AspGln: 1.95 ± 0.036
AspArg: 2.095 ± 0.037
AspSer: 3.27 ± 0.05
AspThr: 2.41 ± 0.043
AspVal: 3.563 ± 0.059
AspTrp: 0.772 ± 0.024
AspTyr: 2.812 ± 0.044
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.177 ± 0.056
GluCys: 0.534 ± 0.02
GluAsp: 3.665 ± 0.052
GluGlu: 5.811 ± 0.088
GluPhe: 3.487 ± 0.047
GluGly: 3.915 ± 0.056
GluHis: 1.151 ± 0.026
GluIle: 5.924 ± 0.074
GluLys: 6.523 ± 0.072
GluLeu: 7.191 ± 0.071
GluMet: 2.043 ± 0.036
GluAsn: 4.823 ± 0.063
GluPro: 1.498 ± 0.033
GluGln: 2.35 ± 0.038
GluArg: 2.557 ± 0.044
GluSer: 4.019 ± 0.05
GluThr: 3.145 ± 0.045
GluVal: 4.455 ± 0.063
GluTrp: 0.737 ± 0.021
GluTyr: 2.931 ± 0.043
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.108 ± 0.048
PheCys: 0.548 ± 0.018
PheAsp: 3.219 ± 0.043
PheGlu: 3.423 ± 0.046
PhePhe: 2.423 ± 0.051
PheGly: 3.302 ± 0.053
PheHis: 0.869 ± 0.025
PheIle: 3.616 ± 0.059
PheLys: 3.599 ± 0.049
PheLeu: 4.403 ± 0.068
PheMet: 1.189 ± 0.03
PheAsn: 2.906 ± 0.043
PhePro: 1.618 ± 0.036
PheGln: 1.539 ± 0.034
PheArg: 1.826 ± 0.036
PheSer: 3.85 ± 0.056
PheThr: 2.901 ± 0.049
PheVal: 3.188 ± 0.047
PheTrp: 0.568 ± 0.021
PheTyr: 2.12 ± 0.039
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.826 ± 0.057
GlyCys: 0.662 ± 0.023
GlyAsp: 3.384 ± 0.053
GlyGlu: 4.115 ± 0.05
GlyPhe: 3.437 ± 0.054
GlyGly: 4.321 ± 0.071
GlyHis: 1.083 ± 0.028
GlyIle: 5.378 ± 0.058
GlyLys: 5.533 ± 0.064
GlyLeu: 5.555 ± 0.069
GlyMet: 1.802 ± 0.036
GlyAsn: 3.707 ± 0.061
GlyPro: 1.255 ± 0.027
GlyGln: 1.687 ± 0.032
GlyArg: 2.191 ± 0.043
GlySer: 4.003 ± 0.061
GlyThr: 3.55 ± 0.064
GlyVal: 4.444 ± 0.062
GlyTrp: 0.777 ± 0.022
GlyTyr: 2.789 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.015 ± 0.026
HisCys: 0.227 ± 0.014
HisAsp: 0.896 ± 0.026
HisGlu: 1.127 ± 0.029
HisPhe: 1.109 ± 0.029
HisGly: 1.085 ± 0.029
HisHis: 0.452 ± 0.019
HisIle: 1.34 ± 0.031
HisLys: 1.296 ± 0.03
HisLeu: 1.857 ± 0.042
HisMet: 0.411 ± 0.014
HisAsn: 0.906 ± 0.027
HisPro: 0.93 ± 0.028
HisGln: 0.613 ± 0.02
HisArg: 0.721 ± 0.027
HisSer: 1.22 ± 0.029
HisThr: 0.895 ± 0.023
HisVal: 0.985 ± 0.027
HisTrp: 0.237 ± 0.015
HisTyr: 0.856 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.127 ± 0.068
IleCys: 0.776 ± 0.024
IleAsp: 4.837 ± 0.061
IleGlu: 5.608 ± 0.07
IlePhe: 3.346 ± 0.05
IleGly: 5.02 ± 0.07
IleHis: 1.504 ± 0.032
IleIle: 5.35 ± 0.069
IleLys: 5.714 ± 0.066
IleLeu: 7.036 ± 0.079
IleMet: 1.441 ± 0.031
IleAsn: 4.512 ± 0.059
IlePro: 3.143 ± 0.043
IleGln: 2.689 ± 0.044
IleArg: 3.015 ± 0.043
IleSer: 5.788 ± 0.076
IleThr: 4.079 ± 0.059
IleVal: 4.703 ± 0.061
IleTrp: 0.755 ± 0.028
IleTyr: 2.776 ± 0.045
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.096 ± 0.069
LysCys: 0.543 ± 0.022
LysAsp: 4.711 ± 0.064
LysGlu: 6.998 ± 0.079
LysPhe: 3.104 ± 0.042
LysGly: 4.787 ± 0.058
LysHis: 1.505 ± 0.031
LysIle: 6.064 ± 0.068
LysLys: 6.837 ± 0.092
LysLeu: 7.286 ± 0.068
LysMet: 2.208 ± 0.041
LysAsn: 5.059 ± 0.058
LysPro: 2.253 ± 0.043
LysGln: 2.868 ± 0.048
LysArg: 3.114 ± 0.047
LysSer: 4.944 ± 0.064
LysThr: 3.937 ± 0.05
LysVal: 5.019 ± 0.059
LysTrp: 0.881 ± 0.024
LysTyr: 3.491 ± 0.056
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.713 ± 0.07
LeuCys: 0.826 ± 0.022
LeuAsp: 4.883 ± 0.056
LeuGlu: 6.162 ± 0.073
LeuPhe: 4.792 ± 0.077
LeuGly: 5.586 ± 0.072
LeuHis: 1.607 ± 0.037
LeuIle: 7.141 ± 0.101
LeuLys: 7.933 ± 0.08
LeuLeu: 8.936 ± 0.101
LeuMet: 2.231 ± 0.04
LeuAsn: 5.87 ± 0.072
LeuPro: 3.445 ± 0.049
LeuGln: 3.225 ± 0.053
LeuArg: 3.406 ± 0.056
LeuSer: 7.146 ± 0.078
LeuThr: 4.508 ± 0.061
LeuVal: 5.293 ± 0.059
LeuTrp: 0.858 ± 0.026
LeuTyr: 3.253 ± 0.039
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.723 ± 0.036
MetCys: 0.183 ± 0.012
MetAsp: 1.442 ± 0.03
MetGlu: 1.625 ± 0.029
MetPhe: 0.926 ± 0.025
MetGly: 1.681 ± 0.034
MetHis: 0.447 ± 0.016
MetIle: 1.816 ± 0.038
MetLys: 2.36 ± 0.042
MetLeu: 2.109 ± 0.039
MetMet: 0.594 ± 0.021
MetAsn: 1.687 ± 0.035
MetPro: 0.929 ± 0.024
MetGln: 0.846 ± 0.026
MetArg: 0.948 ± 0.021
MetSer: 1.445 ± 0.032
MetThr: 1.064 ± 0.026
MetVal: 1.524 ± 0.031
MetTrp: 0.198 ± 0.012
MetTyr: 0.781 ± 0.024
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.509 ± 0.052
AsnCys: 0.606 ± 0.022
AsnAsp: 3.056 ± 0.053
AsnGlu: 3.866 ± 0.052
AsnPhe: 2.875 ± 0.047
AsnGly: 4.072 ± 0.062
AsnHis: 1.202 ± 0.028
AsnIle: 4.543 ± 0.057
AsnLys: 4.547 ± 0.061
AsnLeu: 5.621 ± 0.075
AsnMet: 1.386 ± 0.03
AsnAsn: 3.376 ± 0.054
AsnPro: 2.584 ± 0.043
AsnGln: 2.395 ± 0.041
AsnArg: 2.277 ± 0.04
AsnSer: 3.99 ± 0.061
AsnThr: 3.1 ± 0.056
AsnVal: 3.466 ± 0.048
AsnTrp: 0.84 ± 0.021
AsnTyr: 2.859 ± 0.049
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.907 ± 0.04
ProCys: 0.271 ± 0.014
ProAsp: 2.144 ± 0.039
ProGlu: 2.918 ± 0.048
ProPhe: 1.702 ± 0.033
ProGly: 1.983 ± 0.044
ProHis: 0.591 ± 0.021
ProIle: 2.431 ± 0.038
ProLys: 2.501 ± 0.047
ProLeu: 2.714 ± 0.039
ProMet: 0.759 ± 0.022
ProAsn: 2.074 ± 0.034
ProPro: 0.714 ± 0.023
ProGln: 1.052 ± 0.026
ProArg: 0.962 ± 0.025
ProSer: 1.967 ± 0.035
ProThr: 1.671 ± 0.033
ProVal: 2.442 ± 0.05
ProTrp: 0.328 ± 0.014
ProTyr: 1.326 ± 0.028
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.114 ± 0.042
GlnCys: 0.225 ± 0.011
GlnAsp: 1.662 ± 0.038
GlnGlu: 2.544 ± 0.048
GlnPhe: 1.724 ± 0.035
GlnGly: 1.662 ± 0.033
GlnHis: 0.56 ± 0.021
GlnIle: 2.75 ± 0.043
GlnLys: 3.071 ± 0.047
GlnLeu: 3.45 ± 0.048
GlnMet: 0.934 ± 0.024
GlnAsn: 2.161 ± 0.038
GlnPro: 0.927 ± 0.032
GlnGln: 1.212 ± 0.032
GlnArg: 1.227 ± 0.033
GlnSer: 1.994 ± 0.032
GlnThr: 1.686 ± 0.036
GlnVal: 2.021 ± 0.033
GlnTrp: 0.347 ± 0.016
GlnTyr: 1.402 ± 0.032
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.0 ± 0.036
ArgCys: 0.282 ± 0.013
ArgAsp: 1.877 ± 0.037
ArgGlu: 2.56 ± 0.053
ArgPhe: 2.005 ± 0.035
ArgGly: 2.009 ± 0.039
ArgHis: 0.625 ± 0.019
ArgIle: 3.225 ± 0.049
ArgLys: 3.38 ± 0.057
ArgLeu: 3.445 ± 0.046
ArgMet: 1.101 ± 0.029
ArgAsn: 2.31 ± 0.043
ArgPro: 1.045 ± 0.028
ArgGln: 1.075 ± 0.029
ArgArg: 1.517 ± 0.038
ArgSer: 2.152 ± 0.04
ArgThr: 1.846 ± 0.033
ArgVal: 2.251 ± 0.037
ArgTrp: 0.411 ± 0.015
ArgTyr: 1.699 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.8 ± 0.052
SerCys: 0.75 ± 0.026
SerAsp: 3.741 ± 0.05
SerGlu: 4.339 ± 0.059
SerPhe: 3.841 ± 0.052
SerGly: 4.714 ± 0.074
SerHis: 1.174 ± 0.028
SerIle: 5.424 ± 0.074
SerLys: 5.053 ± 0.056
SerLeu: 6.185 ± 0.061
SerMet: 1.505 ± 0.032
SerAsn: 3.927 ± 0.053
SerPro: 2.04 ± 0.037
SerGln: 2.015 ± 0.036
SerArg: 2.307 ± 0.039
SerSer: 4.542 ± 0.075
SerThr: 3.24 ± 0.047
SerVal: 4.314 ± 0.049
SerTrp: 0.788 ± 0.023
SerTyr: 2.926 ± 0.05
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.233 ± 0.057
ThrCys: 0.482 ± 0.019
ThrAsp: 3.025 ± 0.056
ThrGlu: 3.172 ± 0.043
ThrPhe: 2.295 ± 0.047
ThrGly: 3.745 ± 0.059
ThrHis: 0.952 ± 0.026
ThrIle: 4.075 ± 0.062
ThrLys: 3.467 ± 0.053
ThrLeu: 4.343 ± 0.061
ThrMet: 0.906 ± 0.023
ThrAsn: 2.908 ± 0.055
ThrPro: 2.109 ± 0.033
ThrGln: 1.584 ± 0.036
ThrArg: 1.665 ± 0.036
ThrSer: 3.351 ± 0.056
ThrThr: 2.736 ± 0.059
ThrVal: 3.198 ± 0.06
ThrTrp: 0.528 ± 0.019
ThrTyr: 2.176 ± 0.05
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.874 ± 0.059
ValCys: 0.679 ± 0.022
ValAsp: 3.893 ± 0.051
ValGlu: 4.364 ± 0.057
ValPhe: 3.159 ± 0.047
ValGly: 3.85 ± 0.055
ValHis: 0.982 ± 0.024
ValIle: 4.691 ± 0.057
ValLys: 4.84 ± 0.058
ValLeu: 5.71 ± 0.072
ValMet: 1.441 ± 0.03
ValAsn: 3.73 ± 0.052
ValPro: 2.133 ± 0.041
ValGln: 1.821 ± 0.034
ValArg: 2.298 ± 0.039
ValSer: 4.421 ± 0.066
ValThr: 2.864 ± 0.064
ValVal: 4.2 ± 0.059
ValTrp: 0.65 ± 0.021
ValTyr: 2.488 ± 0.044
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.619 ± 0.021
TrpCys: 0.123 ± 0.009
TrpAsp: 0.664 ± 0.023
TrpGlu: 0.788 ± 0.024
TrpPhe: 0.528 ± 0.015
TrpGly: 0.743 ± 0.022
TrpHis: 0.218 ± 0.012
TrpIle: 0.866 ± 0.027
TrpLys: 0.904 ± 0.025
TrpLeu: 0.939 ± 0.028
TrpMet: 0.351 ± 0.015
TrpAsn: 0.727 ± 0.023
TrpPro: 0.233 ± 0.015
TrpGln: 0.416 ± 0.019
TrpArg: 0.434 ± 0.018
TrpSer: 0.701 ± 0.023
TrpThr: 0.563 ± 0.019
TrpVal: 0.66 ± 0.018
TrpTrp: 0.141 ± 0.01
TrpTyr: 0.47 ± 0.017
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.41 ± 0.041
TyrCys: 0.461 ± 0.019
TyrAsp: 2.334 ± 0.041
TyrGlu: 2.478 ± 0.038
TyrPhe: 2.395 ± 0.042
TyrGly: 2.582 ± 0.047
TyrHis: 0.934 ± 0.024
TyrIle: 2.606 ± 0.046
TyrLys: 3.169 ± 0.043
TyrLeu: 3.929 ± 0.055
TyrMet: 0.864 ± 0.024
TyrAsn: 2.524 ± 0.045
TyrPro: 1.557 ± 0.031
TyrGln: 1.844 ± 0.035
TyrArg: 1.857 ± 0.037
TyrSer: 3.17 ± 0.047
TyrThr: 2.323 ± 0.046
TyrVal: 2.074 ± 0.037
TyrTrp: 0.553 ± 0.022
TyrTyr: 1.945 ± 0.041
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4572 proteins (1604438 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
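A minimal sketch of protein-level bootstrapping is shown below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resulting estimates is reported. The function names and toy sequences are hypothetical, and reporting the standard deviation of the resampled estimates as the error is an assumption, since the page does not state which statistic is used.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical toy sequences, not real Marinifilum breve data.
toy_proteome = ["MKLLA", "MALK", "LLKLL", "AKLA"]
error = bootstrap_error(toy_proteome, "LL")
```

Resampling proteins rather than individual residues keeps the pairs within each protein together, which is what "at the protein level" suggests.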

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski