Amino acid dipeptide frequency for Beihai tombus-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
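As an illustration of what such a frequency is, the following minimal sketch counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the choices to count pairs only within a protein and to pool the counts over all proteins are assumptions made for this example; the exact procedure used to build the table is not stated here.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille}, pooled over all proteins.

    Hypothetical helper for illustration; not the database's own code.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to 'X' (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (not the real proteome): 'U' is non-standard and becomes 'X'.
freqs = dipeptide_permille(["MAAVK", "AVU"])
```

With the toy input, `freqs["AV"]` is two occurrences out of six pairs, and the values over all observed pairs sum to 1000.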

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
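The arithmetic behind the expected value and the unit conversion can be sketched as follows. The helper names are hypothetical, and the threshold is an assumption: the page does not say whether the colouring compares against 1/400 (2.5‰) or 1/441 (about 2.27‰), so the alphabet size is left as a parameter.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def to_fraction(permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return permille / 1000.0

def classify(observed_permille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "expected"

# Values taken from the table: AlaVal = 10.517, AlaCys = 1.753.
ala_val = classify(10.517)
ala_cys = classify(1.753)
```

Under the 20-letter assumption the expectation is 2.5‰ (0.25%), so AlaVal at 10.517‰ counts as overrepresented and AlaCys at 1.753‰ as underrepresented.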

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.135AlaAla: 6.135 ± 1.071
1.753AlaCys: 1.753 ± 0.918
0.0AlaAsp: 0.0 ± 0.0
7.011AlaGlu: 7.011 ± 3.185
5.259AlaPhe: 5.259 ± 2.599
6.135AlaGly: 6.135 ± 1.985
1.753AlaHis: 1.753 ± 1.03
5.259AlaIle: 5.259 ± 0.525
3.506AlaLys: 3.506 ± 0.377
2.629AlaLeu: 2.629 ± 0.991
3.506AlaMet: 3.506 ± 1.49
2.629AlaAsn: 2.629 ± 0.991
3.506AlaPro: 3.506 ± 1.292
1.753AlaGln: 1.753 ± 0.918
6.135AlaArg: 6.135 ± 2.492
4.382AlaSer: 4.382 ± 1.895
2.629AlaThr: 2.629 ± 1.925
10.517AlaVal: 10.517 ± 5.982
2.629AlaTrp: 2.629 ± 1.163
2.629AlaTyr: 2.629 ± 1.925
0.0AlaXaa: 0.0 ± 0.0
Cys
2.629CysAla: 2.629 ± 1.858
0.0CysCys: 0.0 ± 0.0
0.876CysAsp: 0.876 ± 0.787
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.876CysGly: 0.876 ± 0.619
0.0CysHis: 0.0 ± 0.0
1.753CysIle: 1.753 ± 2.098
1.753CysLys: 1.753 ± 0.918
2.629CysLeu: 2.629 ± 1.872
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.876CysPro: 0.876 ± 1.049
0.876CysGln: 0.876 ± 0.619
0.876CysArg: 0.876 ± 0.619
1.753CysSer: 1.753 ± 0.646
1.753CysThr: 1.753 ± 1.574
1.753CysVal: 1.753 ± 1.239
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.753AspAla: 1.753 ± 0.646
2.629AspCys: 2.629 ± 0.467
2.629AspAsp: 2.629 ± 0.467
0.876AspGlu: 0.876 ± 1.049
0.876AspPhe: 0.876 ± 0.619
3.506AspGly: 3.506 ± 1.621
0.0AspHis: 0.0 ± 0.0
1.753AspIle: 1.753 ± 1.574
2.629AspLys: 2.629 ± 0.467
6.135AspLeu: 6.135 ± 0.58
1.753AspMet: 1.753 ± 1.239
3.506AspAsn: 3.506 ± 0.377
5.259AspPro: 5.259 ± 1.246
0.876AspGln: 0.876 ± 0.619
0.876AspArg: 0.876 ± 0.619
4.382AspSer: 4.382 ± 1.554
0.876AspThr: 0.876 ± 0.787
3.506AspVal: 3.506 ± 1.836
0.876AspTrp: 0.876 ± 1.049
0.876AspTyr: 0.876 ± 0.787
0.0AspXaa: 0.0 ± 0.0
Glu
4.382GluAla: 4.382 ± 0.912
1.753GluCys: 1.753 ± 0.918
2.629GluAsp: 2.629 ± 1.872
1.753GluGlu: 1.753 ± 0.918
3.506GluPhe: 3.506 ± 1.292
2.629GluGly: 2.629 ± 1.872
2.629GluHis: 2.629 ± 1.858
0.0GluIle: 0.0 ± 0.0
2.629GluLys: 2.629 ± 0.991
5.259GluLeu: 5.259 ± 1.198
0.876GluMet: 0.876 ± 0.551
0.876GluAsn: 0.876 ± 0.619
3.506GluPro: 3.506 ± 1.836
1.753GluGln: 1.753 ± 1.239
5.259GluArg: 5.259 ± 1.246
5.259GluSer: 5.259 ± 2.251
1.753GluThr: 1.753 ± 1.03
3.506GluVal: 3.506 ± 2.924
1.753GluTrp: 1.753 ± 0.646
1.753GluTyr: 1.753 ± 1.574
0.0GluXaa: 0.0 ± 0.0
Phe
4.382PheAla: 4.382 ± 1.475
1.753PheCys: 1.753 ± 1.239
4.382PheAsp: 4.382 ± 2.101
0.876PheGlu: 0.876 ± 0.787
0.876PhePhe: 0.876 ± 0.787
5.259PheGly: 5.259 ± 1.246
0.0PheHis: 0.0 ± 0.0
3.506PheIle: 3.506 ± 0.912
0.876PheLys: 0.876 ± 0.619
0.876PheLeu: 0.876 ± 0.619
0.0PheMet: 0.0 ± 0.0
1.753PheAsn: 1.753 ± 0.646
0.876PhePro: 0.876 ± 0.787
1.753PheGln: 1.753 ± 1.239
1.753PheArg: 1.753 ± 0.646
3.506PheSer: 3.506 ± 0.377
2.629PheThr: 2.629 ± 1.503
3.506PheVal: 3.506 ± 1.292
0.0PheTrp: 0.0 ± 0.0
3.506PheTyr: 3.506 ± 1.521
0.0PheXaa: 0.0 ± 0.0
Gly
1.753GlyAla: 1.753 ± 1.239
1.753GlyCys: 1.753 ± 0.918
4.382GlyAsp: 4.382 ± 1.554
0.876GlyGlu: 0.876 ± 0.787
5.259GlyPhe: 5.259 ± 1.513
5.259GlyGly: 5.259 ± 1.246
0.876GlyHis: 0.876 ± 1.049
7.888GlyIle: 7.888 ± 3.147
2.629GlyLys: 2.629 ± 0.991
6.135GlyLeu: 6.135 ± 3.291
0.876GlyMet: 0.876 ± 0.619
1.753GlyAsn: 1.753 ± 1.239
4.382GlyPro: 4.382 ± 2.161
3.506GlyGln: 3.506 ± 2.049
4.382GlyArg: 4.382 ± 1.895
3.506GlySer: 3.506 ± 0.377
6.135GlyThr: 6.135 ± 1.985
3.506GlyVal: 3.506 ± 1.621
2.629GlyTrp: 2.629 ± 0.467
3.506GlyTyr: 3.506 ± 0.912
0.0GlyXaa: 0.0 ± 0.0
His
0.876HisAla: 0.876 ± 0.619
0.0HisCys: 0.0 ± 0.0
0.876HisAsp: 0.876 ± 0.787
0.876HisGlu: 0.876 ± 1.049
0.876HisPhe: 0.876 ± 1.049
2.629HisGly: 2.629 ± 0.467
0.876HisHis: 0.876 ± 0.619
3.506HisIle: 3.506 ± 0.377
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
1.753HisMet: 1.753 ± 0.918
0.876HisAsn: 0.876 ± 0.619
0.876HisPro: 0.876 ± 0.787
0.876HisGln: 0.876 ± 1.049
3.506HisArg: 3.506 ± 2.924
1.753HisSer: 1.753 ± 1.239
2.629HisThr: 2.629 ± 1.503
2.629HisVal: 2.629 ± 0.467
0.876HisTrp: 0.876 ± 0.619
0.876HisTyr: 0.876 ± 0.619
0.0HisXaa: 0.0 ± 0.0
Ile
5.259IleAla: 5.259 ± 0.525
0.0IleCys: 0.0 ± 0.0
1.753IleAsp: 1.753 ± 0.646
3.506IleGlu: 3.506 ± 0.377
0.0IlePhe: 0.0 ± 0.0
3.506IleGly: 3.506 ± 1.621
0.876IleHis: 0.876 ± 1.049
0.876IleIle: 0.876 ± 1.049
2.629IleLys: 2.629 ± 1.163
3.506IleLeu: 3.506 ± 2.049
1.753IleMet: 1.753 ± 0.918
1.753IleAsn: 1.753 ± 0.646
1.753IlePro: 1.753 ± 1.574
2.629IleGln: 2.629 ± 1.3
2.629IleArg: 2.629 ± 1.503
5.259IleSer: 5.259 ± 0.525
3.506IleThr: 3.506 ± 2.167
5.259IleVal: 5.259 ± 1.938
0.876IleTrp: 0.876 ± 0.619
1.753IleTyr: 1.753 ± 0.646
0.0IleXaa: 0.0 ± 0.0
Lys
5.259LysAla: 5.259 ± 2.251
2.629LysCys: 2.629 ± 1.163
4.382LysAsp: 4.382 ± 2.101
2.629LysGlu: 2.629 ± 1.163
3.506LysPhe: 3.506 ± 0.377
5.259LysGly: 5.259 ± 1.982
0.876LysHis: 0.876 ± 1.049
2.629LysIle: 2.629 ± 0.991
0.0LysLys: 0.0 ± 0.0
5.259LysLeu: 5.259 ± 0.934
0.0LysMet: 0.0 ± 0.805
0.876LysAsn: 0.876 ± 0.619
0.876LysPro: 0.876 ± 0.619
1.753LysGln: 1.753 ± 1.574
2.629LysArg: 2.629 ± 1.163
0.876LysSer: 0.876 ± 0.787
0.0LysThr: 0.0 ± 0.0
2.629LysVal: 2.629 ± 0.991
0.876LysTrp: 0.876 ± 0.619
3.506LysTyr: 3.506 ± 0.377
0.0LysXaa: 0.0 ± 0.0
Leu
9.641LeuAla: 9.641 ± 1.263
1.753LeuCys: 1.753 ± 1.03
4.382LeuAsp: 4.382 ± 3.928
5.259LeuGlu: 5.259 ± 2.735
1.753LeuPhe: 1.753 ± 1.03
2.629LeuGly: 2.629 ± 0.991
2.629LeuHis: 2.629 ± 1.925
1.753LeuIle: 1.753 ± 0.646
2.629LeuLys: 2.629 ± 1.858
10.517LeuLeu: 10.517 ± 3.333
1.753LeuMet: 1.753 ± 0.646
5.259LeuAsn: 5.259 ± 0.934
6.135LeuPro: 6.135 ± 1.944
3.506LeuGln: 3.506 ± 1.521
4.382LeuArg: 4.382 ± 1.226
4.382LeuSer: 4.382 ± 1.554
3.506LeuThr: 3.506 ± 1.292
2.629LeuVal: 2.629 ± 1.3
0.0LeuTrp: 0.0 ± 0.0
1.753LeuTyr: 1.753 ± 0.646
0.0LeuXaa: 0.0 ± 0.0
Met
2.629MetAla: 2.629 ± 1.503
0.0MetCys: 0.0 ± 0.0
1.753MetAsp: 1.753 ± 0.918
1.753MetGlu: 1.753 ± 0.918
0.876MetPhe: 0.876 ± 0.787
0.876MetGly: 0.876 ± 0.619
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
1.753MetLys: 1.753 ± 1.239
0.876MetLeu: 0.876 ± 0.619
0.0MetMet: 0.0 ± 0.0
0.876MetAsn: 0.876 ± 0.619
0.876MetPro: 0.876 ± 0.619
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
4.382MetSer: 4.382 ± 0.912
1.753MetThr: 1.753 ± 1.03
3.506MetVal: 3.506 ± 0.912
0.0MetTrp: 0.0 ± 0.0
0.876MetTyr: 0.876 ± 0.619
0.0MetXaa: 0.0 ± 0.0
Asn
2.629AsnAla: 2.629 ± 1.503
0.876AsnCys: 0.876 ± 0.619
2.629AsnAsp: 2.629 ± 1.163
0.876AsnGlu: 0.876 ± 1.049
1.753AsnPhe: 1.753 ± 0.646
6.135AsnGly: 6.135 ± 2.516
0.876AsnHis: 0.876 ± 1.049
1.753AsnIle: 1.753 ± 1.03
0.876AsnLys: 0.876 ± 0.619
4.382AsnLeu: 4.382 ± 2.101
0.876AsnMet: 0.876 ± 0.787
2.629AsnAsn: 2.629 ± 0.991
5.259AsnPro: 5.259 ± 0.525
1.753AsnGln: 1.753 ± 0.918
1.753AsnArg: 1.753 ± 0.918
1.753AsnSer: 1.753 ± 0.646
4.382AsnThr: 4.382 ± 1.895
1.753AsnVal: 1.753 ± 1.239
0.0AsnTrp: 0.0 ± 0.0
0.876AsnTyr: 0.876 ± 1.049
0.0AsnXaa: 0.0 ± 0.0
Pro
3.506ProAla: 3.506 ± 0.377
1.753ProCys: 1.753 ± 0.646
3.506ProAsp: 3.506 ± 1.621
1.753ProGlu: 1.753 ± 0.918
1.753ProPhe: 1.753 ± 1.239
3.506ProGly: 3.506 ± 1.292
2.629ProHis: 2.629 ± 1.163
0.0ProIle: 0.0 ± 0.0
2.629ProLys: 2.629 ± 0.467
2.629ProLeu: 2.629 ± 1.163
0.876ProMet: 0.876 ± 0.787
1.753ProAsn: 1.753 ± 0.918
1.753ProPro: 1.753 ± 0.646
3.506ProGln: 3.506 ± 2.049
5.259ProArg: 5.259 ± 0.934
4.382ProSer: 4.382 ± 1.475
4.382ProThr: 4.382 ± 1.554
7.888ProVal: 7.888 ± 1.479
0.876ProTrp: 0.876 ± 0.787
0.876ProTyr: 0.876 ± 0.787
0.0ProXaa: 0.0 ± 0.0
Gln
4.382GlnAla: 4.382 ± 1.638
0.876GlnCys: 0.876 ± 0.619
0.0GlnAsp: 0.0 ± 0.0
0.0GlnGlu: 0.0 ± 0.0
0.876GlnPhe: 0.876 ± 0.787
3.506GlnGly: 3.506 ± 2.049
0.876GlnHis: 0.876 ± 0.619
1.753GlnIle: 1.753 ± 0.918
3.506GlnLys: 3.506 ± 0.912
1.753GlnLeu: 1.753 ± 0.646
0.876GlnMet: 0.876 ± 0.619
0.876GlnAsn: 0.876 ± 0.787
4.382GlnPro: 4.382 ± 1.895
1.753GlnGln: 1.753 ± 1.574
2.629GlnArg: 2.629 ± 0.467
3.506GlnSer: 3.506 ± 1.292
1.753GlnThr: 1.753 ± 0.918
0.876GlnVal: 0.876 ± 0.619
0.876GlnTrp: 0.876 ± 0.619
1.753GlnTyr: 1.753 ± 0.918
0.0GlnXaa: 0.0 ± 0.0
Arg
5.259ArgAla: 5.259 ± 1.513
0.0ArgCys: 0.0 ± 0.0
0.0ArgAsp: 0.0 ± 0.0
4.382ArgGlu: 4.382 ± 2.535
4.382ArgPhe: 4.382 ± 0.412
3.506ArgGly: 3.506 ± 0.912
2.629ArgHis: 2.629 ± 1.163
2.629ArgIle: 2.629 ± 1.3
5.259ArgLys: 5.259 ± 2.251
5.259ArgLeu: 5.259 ± 0.525
1.753ArgMet: 1.753 ± 1.239
5.259ArgAsn: 5.259 ± 1.198
1.753ArgPro: 1.753 ± 0.918
0.0ArgGln: 0.0 ± 0.0
5.259ArgArg: 5.259 ± 2.754
2.629ArgSer: 2.629 ± 1.872
1.753ArgThr: 1.753 ± 1.03
7.011ArgVal: 7.011 ± 2.525
0.876ArgTrp: 0.876 ± 1.049
3.506ArgTyr: 3.506 ± 1.49
0.0ArgXaa: 0.0 ± 0.0
Ser
6.135SerAla: 6.135 ± 3.178
0.0SerCys: 0.0 ± 0.0
0.876SerAsp: 0.876 ± 0.619
4.382SerGlu: 4.382 ± 0.912
3.506SerPhe: 3.506 ± 1.521
3.506SerGly: 3.506 ± 1.292
2.629SerHis: 2.629 ± 0.467
6.135SerIle: 6.135 ± 2.492
4.382SerLys: 4.382 ± 1.554
5.259SerLeu: 5.259 ± 1.513
3.506SerMet: 3.506 ± 1.99
5.259SerAsn: 5.259 ± 1.198
5.259SerPro: 5.259 ± 2.325
2.629SerGln: 2.629 ± 1.503
3.506SerArg: 3.506 ± 1.836
7.011SerSer: 7.011 ± 3.077
2.629SerThr: 2.629 ± 1.503
7.011SerVal: 7.011 ± 0.599
0.876SerTrp: 0.876 ± 1.049
1.753SerTyr: 1.753 ± 0.646
0.0SerXaa: 0.0 ± 0.0
Thr
5.259ThrAla: 5.259 ± 1.938
0.876ThrCys: 0.876 ± 0.787
2.629ThrAsp: 2.629 ± 1.503
2.629ThrGlu: 2.629 ± 0.467
1.753ThrPhe: 1.753 ± 0.646
0.876ThrGly: 0.876 ± 1.049
1.753ThrHis: 1.753 ± 1.574
5.259ThrIle: 5.259 ± 1.836
2.629ThrLys: 2.629 ± 0.467
3.506ThrLeu: 3.506 ± 0.912
0.0ThrMet: 0.0 ± 0.0
1.753ThrAsn: 1.753 ± 1.03
2.629ThrPro: 2.629 ± 1.3
0.876ThrGln: 0.876 ± 1.049
2.629ThrArg: 2.629 ± 1.503
4.382ThrSer: 4.382 ± 2.893
6.135ThrThr: 6.135 ± 1.985
4.382ThrVal: 4.382 ± 1.895
1.753ThrTrp: 1.753 ± 0.646
1.753ThrTyr: 1.753 ± 1.574
0.0ThrXaa: 0.0 ± 0.0
Val
5.259ValAla: 5.259 ± 2.392
0.0ValCys: 0.0 ± 0.0
5.259ValAsp: 5.259 ± 1.198
10.517ValGlu: 10.517 ± 1.13
4.382ValPhe: 4.382 ± 2.161
8.764ValGly: 8.764 ± 0.645
2.629ValHis: 2.629 ± 0.467
1.753ValIle: 1.753 ± 0.646
3.506ValLys: 3.506 ± 0.377
3.506ValLeu: 3.506 ± 0.377
0.876ValMet: 0.876 ± 0.619
1.753ValAsn: 1.753 ± 1.03
4.382ValPro: 4.382 ± 1.638
2.629ValGln: 2.629 ± 1.163
6.135ValArg: 6.135 ± 1.071
8.764ValSer: 8.764 ± 0.824
2.629ValThr: 2.629 ± 1.3
6.135ValVal: 6.135 ± 1.217
1.753ValTrp: 1.753 ± 0.646
3.506ValTyr: 3.506 ± 1.621
0.0ValXaa: 0.0 ± 0.0
Trp
0.876TrpAla: 0.876 ± 0.787
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.876TrpGlu: 0.876 ± 0.619
0.876TrpPhe: 0.876 ± 1.049
1.753TrpGly: 1.753 ± 1.239
0.0TrpHis: 0.0 ± 0.0
0.876TrpIle: 0.876 ± 0.619
1.753TrpLys: 1.753 ± 0.918
2.629TrpLeu: 2.629 ± 0.467
0.876TrpMet: 0.876 ± 0.619
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
3.506TrpGln: 3.506 ± 2.049
0.876TrpArg: 0.876 ± 1.049
0.876TrpSer: 0.876 ± 0.619
0.0TrpThr: 0.0 ± 0.0
0.876TrpVal: 0.876 ± 0.619
0.0TrpTrp: 0.0 ± 0.0
0.876TrpTyr: 0.876 ± 1.049
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.753TyrAla: 1.753 ± 2.098
0.0TyrCys: 0.0 ± 0.0
2.629TyrAsp: 2.629 ± 0.467
2.629TyrGlu: 2.629 ± 1.872
0.0TyrPhe: 0.0 ± 0.0
0.876TyrGly: 0.876 ± 1.049
2.629TyrHis: 2.629 ± 1.3
0.0TyrIle: 0.0 ± 0.0
1.753TyrLys: 1.753 ± 0.646
3.506TyrLeu: 3.506 ± 0.377
0.0TyrMet: 0.0 ± 0.0
4.382TyrAsn: 4.382 ± 0.912
0.876TyrPro: 0.876 ± 0.619
0.876TyrGln: 0.876 ± 0.787
2.629TyrArg: 2.629 ± 0.467
3.506TyrSer: 3.506 ± 0.377
2.629TyrThr: 2.629 ± 1.3
5.259TyrVal: 5.259 ± 2.599
0.0TyrTrp: 0.0 ± 0.0
1.753TyrTyr: 1.753 ± 1.239
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1142 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
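A protein-level bootstrap of this kind could look roughly like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the replicates is taken as the error. The function names, the toy sequences, the fixed seed, and the use of the population standard deviation as the error measure are all assumptions; the page only states that 100 bootstrap rounds at the protein level were used.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Per mille frequency of one dipeptide, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumed error measure: standard deviation across replicates.
    return statistics.pstdev(estimates)

# Toy sequences, not the real proteome.
toy_proteins = ["MAAVKAV", "MKLLAV", "MSSVE"]
error = bootstrap_error(toy_proteins, "AV")
```

With only three proteins in the proteome, each replicate is drawn from a very small pool, which may explain why some errors in the table are comparable to the values themselves.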

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski