Amino acid dipepetide frequency for Rubus canadensis virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.779AlaAla: 4.779 ± 5.709
1.103AlaCys: 1.103 ± 0.554
1.838AlaAsp: 1.838 ± 0.479
5.147AlaGlu: 5.147 ± 2.389
5.147AlaPhe: 5.147 ± 1.372
4.779AlaGly: 4.779 ± 2.763
1.103AlaHis: 1.103 ± 1.098
4.044AlaIle: 4.044 ± 1.748
3.676AlaLys: 3.676 ± 1.24
3.676AlaLeu: 3.676 ± 0.958
0.368AlaMet: 0.368 ± 0.185
2.941AlaAsn: 2.941 ± 0.9
1.838AlaPro: 1.838 ± 0.479
1.103AlaGln: 1.103 ± 0.554
1.838AlaArg: 1.838 ± 0.479
3.676AlaSer: 3.676 ± 0.64
1.838AlaThr: 1.838 ± 0.928
4.779AlaVal: 4.779 ± 1.286
0.735AlaTrp: 0.735 ± 1.171
0.368AlaTyr: 0.368 ± 0.744
0.0AlaXaa: 0.0 ± 0.0
Cys
0.735CysAla: 0.735 ± 0.514
0.368CysCys: 0.368 ± 0.185
0.735CysAsp: 0.735 ± 1.171
0.735CysGlu: 0.735 ± 0.645
2.941CysPhe: 2.941 ± 1.093
1.471CysGly: 1.471 ± 0.941
0.0CysHis: 0.0 ± 0.0
1.471CysIle: 1.471 ± 0.537
1.471CysLys: 1.471 ± 0.739
4.412CysLeu: 4.412 ± 1.766
0.368CysMet: 0.368 ± 0.185
0.368CysAsn: 0.368 ± 0.185
0.368CysPro: 0.368 ± 0.185
0.368CysGln: 0.368 ± 0.185
1.103CysArg: 1.103 ± 0.554
2.941CysSer: 2.941 ± 0.831
1.838CysThr: 1.838 ± 0.928
2.574CysVal: 2.574 ± 1.334
0.368CysTrp: 0.368 ± 0.185
1.103CysTyr: 1.103 ± 0.554
0.0CysXaa: 0.0 ± 0.0
Asp
2.574AspAla: 2.574 ± 0.404
1.838AspCys: 1.838 ± 0.924
2.206AspAsp: 2.206 ± 0.561
2.206AspGlu: 2.206 ± 0.561
3.676AspPhe: 3.676 ± 1.001
2.574AspGly: 2.574 ± 0.799
0.735AspHis: 0.735 ± 0.37
2.941AspIle: 2.941 ± 1.479
4.779AspLys: 4.779 ± 2.446
6.985AspLeu: 6.985 ± 1.181
1.838AspMet: 1.838 ± 0.479
2.574AspAsn: 2.574 ± 0.584
1.471AspPro: 1.471 ± 0.739
0.735AspGln: 0.735 ± 0.645
1.838AspArg: 1.838 ± 0.665
5.515AspSer: 5.515 ± 1.335
0.368AspThr: 0.368 ± 0.185
3.309AspVal: 3.309 ± 1.068
0.735AspTrp: 0.735 ± 0.37
1.838AspTyr: 1.838 ± 0.798
0.0AspXaa: 0.0 ± 0.0
Glu
4.044GluAla: 4.044 ± 0.775
0.368GluCys: 0.368 ± 0.185
2.574GluAsp: 2.574 ± 1.237
7.721GluGlu: 7.721 ± 1.639
4.412GluPhe: 4.412 ± 0.814
4.044GluGly: 4.044 ± 0.775
1.103GluHis: 1.103 ± 0.554
6.985GluIle: 6.985 ± 0.894
6.985GluLys: 6.985 ± 1.364
6.25GluLeu: 6.25 ± 4.415
0.368GluMet: 0.368 ± 0.185
3.309GluAsn: 3.309 ± 0.863
2.206GluPro: 2.206 ± 0.518
1.103GluGln: 1.103 ± 0.554
3.309GluArg: 3.309 ± 1.616
5.882GluSer: 5.882 ± 1.62
2.206GluThr: 2.206 ± 0.858
7.721GluVal: 7.721 ± 1.955
0.368GluTrp: 0.368 ± 0.185
1.471GluTyr: 1.471 ± 0.478
0.0GluXaa: 0.0 ± 0.0
Phe
3.676PheAla: 3.676 ± 0.92
1.471PheCys: 1.471 ± 0.827
4.044PheAsp: 4.044 ± 1.117
6.985PheGlu: 6.985 ± 1.501
3.309PhePhe: 3.309 ± 1.068
4.779PheGly: 4.779 ± 0.87
2.206PheHis: 2.206 ± 1.103
2.941PheIle: 2.941 ± 1.678
2.206PheLys: 2.206 ± 0.561
8.824PheLeu: 8.824 ± 2.264
0.735PheMet: 0.735 ± 0.37
3.676PheAsn: 3.676 ± 1.143
1.838PhePro: 1.838 ± 0.798
1.471PheGln: 1.471 ± 0.739
2.206PheArg: 2.206 ± 0.595
6.25PheSer: 6.25 ± 1.512
3.309PheThr: 3.309 ± 0.978
3.309PheVal: 3.309 ± 0.828
0.368PheTrp: 0.368 ± 0.185
1.838PheTyr: 1.838 ± 0.581
0.0PheXaa: 0.0 ± 0.0
Gly
3.309GlyAla: 3.309 ± 2.187
1.103GlyCys: 1.103 ± 0.867
3.676GlyAsp: 3.676 ± 0.772
3.309GlyGlu: 3.309 ± 0.863
4.412GlyPhe: 4.412 ± 0.893
3.676GlyGly: 3.676 ± 4.555
0.735GlyHis: 0.735 ± 0.37
3.309GlyIle: 3.309 ± 0.398
5.882GlyLys: 5.882 ± 1.642
5.147GlyLeu: 5.147 ± 1.52
1.471GlyMet: 1.471 ± 0.743
4.044GlyAsn: 4.044 ± 0.653
1.103GlyPro: 1.103 ± 1.028
2.206GlyGln: 2.206 ± 0.674
3.676GlyArg: 3.676 ± 0.855
5.515GlySer: 5.515 ± 1.252
2.941GlyThr: 2.941 ± 0.621
2.206GlyVal: 2.206 ± 0.561
0.368GlyTrp: 0.368 ± 0.185
1.471GlyTyr: 1.471 ± 0.739
0.0GlyXaa: 0.0 ± 0.0
His
1.103HisAla: 1.103 ± 0.556
1.103HisCys: 1.103 ± 0.997
1.471HisAsp: 1.471 ± 0.739
1.103HisGlu: 1.103 ± 0.867
1.103HisPhe: 1.103 ± 0.536
0.735HisGly: 0.735 ± 0.63
1.103HisHis: 1.103 ± 1.39
1.103HisIle: 1.103 ± 0.554
1.103HisLys: 1.103 ± 0.429
2.574HisLeu: 2.574 ± 1.34
0.0HisMet: 0.0 ± 0.0
1.471HisAsn: 1.471 ± 0.537
0.735HisPro: 0.735 ± 0.37
0.368HisGln: 0.368 ± 0.185
0.368HisArg: 0.368 ± 0.782
4.412HisSer: 4.412 ± 0.871
0.368HisThr: 0.368 ± 0.782
1.103HisVal: 1.103 ± 0.536
0.0HisTrp: 0.0 ± 0.0
1.471HisTyr: 1.471 ± 0.478
0.0HisXaa: 0.0 ± 0.0
Ile
3.676IleAla: 3.676 ± 1.534
2.574IleCys: 2.574 ± 0.404
2.206IleAsp: 2.206 ± 0.901
5.147IleGlu: 5.147 ± 1.321
4.044IlePhe: 4.044 ± 1.314
2.206IleGly: 2.206 ± 0.674
1.103IleHis: 1.103 ± 1.098
1.838IleIle: 1.838 ± 1.084
5.882IleLys: 5.882 ± 1.101
4.412IleLeu: 4.412 ± 1.609
1.471IleMet: 1.471 ± 0.415
1.838IleAsn: 1.838 ± 0.962
2.574IlePro: 2.574 ± 0.824
1.838IleGln: 1.838 ± 0.851
2.206IleArg: 2.206 ± 1.109
7.353IleSer: 7.353 ± 0.977
2.941IleThr: 2.941 ± 1.82
2.941IleVal: 2.941 ± 0.9
0.0IleTrp: 0.0 ± 0.0
2.941IleTyr: 2.941 ± 0.544
0.0IleXaa: 0.0 ± 0.0
Lys
5.147LysAla: 5.147 ± 1.005
1.838LysCys: 1.838 ± 0.924
5.515LysAsp: 5.515 ± 1.585
3.676LysGlu: 3.676 ± 2.343
4.412LysPhe: 4.412 ± 1.025
4.412LysGly: 4.412 ± 1.588
1.838LysHis: 1.838 ± 0.487
4.044LysIle: 4.044 ± 1.065
6.618LysLys: 6.618 ± 1.754
7.353LysLeu: 7.353 ± 1.295
2.206LysMet: 2.206 ± 0.809
2.941LysAsn: 2.941 ± 0.821
1.103LysPro: 1.103 ± 0.867
0.368LysGln: 0.368 ± 0.642
5.882LysArg: 5.882 ± 0.999
6.25LysSer: 6.25 ± 1.694
2.574LysThr: 2.574 ± 0.678
4.412LysVal: 4.412 ± 1.609
1.471LysTrp: 1.471 ± 0.827
1.103LysTyr: 1.103 ± 0.536
0.0LysXaa: 0.0 ± 0.0
Leu
4.044LeuAla: 4.044 ± 2.792
2.941LeuCys: 2.941 ± 1.075
4.779LeuAsp: 4.779 ± 1.231
7.721LeuGlu: 7.721 ± 3.347
4.044LeuPhe: 4.044 ± 1.178
6.985LeuGly: 6.985 ± 0.909
2.206LeuHis: 2.206 ± 0.518
5.882LeuIle: 5.882 ± 1.693
7.721LeuLys: 7.721 ± 0.629
7.721LeuLeu: 7.721 ± 1.887
2.574LeuMet: 2.574 ± 1.294
4.779LeuAsn: 4.779 ± 0.738
5.147LeuPro: 5.147 ± 1.598
3.309LeuGln: 3.309 ± 0.863
5.882LeuArg: 5.882 ± 1.139
9.559LeuSer: 9.559 ± 0.875
5.147LeuThr: 5.147 ± 1.52
8.456LeuVal: 8.456 ± 3.556
0.368LeuTrp: 0.368 ± 0.185
3.309LeuTyr: 3.309 ± 1.068
0.0LeuXaa: 0.0 ± 0.0
Met
1.838MetAla: 1.838 ± 0.924
0.735MetCys: 0.735 ± 0.37
0.0MetAsp: 0.0 ± 0.0
1.103MetGlu: 1.103 ± 0.536
1.103MetPhe: 1.103 ± 0.536
2.206MetGly: 2.206 ± 0.858
0.368MetHis: 0.368 ± 0.185
2.206MetIle: 2.206 ± 0.595
1.103MetLys: 1.103 ± 0.554
2.206MetLeu: 2.206 ± 0.873
1.103MetMet: 1.103 ± 0.554
0.368MetAsn: 0.368 ± 0.185
1.471MetPro: 1.471 ± 0.537
0.0MetGln: 0.0 ± 0.0
1.838MetArg: 1.838 ± 0.924
1.103MetSer: 1.103 ± 0.554
1.103MetThr: 1.103 ± 0.429
0.735MetVal: 0.735 ± 0.37
0.0MetTrp: 0.0 ± 0.0
0.735MetTyr: 0.735 ± 0.37
0.0MetXaa: 0.0 ± 0.0
Asn
1.838AsnAla: 1.838 ± 0.928
0.735AsnCys: 0.735 ± 0.37
1.103AsnAsp: 1.103 ± 0.554
2.574AsnGlu: 2.574 ± 0.824
4.412AsnPhe: 4.412 ± 1.594
4.412AsnGly: 4.412 ± 1.488
1.838AsnHis: 1.838 ± 0.665
2.574AsnIle: 2.574 ± 0.956
3.309AsnLys: 3.309 ± 0.625
8.088AsnLeu: 8.088 ± 1.572
0.735AsnMet: 0.735 ± 0.37
1.471AsnAsn: 1.471 ± 0.886
1.838AsnPro: 1.838 ± 1.485
1.838AsnGln: 1.838 ± 0.924
1.838AsnArg: 1.838 ± 0.763
4.412AsnSer: 4.412 ± 1.488
1.103AsnThr: 1.103 ± 0.536
1.838AsnVal: 1.838 ± 0.928
1.103AsnTrp: 1.103 ± 0.554
1.471AsnTyr: 1.471 ± 0.739
0.0AsnXaa: 0.0 ± 0.0
Pro
2.206ProAla: 2.206 ± 0.858
1.838ProCys: 1.838 ± 0.851
2.941ProAsp: 2.941 ± 0.956
1.471ProGlu: 1.471 ± 1.548
1.103ProPhe: 1.103 ± 0.867
1.103ProGly: 1.103 ± 0.554
1.471ProHis: 1.471 ± 1.761
2.206ProIle: 2.206 ± 1.109
2.941ProLys: 2.941 ± 0.945
2.574ProLeu: 2.574 ± 0.584
0.0ProMet: 0.0 ± 0.0
2.574ProAsn: 2.574 ± 1.06
1.838ProPro: 1.838 ± 0.763
1.471ProGln: 1.471 ± 0.537
1.471ProArg: 1.471 ± 0.739
3.676ProSer: 3.676 ± 1.266
1.838ProThr: 1.838 ± 0.479
1.838ProVal: 1.838 ± 0.479
0.368ProTrp: 0.368 ± 0.185
1.103ProTyr: 1.103 ± 0.554
0.0ProXaa: 0.0 ± 0.0
Gln
0.368GlnAla: 0.368 ± 0.185
0.735GlnCys: 0.735 ± 0.63
1.471GlnAsp: 1.471 ± 0.739
2.574GlnGlu: 2.574 ± 0.74
1.471GlnPhe: 1.471 ± 0.827
0.735GlnGly: 0.735 ± 0.37
0.368GlnHis: 0.368 ± 0.185
2.574GlnIle: 2.574 ± 1.949
1.838GlnLys: 1.838 ± 0.479
2.206GlnLeu: 2.206 ± 1.109
1.103GlnMet: 1.103 ± 0.554
0.735GlnAsn: 0.735 ± 0.37
0.368GlnPro: 0.368 ± 0.185
0.0GlnGln: 0.0 ± 0.0
0.368GlnArg: 0.368 ± 0.185
1.838GlnSer: 1.838 ± 0.581
0.735GlnThr: 0.735 ± 0.37
0.735GlnVal: 0.735 ± 0.645
0.735GlnTrp: 0.735 ± 0.37
0.368GlnTyr: 0.368 ± 0.185
0.0GlnXaa: 0.0 ± 0.0
Arg
2.941ArgAla: 2.941 ± 0.621
0.735ArgCys: 0.735 ± 0.37
2.574ArgAsp: 2.574 ± 0.74
2.941ArgGlu: 2.941 ± 1.779
2.941ArgPhe: 2.941 ± 0.621
1.838ArgGly: 1.838 ± 0.487
0.735ArgHis: 0.735 ± 0.645
2.574ArgIle: 2.574 ± 1.06
2.941ArgLys: 2.941 ± 0.821
4.412ArgLeu: 4.412 ± 1.488
1.471ArgMet: 1.471 ± 0.704
3.676ArgAsn: 3.676 ± 1.329
1.838ArgPro: 1.838 ± 0.928
1.103ArgGln: 1.103 ± 1.148
2.574ArgArg: 2.574 ± 0.404
3.676ArgSer: 3.676 ± 0.482
2.941ArgThr: 2.941 ± 1.704
3.309ArgVal: 3.309 ± 0.978
0.368ArgTrp: 0.368 ± 0.185
1.838ArgTyr: 1.838 ± 0.924
0.0ArgXaa: 0.0 ± 0.0
Ser
3.309SerAla: 3.309 ± 1.663
1.838SerCys: 1.838 ± 0.487
6.25SerAsp: 6.25 ± 1.151
5.515SerGlu: 5.515 ± 0.975
5.515SerPhe: 5.515 ± 0.979
5.515SerGly: 5.515 ± 1.522
2.941SerHis: 2.941 ± 0.956
5.882SerIle: 5.882 ± 1.54
5.882SerLys: 5.882 ± 1.517
9.559SerLeu: 9.559 ± 1.273
2.941SerMet: 2.941 ± 0.89
6.25SerAsn: 6.25 ± 1.697
2.574SerPro: 2.574 ± 0.678
1.838SerGln: 1.838 ± 0.924
4.412SerArg: 4.412 ± 1.348
9.559SerSer: 9.559 ± 3.472
2.941SerThr: 2.941 ± 0.9
7.721SerVal: 7.721 ± 3.472
1.103SerTrp: 1.103 ± 0.554
3.309SerTyr: 3.309 ± 1.663
0.0SerXaa: 0.0 ± 0.0
Thr
1.838ThrAla: 1.838 ± 0.479
1.471ThrCys: 1.471 ± 1.548
1.838ThrAsp: 1.838 ± 0.479
3.309ThrGlu: 3.309 ± 0.877
6.25ThrPhe: 6.25 ± 1.694
2.941ThrGly: 2.941 ± 1.643
1.103ThrHis: 1.103 ± 0.554
1.838ThrIle: 1.838 ± 1.084
1.471ThrLys: 1.471 ± 0.537
2.941ThrLeu: 2.941 ± 0.9
0.368ThrMet: 0.368 ± 0.185
1.838ThrAsn: 1.838 ± 0.479
1.471ThrPro: 1.471 ± 0.537
0.368ThrGln: 0.368 ± 0.185
2.574ThrArg: 2.574 ± 1.657
5.147ThrSer: 5.147 ± 2.153
1.103ThrThr: 1.103 ± 0.554
1.838ThrVal: 1.838 ± 0.479
0.0ThrTrp: 0.0 ± 0.0
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.779ValAla: 4.779 ± 0.81
1.838ValCys: 1.838 ± 1.308
3.676ValAsp: 3.676 ± 1.848
5.147ValGlu: 5.147 ± 1.158
3.309ValPhe: 3.309 ± 0.863
2.941ValGly: 2.941 ± 0.9
1.103ValHis: 1.103 ± 0.536
2.941ValIle: 2.941 ± 1.588
6.25ValLys: 6.25 ± 1.001
7.353ValLeu: 7.353 ± 3.102
1.471ValMet: 1.471 ± 0.478
1.103ValAsn: 1.103 ± 0.554
3.676ValPro: 3.676 ± 0.975
1.471ValGln: 1.471 ± 0.739
3.676ValArg: 3.676 ± 0.855
5.882ValSer: 5.882 ± 0.803
2.206ValThr: 2.206 ± 0.858
5.882ValVal: 5.882 ± 6.122
1.103ValTrp: 1.103 ± 0.997
1.471ValTyr: 1.471 ± 0.739
0.0ValXaa: 0.0 ± 0.0
Trp
0.368TrpAla: 0.368 ± 0.185
0.735TrpCys: 0.735 ± 0.37
0.368TrpAsp: 0.368 ± 0.642
1.103TrpGlu: 1.103 ± 0.429
1.103TrpPhe: 1.103 ± 0.554
0.735TrpGly: 0.735 ± 0.645
0.368TrpHis: 0.368 ± 0.185
0.368TrpIle: 0.368 ± 0.185
0.0TrpLys: 0.0 ± 0.0
1.471TrpLeu: 1.471 ± 0.478
0.368TrpMet: 0.368 ± 0.185
0.368TrpAsn: 0.368 ± 0.642
0.368TrpPro: 0.368 ± 0.185
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.735TrpSer: 0.735 ± 0.37
0.368TrpThr: 0.368 ± 0.185
1.103TrpVal: 1.103 ± 0.536
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.206TyrAla: 2.206 ± 0.67
0.368TyrCys: 0.368 ± 0.782
1.471TyrAsp: 1.471 ± 0.739
2.941TyrGlu: 2.941 ± 1.479
0.735TyrPhe: 0.735 ± 0.37
1.471TyrGly: 1.471 ± 0.537
0.368TyrHis: 0.368 ± 0.185
1.471TyrIle: 1.471 ± 0.478
1.103TyrLys: 1.103 ± 0.556
4.412TyrLeu: 4.412 ± 1.488
0.0TyrMet: 0.0 ± 0.0
2.206TyrAsn: 2.206 ± 1.109
2.206TyrPro: 2.206 ± 1.109
0.368TyrGln: 0.368 ± 0.185
0.735TyrArg: 0.735 ± 0.37
1.471TyrSer: 1.471 ± 1.289
1.838TyrThr: 1.838 ± 0.581
1.471TyrVal: 1.471 ± 0.739
0.368TyrTrp: 0.368 ± 0.185
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (2721 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski