Amino acid dipeptide frequency for Carrot Ch virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
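The comparison described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used by Proteome-pI: the function names are hypothetical, sequences are assumed to be in one-letter code, pairs are assumed to be counted within each protein only, and the reference value is assumed to be the uniform expectation (1/400 of all dipeptides).

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (append "X" to model Xaa).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_per_mille(proteins, alphabet=STANDARD):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: pairs are counted inside each protein, never across proteins.
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}

def uniform_expectation_per_mille(alphabet=STANDARD):
    """Frequency each dipeptide would have if all pairs were equally likely."""
    return 1000.0 / len(alphabet) ** 2

def classify(freq, expected):
    """Label a frequency relative to the expectation (red = over, blue = under)."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"

if __name__ == "__main__":
    toy_proteome = ["MKKLLA", "AKK"]  # made-up sequences, not the virus proteome
    freqs = dipeptide_per_mille(toy_proteome)
    expected = uniform_expectation_per_mille()
    print(expected, freqs["KK"], classify(freqs["KK"], expected))
```

With the 20-letter alphabet the expectation is 2.5‰; passing a 21-letter alphabet that includes "X" gives 1000/441, roughly 2.27‰.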

All values are given in per mille (‰); multiply them by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.
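As a small worked example of the unit conversion, the following Python sketch converts a per mille value to a fraction and to a percentage. The helper names are hypothetical and chosen only for illustration; the LeuLys value is taken from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10**-3)."""
    return value_per_mille * 1e-3

def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (1% equals 10 per mille)."""
    return value_per_mille / 10.0

if __name__ == "__main__":
    leu_lys = 13.406  # LeuLys frequency as listed in the table, in per mille
    print(per_mille_to_fraction(leu_lys))  # about 0.0134 as a fraction
    print(per_mille_to_percent(leu_lys))   # about 1.34 percent
```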

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala: AlaAla 4.189 ± 2.078; AlaCys 0.419 ± 0.208; AlaAsp 3.351 ± 3.112; AlaGlu 4.608 ± 0.521; AlaPhe 5.446 ± 1.907; AlaGly 3.77 ± 1.87; AlaHis 1.257 ± 0.623; AlaIle 4.608 ± 2.448; AlaLys 5.027 ± 1.864; AlaLeu 7.122 ± 0.704; AlaMet 0.419 ± 0.208; AlaAsn 2.095 ± 0.949; AlaPro 0.838 ± 0.701; AlaGln 0.838 ± 0.416; AlaArg 2.514 ± 0.763; AlaSer 2.514 ± 0.653; AlaThr 0.838 ± 0.701; AlaVal 1.676 ± 0.536; AlaTrp 0.419 ± 0.208; AlaTyr 0.838 ± 0.416; AlaXaa 0.0 ± 0.0
Cys: CysAla 0.419 ± 0.851; CysCys 0.0 ± 0.0; CysAsp 1.257 ± 0.588; CysGlu 0.838 ± 0.416; CysPhe 1.676 ± 0.831; CysGly 0.838 ± 0.416; CysHis 0.0 ± 0.0; CysIle 1.676 ± 0.933; CysLys 2.514 ± 1.247; CysLeu 1.676 ± 0.831; CysMet 0.0 ± 0.0; CysAsn 0.419 ± 0.208; CysPro 0.0 ± 0.0; CysGln 0.0 ± 0.0; CysArg 1.676 ± 0.933; CysSer 2.095 ± 1.039; CysThr 0.419 ± 0.208; CysVal 0.419 ± 0.208; CysTrp 0.0 ± 0.0; CysTyr 0.838 ± 0.416; CysXaa 0.0 ± 0.0
Asp: AspAla 2.514 ± 1.247; AspCys 2.514 ± 1.247; AspAsp 6.703 ± 1.911; AspGlu 6.284 ± 1.294; AspPhe 4.189 ± 1.553; AspGly 3.77 ± 0.377; AspHis 0.419 ± 0.208; AspIle 2.933 ± 1.973; AspLys 6.703 ± 4.652; AspLeu 7.122 ± 3.446; AspMet 2.933 ± 1.085; AspAsn 3.351 ± 1.662; AspPro 5.027 ± 1.607; AspGln 2.095 ± 1.278; AspArg 2.514 ± 0.763; AspSer 5.865 ± 1.63; AspThr 1.257 ± 1.341; AspVal 4.189 ± 1.32; AspTrp 0.838 ± 0.416; AspTyr 1.676 ± 0.831; AspXaa 0.0 ± 0.0
Glu: GluAla 2.933 ± 1.106; GluCys 0.838 ± 0.416; GluAsp 2.933 ± 1.454; GluGlu 6.284 ± 1.591; GluPhe 4.608 ± 2.286; GluGly 5.027 ± 1.307; GluHis 0.838 ± 0.701; GluIle 3.77 ± 0.377; GluLys 7.541 ± 2.848; GluLeu 5.446 ± 3.234; GluMet 2.933 ± 2.813; GluAsn 5.446 ± 1.349; GluPro 2.095 ± 0.909; GluGln 1.257 ± 0.623; GluArg 3.77 ± 0.925; GluSer 6.703 ± 1.462; GluThr 1.257 ± 0.588; GluVal 2.933 ± 0.792; GluTrp 0.419 ± 0.208; GluTyr 1.257 ± 1.0; GluXaa 0.0 ± 0.0
Phe: PheAla 5.027 ± 0.329; PheCys 1.676 ± 0.536; PheAsp 6.703 ± 0.566; PheGlu 5.027 ± 2.21; PhePhe 4.608 ± 2.286; PheGly 2.933 ± 1.454; PheHis 1.676 ± 0.536; PheIle 4.189 ± 0.721; PheLys 3.77 ± 0.377; PheLeu 5.446 ± 1.907; PheMet 0.838 ± 1.103; PheAsn 3.77 ± 3.707; PhePro 1.257 ± 0.623; PheGln 2.514 ± 1.247; PheArg 3.77 ± 1.87; PheSer 8.379 ± 3.329; PheThr 1.257 ± 0.623; PheVal 2.514 ± 0.932; PheTrp 0.419 ± 0.208; PheTyr 2.095 ± 1.278; PheXaa 0.0 ± 0.0
Gly: GlyAla 4.189 ± 0.721; GlyCys 1.257 ± 0.623; GlyAsp 5.865 ± 1.797; GlyGlu 2.514 ± 1.247; GlyPhe 2.095 ± 0.949; GlyGly 1.676 ± 0.536; GlyHis 0.419 ± 0.208; GlyIle 4.189 ± 1.72; GlyLys 3.77 ± 1.87; GlyLeu 3.351 ± 2.107; GlyMet 0.838 ± 0.416; GlyAsn 2.514 ± 1.177; GlyPro 1.257 ± 0.623; GlyGln 0.838 ± 0.416; GlyArg 4.189 ± 0.721; GlySer 3.77 ± 1.076; GlyThr 3.351 ± 1.82; GlyVal 4.189 ± 0.41; GlyTrp 1.257 ± 0.623; GlyTyr 2.095 ± 1.039; GlyXaa 0.0 ± 0.0
His: HisAla 1.257 ± 0.623; HisCys 0.0 ± 0.0; HisAsp 1.257 ± 0.623; HisGlu 0.0 ± 0.0; HisPhe 1.257 ± 0.588; HisGly 0.419 ± 0.208; HisHis 0.419 ± 0.208; HisIle 0.838 ± 0.416; HisLys 0.419 ± 0.208; HisLeu 2.095 ± 0.56; HisMet 0.838 ± 1.423; HisAsn 0.0 ± 0.0; HisPro 0.419 ± 0.208; HisGln 0.419 ± 0.208; HisArg 0.838 ± 0.416; HisSer 2.095 ± 0.56; HisThr 0.419 ± 0.208; HisVal 0.838 ± 0.416; HisTrp 0.0 ± 0.0; HisTyr 0.419 ± 0.208; HisXaa 0.0 ± 0.0
Ile: IleAla 3.351 ± 0.45; IleCys 1.676 ± 0.831; IleAsp 8.379 ± 1.938; IleGlu 5.865 ± 1.583; IlePhe 3.351 ± 0.955; IleGly 2.514 ± 1.177; IleHis 0.419 ± 0.208; IleIle 2.933 ± 0.792; IleLys 7.541 ± 2.152; IleLeu 6.703 ± 2.202; IleMet 2.095 ± 0.949; IleAsn 5.446 ± 1.605; IlePro 2.514 ± 2.104; IleGln 1.676 ± 1.143; IleArg 2.933 ± 1.106; IleSer 5.865 ± 1.63; IleThr 2.514 ± 0.932; IleVal 3.77 ± 1.912; IleTrp 0.0 ± 0.0; IleTyr 2.095 ± 0.56; IleXaa 0.0 ± 0.0
Lys: LysAla 4.189 ± 1.32; LysCys 0.419 ± 0.208; LysAsp 2.933 ± 0.792; LysGlu 6.703 ± 2.458; LysPhe 6.284 ± 2.942; LysGly 7.541 ± 1.456; LysHis 1.257 ± 1.0; LysIle 6.284 ± 2.309; LysLys 8.379 ± 2.275; LysLeu 7.122 ± 2.652; LysMet 4.189 ± 0.41; LysAsn 4.608 ± 1.199; LysPro 2.514 ± 0.932; LysGln 2.933 ± 3.355; LysArg 4.608 ± 1.532; LysSer 7.96 ± 0.754; LysThr 2.095 ± 0.909; LysVal 6.284 ± 1.262; LysTrp 1.257 ± 0.623; LysTyr 4.608 ± 0.53; LysXaa 0.0 ± 0.0
Leu: LeuAla 4.189 ± 1.899; LeuCys 3.77 ± 1.134; LeuAsp 7.96 ± 2.187; LeuGlu 4.189 ± 1.818; LeuPhe 5.865 ± 1.63; LeuGly 5.865 ± 1.285; LeuHis 1.257 ± 1.546; LeuIle 8.798 ± 1.59; LeuLys 13.406 ± 1.8; LeuLeu 7.96 ± 3.948; LeuMet 1.257 ± 0.588; LeuAsn 5.027 ± 1.526; LeuPro 2.095 ± 0.56; LeuGln 2.514 ± 3.31; LeuArg 3.77 ± 1.679; LeuSer 7.541 ± 2.796; LeuThr 4.608 ± 1.541; LeuVal 4.608 ± 2.448; LeuTrp 0.0 ± 0.0; LeuTyr 2.933 ± 1.454; LeuXaa 0.0 ± 0.0
Met: MetAla 2.514 ± 0.932; MetCys 0.419 ± 0.208; MetAsp 0.838 ± 0.416; MetGlu 1.257 ± 0.623; MetPhe 0.419 ± 1.233; MetGly 0.838 ± 2.466; MetHis 0.419 ± 0.208; MetIle 2.095 ± 1.039; MetLys 4.189 ± 1.379; MetLeu 4.189 ± 1.689; MetMet 1.257 ± 0.623; MetAsn 1.257 ± 1.0; MetPro 1.676 ± 0.831; MetGln 0.838 ± 1.702; MetArg 1.676 ± 2.479; MetSer 1.676 ± 1.143; MetThr 1.257 ± 0.623; MetVal 0.0 ± 0.0; MetTrp 0.0 ± 0.0; MetTyr 0.419 ± 1.233; MetXaa 0.0 ± 0.0
Asn: AsnAla 1.676 ± 1.403; AsnCys 0.0 ± 0.0; AsnAsp 1.257 ± 1.546; AsnGlu 2.933 ± 0.792; AsnPhe 6.284 ± 0.371; AsnGly 0.838 ± 0.416; AsnHis 0.419 ± 0.208; AsnIle 3.351 ± 2.806; AsnLys 2.514 ± 1.177; AsnLeu 9.636 ± 1.475; AsnMet 1.676 ± 0.933; AsnAsn 2.514 ± 2.104; AsnPro 2.514 ± 1.177; AsnGln 1.676 ± 1.143; AsnArg 2.933 ± 1.973; AsnSer 5.027 ± 0.692; AsnThr 1.257 ± 1.341; AsnVal 1.676 ± 0.536; AsnTrp 0.0 ± 0.0; AsnTyr 1.676 ± 0.831; AsnXaa 0.0 ± 0.0
Pro: ProAla 0.419 ± 0.851; ProCys 0.419 ± 0.208; ProAsp 3.77 ± 1.23; ProGlu 0.419 ± 0.208; ProPhe 1.676 ± 0.933; ProGly 0.838 ± 0.701; ProHis 0.0 ± 0.0; ProIle 2.933 ± 0.792; ProLys 2.933 ± 0.591; ProLeu 3.351 ± 1.071; ProMet 1.676 ± 0.536; ProAsn 0.838 ± 1.702; ProPro 0.419 ± 0.208; ProGln 2.514 ± 1.247; ProArg 2.514 ± 0.932; ProSer 1.676 ± 1.143; ProThr 1.676 ± 0.536; ProVal 2.514 ± 0.932; ProTrp 0.419 ± 0.208; ProTyr 0.419 ± 0.208; ProXaa 0.0 ± 0.0
Gln: GlnAla 1.676 ± 1.403; GlnCys 0.0 ± 0.0; GlnAsp 2.095 ± 2.096; GlnGlu 2.933 ± 1.454; GlnPhe 0.838 ± 0.416; GlnGly 1.257 ± 0.588; GlnHis 0.0 ± 0.0; GlnIle 3.351 ± 1.071; GlnLys 2.095 ± 0.909; GlnLeu 4.189 ± 1.379; GlnMet 1.676 ± 0.831; GlnAsn 0.838 ± 1.103; GlnPro 0.0 ± 0.0; GlnGln 0.419 ± 0.208; GlnArg 1.676 ± 0.933; GlnSer 2.514 ± 0.932; GlnThr 0.419 ± 0.208; GlnVal 1.257 ± 2.164; GlnTrp 0.419 ± 0.208; GlnTyr 0.419 ± 0.208; GlnXaa 0.0 ± 0.0
Arg: ArgAla 2.933 ± 0.792; ArgCys 0.0 ± 0.0; ArgAsp 1.676 ± 0.831; ArgGlu 2.514 ± 0.763; ArgPhe 4.608 ± 1.635; ArgGly 5.446 ± 3.234; ArgHis 0.838 ± 0.416; ArgIle 3.77 ± 1.23; ArgLys 5.446 ± 0.178; ArgLeu 2.514 ± 1.247; ArgMet 0.838 ± 0.416; ArgAsn 0.419 ± 0.208; ArgPro 1.676 ± 1.143; ArgGln 0.838 ± 0.416; ArgArg 2.095 ± 2.703; ArgSer 2.933 ± 3.355; ArgThr 1.676 ± 0.831; ArgVal 6.284 ± 3.79; ArgTrp 0.419 ± 0.851; ArgTyr 2.095 ± 0.56; ArgXaa 0.0 ± 0.0
Ser: SerAla 4.608 ± 1.513; SerCys 1.257 ± 0.623; SerAsp 6.703 ± 1.462; SerGlu 7.122 ± 3.827; SerPhe 8.379 ± 2.275; SerGly 2.514 ± 0.932; SerHis 2.095 ± 1.039; SerIle 6.284 ± 2.878; SerLys 4.608 ± 1.199; SerLeu 8.798 ± 2.239; SerMet 0.838 ± 0.416; SerAsn 5.446 ± 0.178; SerPro 2.095 ± 0.909; SerGln 2.933 ± 1.454; SerArg 2.933 ± 1.106; SerSer 8.379 ± 5.159; SerThr 1.257 ± 0.623; SerVal 4.189 ± 1.12; SerTrp 1.257 ± 1.546; SerTyr 3.351 ± 1.101; SerXaa 0.0 ± 0.0
Thr: ThrAla 0.419 ± 0.851; ThrCys 0.838 ± 1.103; ThrAsp 1.676 ± 0.831; ThrGlu 2.933 ± 0.999; ThrPhe 2.933 ± 0.591; ThrGly 3.351 ± 1.071; ThrHis 0.419 ± 0.208; ThrIle 3.351 ± 1.071; ThrLys 1.676 ± 0.831; ThrLeu 2.933 ± 0.999; ThrMet 0.419 ± 1.233; ThrAsn 0.0 ± 0.0; ThrPro 1.257 ± 0.623; ThrGln 0.838 ± 0.416; ThrArg 0.419 ± 0.208; ThrSer 0.838 ± 1.103; ThrThr 0.838 ± 0.416; ThrVal 2.514 ± 0.763; ThrTrp 0.0 ± 0.0; ThrTyr 1.257 ± 1.0; ThrXaa 0.0 ± 0.0
Val: ValAla 4.189 ± 0.41; ValCys 0.838 ± 1.103; ValAsp 4.608 ± 1.71; ValGlu 3.77 ± 1.765; ValPhe 0.838 ± 0.416; ValGly 2.514 ± 1.177; ValHis 1.676 ± 0.831; ValIle 4.608 ± 1.199; ValLys 6.284 ± 1.294; ValLeu 3.77 ± 1.679; ValMet 1.257 ± 1.0; ValAsn 3.351 ± 0.955; ValPro 1.257 ± 0.623; ValGln 2.514 ± 1.247; ValArg 1.676 ± 0.831; ValSer 4.189 ± 4.489; ValThr 2.095 ± 0.949; ValVal 2.095 ± 1.039; ValTrp 0.0 ± 0.0; ValTyr 1.676 ± 0.933; ValXaa 0.0 ± 0.0
Trp: TrpAla 0.838 ± 0.416; TrpCys 0.0 ± 0.0; TrpAsp 0.419 ± 0.208; TrpGlu 0.0 ± 0.0; TrpPhe 0.0 ± 0.0; TrpGly 0.0 ± 0.0; TrpHis 0.0 ± 0.0; TrpIle 0.419 ± 0.208; TrpLys 0.838 ± 0.416; TrpLeu 1.257 ± 1.546; TrpMet 0.419 ± 0.208; TrpAsn 0.838 ± 0.701; TrpPro 0.838 ± 0.416; TrpGln 0.419 ± 0.208; TrpArg 0.0 ± 0.0; TrpSer 0.838 ± 0.416; TrpThr 0.0 ± 0.0; TrpVal 0.419 ± 0.208; TrpTrp 0.0 ± 0.0; TrpTyr 0.0 ± 0.0; TrpXaa 0.0 ± 0.0
Tyr: TyrAla 1.257 ± 0.623; TyrCys 0.419 ± 0.208; TyrAsp 2.933 ± 1.106; TyrGlu 2.095 ± 1.039; TyrPhe 2.514 ± 2.001; TyrGly 1.676 ± 0.536; TyrHis 0.419 ± 0.208; TyrIle 1.257 ± 1.0; TyrLys 2.514 ± 1.247; TyrLeu 2.933 ± 0.999; TyrMet 0.419 ± 0.208; TyrAsn 1.676 ± 0.536; TyrPro 1.676 ± 2.207; TyrGln 0.0 ± 0.0; TyrArg 2.514 ± 1.247; TyrSer 4.189 ± 1.32; TyrThr 0.419 ± 0.208; TyrVal 0.838 ± 0.416; TyrTrp 0.419 ± 0.208; TyrTyr 1.676 ± 0.933; TyrXaa 0.0 ± 0.0
Xaa: XaaAla 0.0 ± 0.0; XaaCys 0.0 ± 0.0; XaaAsp 0.0 ± 0.0; XaaGlu 0.0 ± 0.0; XaaPhe 0.0 ± 0.0; XaaGly 0.0 ± 0.0; XaaHis 0.0 ± 0.0; XaaIle 0.0 ± 0.0; XaaLys 0.0 ± 0.0; XaaLeu 0.0 ± 0.0; XaaMet 0.0 ± 0.0; XaaAsn 0.0 ± 0.0; XaaPro 0.0 ± 0.0; XaaGln 0.0 ± 0.0; XaaArg 0.0 ± 0.0; XaaSer 0.0 ± 0.0; XaaThr 0.0 ± 0.0; XaaVal 0.0 ± 0.0; XaaTrp 0.0 ± 0.0; XaaTyr 0.0 ± 0.0; XaaXaa 0.0 ± 0.0
Statistics based on 3 proteins (2388 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
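A protein-level bootstrap of this kind could look roughly like the Python sketch below. It is an illustration under assumptions, not the Proteome-pI implementation: the function names are hypothetical, whole proteins are assumed to be resampled with replacement, and the reported error is assumed to be the standard deviation of the resampled frequencies.

```python
import random
import statistics
from collections import Counter

def dipeptide_frequency(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter code) in per mille."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_frequency(sample, dipeptide))
    return statistics.stdev(estimates)

if __name__ == "__main__":
    toy_proteome = ["MKKLLA", "AKKV", "LLKA"]  # made-up sequences
    print(dipeptide_frequency(toy_proteome, "KK"),
          bootstrap_error(toy_proteome, "KK"))
```

A dipeptide that never occurs has a frequency of 0 in every resample, so its error is 0 as well, which matches the 0.0 ± 0.0 entries in the table. With only 3 proteins, few distinct resamples exist, so the error estimates are coarse.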

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski