Amino acid dipeptide frequency for Cocksfoot mottle virus (isolate Dactylis glomerata/Norway/CfMV-NO/1995) (CfMV)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
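As an illustration of the calculation described above, the following is a minimal Python sketch, not the code used to build this page. It assumes that dipeptides are counted as overlapping pairs within each protein, that any residue outside the 20 standard one-letter codes is mapped to X (Xaa), and that the uniform expectation 1/400 is the threshold for over- or underrepresentation. The names `dipeptide_per_mille` and `classify` are hypothetical.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation for 400 dipeptides, in per mille (0.25% = 2.5 per mille).
EXPECTED_UNIFORM = 1000.0 / 400


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping pairs are counted within each protein, and
    residues outside the 20 standard codes are mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(value, expected=EXPECTED_UNIFORM):
    """Label a per mille value relative to the assumed uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "expected"
```

Under these assumptions, a value such as 6.414‰ (AlaAla in the table) would be labelled "over" and 0.534‰ (AlaTyr) "under".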

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
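The unit conversion can be written out as a short Python sketch. The function names are hypothetical and only illustrate the arithmetic.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value / 10.0
```

For example, the AlaAla entry of 6.414‰ corresponds to a fraction of about 0.006414, and the uniform expectation of 2.5‰ corresponds to 0.25%.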

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.414AlaAla: 6.414 ± 1.342
2.138AlaCys: 2.138 ± 0.682
3.741AlaAsp: 3.741 ± 0.985
7.483AlaGlu: 7.483 ± 2.186
2.138AlaPhe: 2.138 ± 0.682
5.879AlaGly: 5.879 ± 1.41
1.069AlaHis: 1.069 ± 0.684
4.276AlaIle: 4.276 ± 1.224
1.069AlaLys: 1.069 ± 0.684
10.155AlaLeu: 10.155 ± 1.528
4.276AlaMet: 4.276 ± 1.224
1.069AlaAsn: 1.069 ± 0.341
3.207AlaPro: 3.207 ± 0.406
1.069AlaGln: 1.069 ± 0.684
6.414AlaArg: 6.414 ± 1.206
6.414AlaSer: 6.414 ± 1.1
3.741AlaThr: 3.741 ± 0.739
4.81AlaVal: 4.81 ± 0.949
4.276AlaTrp: 4.276 ± 1.588
0.534AlaTyr: 0.534 ± 0.778
0.0AlaXaa: 0.0 ± 0.0
Cys
2.672CysAla: 2.672 ± 1.107
0.534CysCys: 0.534 ± 1.064
1.603CysAsp: 1.603 ± 1.193
3.207CysGlu: 3.207 ± 2.977
1.069CysPhe: 1.069 ± 1.025
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.534CysIle: 0.534 ± 0.369
2.672CysLys: 2.672 ± 0.755
2.138CysLeu: 2.138 ± 0.794
1.069CysMet: 1.069 ± 0.341
1.603CysAsn: 1.603 ± 0.794
1.603CysPro: 1.603 ± 0.487
0.0CysGln: 0.0 ± 0.0
1.603CysArg: 1.603 ± 1.107
2.138CysSer: 2.138 ± 0.447
0.534CysThr: 0.534 ± 0.369
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
1.069CysTyr: 1.069 ± 2.129
0.0CysXaa: 0.0 ± 0.0
Asp
2.672AspAla: 2.672 ± 0.755
0.0AspCys: 0.0 ± 0.0
2.672AspAsp: 2.672 ± 1.011
2.138AspGlu: 2.138 ± 0.447
2.672AspPhe: 2.672 ± 0.922
5.345AspGly: 5.345 ± 0.784
0.0AspHis: 0.0 ± 0.0
1.069AspIle: 1.069 ± 1.025
0.0AspLys: 0.0 ± 0.0
3.741AspLeu: 3.741 ± 1.018
0.0AspMet: 0.0 ± 0.0
0.0AspAsn: 0.0 ± 0.0
3.207AspPro: 3.207 ± 1.022
2.138AspGln: 2.138 ± 0.447
2.672AspArg: 2.672 ± 1.315
6.414AspSer: 6.414 ± 0.8
2.138AspThr: 2.138 ± 1.343
3.207AspVal: 3.207 ± 1.494
2.138AspTrp: 2.138 ± 1.003
0.534AspTyr: 0.534 ± 0.778
0.0AspXaa: 0.0 ± 0.0
Glu
4.81GluAla: 4.81 ± 1.928
0.0GluCys: 0.0 ± 0.0
2.138GluAsp: 2.138 ± 0.794
2.672GluGlu: 2.672 ± 0.564
1.603GluPhe: 1.603 ± 1.193
2.138GluGly: 2.138 ± 0.794
0.0GluHis: 0.0 ± 0.0
3.207GluIle: 3.207 ± 1.623
1.069GluLys: 1.069 ± 0.341
11.224GluLeu: 11.224 ± 3.793
3.207GluMet: 3.207 ± 0.783
3.741GluAsn: 3.741 ± 0.739
3.741GluPro: 3.741 ± 0.96
1.603GluGln: 1.603 ± 0.487
3.741GluArg: 3.741 ± 1.264
5.345GluSer: 5.345 ± 1.732
6.948GluThr: 6.948 ± 1.572
1.069GluVal: 1.069 ± 1.025
1.069GluTrp: 1.069 ± 0.341
3.207GluTyr: 3.207 ± 0.974
0.0GluXaa: 0.0 ± 0.0
Phe
0.0PheAla: 0.0 ± 0.0
0.534PheCys: 0.534 ± 0.369
3.207PheAsp: 3.207 ± 0.406
1.603PheGlu: 1.603 ± 0.487
0.534PhePhe: 0.534 ± 1.064
0.534PheGly: 0.534 ± 0.369
0.534PheHis: 0.534 ± 0.369
1.069PheIle: 1.069 ± 1.334
1.069PheLys: 1.069 ± 1.334
0.534PheLeu: 0.534 ± 0.778
0.0PheMet: 0.0 ± 0.0
0.0PheAsn: 0.0 ± 0.0
1.603PhePro: 1.603 ± 0.775
1.603PheGln: 1.603 ± 0.775
6.414PheArg: 6.414 ± 2.967
1.069PheSer: 1.069 ± 1.334
2.138PheThr: 2.138 ± 0.88
3.741PheVal: 3.741 ± 0.518
0.0PheTrp: 0.0 ± 0.0
0.534PheTyr: 0.534 ± 0.369
0.0PheXaa: 0.0 ± 0.0
Gly
2.672GlyAla: 2.672 ± 0.564
2.138GlyCys: 2.138 ± 0.682
3.207GlyAsp: 3.207 ± 0.866
2.672GlyGlu: 2.672 ± 1.138
2.672GlyPhe: 2.672 ± 2.503
6.948GlyGly: 6.948 ± 2.153
1.069GlyHis: 1.069 ± 0.684
3.741GlyIle: 3.741 ± 1.066
4.276GlyLys: 4.276 ± 1.363
5.345GlyLeu: 5.345 ± 1.511
1.603GlyMet: 1.603 ± 0.775
2.138GlyAsn: 2.138 ± 0.447
4.81GlyPro: 4.81 ± 1.132
3.207GlyGln: 3.207 ± 0.974
4.81GlyArg: 4.81 ± 2.81
9.086GlySer: 9.086 ± 1.712
1.069GlyThr: 1.069 ± 1.557
8.552GlyVal: 8.552 ± 1.662
3.741GlyTrp: 3.741 ± 1.264
1.603GlyTyr: 1.603 ± 0.487
0.0GlyXaa: 0.0 ± 0.0
His
0.534HisAla: 0.534 ± 0.369
0.534HisCys: 0.534 ± 1.064
1.069HisAsp: 1.069 ± 0.738
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
0.0HisIle: 0.0 ± 0.0
1.069HisLys: 1.069 ± 0.341
4.81HisLeu: 4.81 ± 0.949
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.534HisPro: 0.534 ± 0.369
0.534HisGln: 0.534 ± 0.369
1.069HisArg: 1.069 ± 1.025
1.603HisSer: 1.603 ± 1.107
0.534HisThr: 0.534 ± 0.778
1.069HisVal: 1.069 ± 0.341
0.534HisTrp: 0.534 ± 0.778
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.207IleAla: 3.207 ± 0.842
2.138IleCys: 2.138 ± 1.172
0.534IleAsp: 0.534 ± 0.369
2.672IleGlu: 2.672 ± 1.298
0.0IlePhe: 0.0 ± 0.0
3.207IleGly: 3.207 ± 1.494
0.0IleHis: 0.0 ± 0.0
0.534IleIle: 0.534 ± 0.369
1.069IleLys: 1.069 ± 0.341
3.741IleLeu: 3.741 ± 0.739
0.0IleMet: 0.0 ± 0.0
1.069IleAsn: 1.069 ± 0.738
1.603IlePro: 1.603 ± 2.151
0.0IleGln: 0.0 ± 0.0
2.138IleArg: 2.138 ± 0.92
7.483IleSer: 7.483 ± 2.186
2.672IleThr: 2.672 ± 0.922
6.414IleVal: 6.414 ± 0.8
0.0IleTrp: 0.0 ± 0.0
1.069IleTyr: 1.069 ± 0.341
0.0IleXaa: 0.0 ± 0.0
Lys
5.345LysAla: 5.345 ± 2.462
0.534LysCys: 0.534 ± 0.369
2.138LysAsp: 2.138 ± 0.682
3.741LysGlu: 3.741 ± 1.066
1.069LysPhe: 1.069 ± 0.341
3.207LysGly: 3.207 ± 1.437
1.603LysHis: 1.603 ± 1.193
1.603LysIle: 1.603 ± 1.113
1.069LysLys: 1.069 ± 0.341
2.138LysLeu: 2.138 ± 0.88
0.0LysMet: 0.0 ± 0.461
1.069LysAsn: 1.069 ± 0.341
3.207LysPro: 3.207 ± 1.001
1.069LysGln: 1.069 ± 0.341
2.672LysArg: 2.672 ± 1.107
8.017LysSer: 8.017 ± 2.626
3.207LysThr: 3.207 ± 1.022
2.672LysVal: 2.672 ± 0.995
0.534LysTrp: 0.534 ± 0.778
0.534LysTyr: 0.534 ± 0.369
0.0LysXaa: 0.0 ± 0.0
Leu
4.276LeuAla: 4.276 ± 1.588
2.672LeuCys: 2.672 ± 0.755
4.276LeuAsp: 4.276 ± 1.122
8.017LeuGlu: 8.017 ± 2.392
2.672LeuPhe: 2.672 ± 1.56
6.948LeuGly: 6.948 ± 1.23
3.207LeuHis: 3.207 ± 0.406
5.345LeuIle: 5.345 ± 1.744
4.276LeuLys: 4.276 ± 0.894
8.552LeuLeu: 8.552 ± 0.934
3.741LeuMet: 3.741 ± 1.262
2.672LeuAsn: 2.672 ± 0.577
4.81LeuPro: 4.81 ± 1.326
2.138LeuGln: 2.138 ± 0.794
3.207LeuArg: 3.207 ± 1.001
5.879LeuSer: 5.879 ± 0.93
4.276LeuThr: 4.276 ± 1.768
5.345LeuVal: 5.345 ± 1.744
2.672LeuTrp: 2.672 ± 1.414
3.207LeuTyr: 3.207 ± 1.022
0.0LeuXaa: 0.0 ± 0.0
Met
5.345MetAla: 5.345 ± 0.901
0.534MetCys: 0.534 ± 1.064
0.0MetAsp: 0.0 ± 0.0
1.069MetGlu: 1.069 ± 0.684
0.534MetPhe: 0.534 ± 0.517
4.276MetGly: 4.276 ± 0.894
0.0MetHis: 0.0 ± 0.0
0.534MetIle: 0.534 ± 0.778
0.534MetLys: 0.534 ± 0.369
1.603MetLeu: 1.603 ± 0.487
0.534MetMet: 0.534 ± 0.778
0.534MetAsn: 0.534 ± 0.369
0.534MetPro: 0.534 ± 0.369
0.0MetGln: 0.0 ± 0.0
0.534MetArg: 0.534 ± 0.369
2.138MetSer: 2.138 ± 0.682
1.069MetThr: 1.069 ± 0.341
0.534MetVal: 0.534 ± 0.778
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.672AsnAla: 2.672 ± 1.172
1.603AsnCys: 1.603 ± 0.972
0.534AsnAsp: 0.534 ± 0.369
0.534AsnGlu: 0.534 ± 0.369
0.0AsnPhe: 0.0 ± 0.0
1.603AsnGly: 1.603 ± 1.107
0.0AsnHis: 0.0 ± 0.0
2.138AsnIle: 2.138 ± 0.682
1.069AsnLys: 1.069 ± 1.334
3.207AsnLeu: 3.207 ± 0.842
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
2.138AsnPro: 2.138 ± 0.447
0.534AsnGln: 0.534 ± 0.369
1.069AsnArg: 1.069 ± 0.341
3.741AsnSer: 3.741 ± 1.066
1.069AsnThr: 1.069 ± 0.684
1.603AsnVal: 1.603 ± 0.595
1.603AsnTrp: 1.603 ± 0.794
0.534AsnTyr: 0.534 ± 0.517
0.0AsnXaa: 0.0 ± 0.0
Pro
8.017ProAla: 8.017 ± 1.255
0.534ProCys: 0.534 ± 0.778
2.672ProAsp: 2.672 ± 1.107
2.138ProGlu: 2.138 ± 0.447
1.603ProPhe: 1.603 ± 0.487
4.276ProGly: 4.276 ± 1.26
1.069ProHis: 1.069 ± 0.738
3.741ProIle: 3.741 ± 1.433
5.345ProLys: 5.345 ± 1.877
6.948ProLeu: 6.948 ± 1.605
0.0ProMet: 0.0 ± 0.0
0.534ProAsn: 0.534 ± 0.369
4.276ProPro: 4.276 ± 2.056
1.603ProGln: 1.603 ± 0.775
3.741ProArg: 3.741 ± 1.171
5.879ProSer: 5.879 ± 0.728
2.672ProThr: 2.672 ± 0.577
5.345ProVal: 5.345 ± 1.845
1.069ProTrp: 1.069 ± 0.738
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.603GlnAla: 1.603 ± 0.487
0.534GlnCys: 0.534 ± 1.064
1.603GlnAsp: 1.603 ± 0.487
3.741GlnGlu: 3.741 ± 1.264
0.0GlnPhe: 0.0 ± 0.0
2.672GlnGly: 2.672 ± 1.011
0.534GlnHis: 0.534 ± 0.369
0.534GlnIle: 0.534 ± 0.369
1.603GlnLys: 1.603 ± 0.794
2.138GlnLeu: 2.138 ± 1.475
0.0GlnMet: 0.0 ± 0.0
1.069GlnAsn: 1.069 ± 0.341
2.672GlnPro: 2.672 ± 2.112
2.138GlnGln: 2.138 ± 2.183
1.069GlnArg: 1.069 ± 0.883
2.672GlnSer: 2.672 ± 1.107
2.138GlnThr: 2.138 ± 0.447
2.138GlnVal: 2.138 ± 0.447
0.0GlnTrp: 0.0 ± 0.0
0.534GlnTyr: 0.534 ± 0.369
0.0GlnXaa: 0.0 ± 0.0
Arg
3.741ArgAla: 3.741 ± 1.066
1.603ArgCys: 1.603 ± 1.193
1.069ArgAsp: 1.069 ± 0.738
4.276ArgGlu: 4.276 ± 1.211
3.207ArgPhe: 3.207 ± 0.842
4.81ArgGly: 4.81 ± 2.301
0.0ArgHis: 0.0 ± 0.0
4.276ArgIle: 4.276 ± 0.803
4.276ArgLys: 4.276 ± 0.602
4.81ArgLeu: 4.81 ± 1.461
1.603ArgMet: 1.603 ± 0.487
0.0ArgAsn: 0.0 ± 0.0
5.345ArgPro: 5.345 ± 1.155
2.672ArgGln: 2.672 ± 0.995
4.81ArgArg: 4.81 ± 2.494
5.345ArgSer: 5.345 ± 1.659
3.741ArgThr: 3.741 ± 2.358
5.345ArgVal: 5.345 ± 0.89
1.069ArgTrp: 1.069 ± 1.025
3.207ArgTyr: 3.207 ± 0.974
0.0ArgXaa: 0.0 ± 0.0
Ser
8.017SerAla: 8.017 ± 1.546
1.603SerCys: 1.603 ± 1.113
6.414SerAsp: 6.414 ± 2.442
2.672SerGlu: 2.672 ± 0.577
2.672SerPhe: 2.672 ± 0.564
9.621SerGly: 9.621 ± 2.729
1.603SerHis: 1.603 ± 0.487
1.603SerIle: 1.603 ± 1.367
8.017SerLys: 8.017 ± 2.712
7.483SerLeu: 7.483 ± 1.609
1.603SerMet: 1.603 ± 1.418
4.276SerAsn: 4.276 ± 1.789
8.552SerPro: 8.552 ± 1.719
4.276SerGln: 4.276 ± 1.209
9.621SerArg: 9.621 ± 0.473
14.431SerSer: 14.431 ± 2.218
6.414SerThr: 6.414 ± 3.688
10.689SerVal: 10.689 ± 0.281
1.603SerTrp: 1.603 ± 0.595
1.603SerTyr: 1.603 ± 1.193
0.0SerXaa: 0.0 ± 0.0
Thr
5.345ThrAla: 5.345 ± 1.977
0.0ThrCys: 0.0 ± 0.0
2.672ThrAsp: 2.672 ± 2.112
4.276ThrGlu: 4.276 ± 1.067
1.069ThrPhe: 1.069 ± 0.738
3.741ThrGly: 3.741 ± 0.715
0.534ThrHis: 0.534 ± 0.369
1.069ThrIle: 1.069 ± 1.334
2.672ThrLys: 2.672 ± 0.577
1.603ThrLeu: 1.603 ± 0.595
0.534ThrMet: 0.534 ± 0.778
3.207ThrAsn: 3.207 ± 1.104
1.603ThrPro: 1.603 ± 0.595
2.672ThrGln: 2.672 ± 0.564
4.276ThrArg: 4.276 ± 1.985
7.483ThrSer: 7.483 ± 4.085
7.483ThrThr: 7.483 ± 0.969
6.948ThrVal: 6.948 ± 2.922
1.069ThrTrp: 1.069 ± 1.025
2.672ThrTyr: 2.672 ± 0.577
0.0ThrXaa: 0.0 ± 0.0
Val
10.689ValAla: 10.689 ± 3.111
3.207ValCys: 3.207 ± 0.974
2.138ValAsp: 2.138 ± 1.475
6.414ValGlu: 6.414 ± 1.83
1.603ValPhe: 1.603 ± 0.595
4.276ValGly: 4.276 ± 1.122
1.069ValHis: 1.069 ± 1.025
3.207ValIle: 3.207 ± 0.936
3.741ValLys: 3.741 ± 0.715
4.276ValLeu: 4.276 ± 1.067
1.069ValMet: 1.069 ± 0.341
1.069ValAsn: 1.069 ± 0.341
4.276ValPro: 4.276 ± 1.122
1.069ValGln: 1.069 ± 1.025
4.276ValArg: 4.276 ± 1.26
9.621ValSer: 9.621 ± 1.968
6.414ValThr: 6.414 ± 1.522
7.483ValVal: 7.483 ± 1.816
2.138ValTrp: 2.138 ± 0.447
1.603ValTyr: 1.603 ± 0.595
0.0ValXaa: 0.0 ± 0.0
Trp
0.534TrpAla: 0.534 ± 0.369
1.069TrpCys: 1.069 ± 0.738
0.0TrpAsp: 0.0 ± 0.0
2.672TrpGlu: 2.672 ± 0.564
1.603TrpPhe: 1.603 ± 1.418
3.741TrpGly: 3.741 ± 0.518
0.534TrpHis: 0.534 ± 0.369
1.069TrpIle: 1.069 ± 0.341
0.0TrpLys: 0.0 ± 0.0
0.534TrpLeu: 0.534 ± 0.778
0.534TrpMet: 0.534 ± 0.337
0.534TrpAsn: 0.534 ± 1.064
2.138TrpPro: 2.138 ± 1.475
0.534TrpGln: 0.534 ± 0.517
1.069TrpArg: 1.069 ± 0.684
5.345TrpSer: 5.345 ± 0.524
1.069TrpThr: 1.069 ± 0.341
0.534TrpVal: 0.534 ± 0.369
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.603TyrAla: 1.603 ± 1.418
2.138TyrCys: 2.138 ± 0.794
1.069TyrAsp: 1.069 ± 0.341
0.534TyrGlu: 0.534 ± 0.369
0.0TyrPhe: 0.0 ± 0.0
2.138TyrGly: 2.138 ± 0.682
1.069TyrHis: 1.069 ± 0.341
0.0TyrIle: 0.0 ± 0.0
1.069TyrLys: 1.069 ± 0.341
3.207TyrLeu: 3.207 ± 1.104
0.0TyrMet: 0.0 ± 0.0
1.069TyrAsn: 1.069 ± 0.341
1.603TyrPro: 1.603 ± 0.487
0.534TyrGln: 0.534 ± 0.517
0.0TyrArg: 0.0 ± 0.0
2.672TyrSer: 2.672 ± 1.93
1.603TyrThr: 1.603 ± 0.972
2.138TyrVal: 2.138 ± 0.447
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1872 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
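The exact bootstrap procedure is not described on this page. The Python sketch below shows one common way to bootstrap at the protein level, under these assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is the sample standard deviation across replicates. The function names, the `seed` parameter, and the use of the standard deviation are all assumptions.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille}, counting overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Estimate the error of one dipeptide frequency by protein-level bootstrap.

    Each round draws len(proteins) proteins with replacement and recomputes
    the frequency of `pair`; the result is the sample standard deviation of
    those estimates (an assumed choice of error measure). Needs rounds >= 2.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)
```

With only 4 proteins, as in this proteome, each replicate contains few distinct proteins, so errors estimated this way can be large relative to the values themselves.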

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski