Amino acid dipeptide frequency for Luffa aphid-borne yellows virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
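The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code used to build this page: the function names are hypothetical, dipeptides are assumed to be counted as overlapping adjacent pairs that never span two proteins, and any non-standard letter is assumed to be mapped to X (Xaa).

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # one-letter codes of the 20 standard amino acids
UNKNOWN = "X"                      # catch-all symbol for Xaa (assumed convention)


def dipeptide_permille(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 possible pairs."""
    counts = Counter()
    for seq in sequences:
        # Map every non-standard letter to the catch-all symbol.
        clean = "".join(c if c in STANDARD else UNKNOWN for c in seq.upper())
        # Count overlapping adjacent pairs; pairs never span two proteins.
        counts.update(clean[i:i + 2] for i in range(len(clean) - 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no dipeptides found in the input sequences")
    letters = STANDARD + UNKNOWN
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(letters, repeat=2)}


def uniform_expectation_permille(n_letters=20):
    """Expected per-mille frequency if all dipeptides were equally likely."""
    return 1000.0 / n_letters ** 2


def representation(freq, expected):
    """Label a dipeptide as over- or underrepresented relative to expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "as expected"


# Toy input (made-up sequences, not the viral proteome).
toy = ["MALLA", "LLXA"]
freqs = dipeptide_permille(toy)
expected = uniform_expectation_permille()  # 2.5 per mille for 400 dipeptides
print(round(freqs["LL"], 3), representation(freqs["LL"], expected))
```

Calling `uniform_expectation_permille(21)` gives the corresponding baseline when Xaa is counted as a 21st amino acid.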

For more information see sequence space article on Wikipedia.

Second residue (columns): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.529 ± 1.092
AlaCys: 0.647 ± 0.459
AlaAsp: 2.912 ± 0.472
AlaGlu: 5.823 ± 1.337
AlaPhe: 3.882 ± 0.565
AlaGly: 6.147 ± 1.072
AlaHis: 0.647 ± 0.282
AlaIle: 3.235 ± 0.427
AlaLys: 2.912 ± 0.33
AlaLeu: 5.176 ± 0.499
AlaMet: 1.294 ± 0.588
AlaAsn: 3.882 ± 0.999
AlaPro: 5.176 ± 0.85
AlaGln: 3.235 ± 1.143
AlaArg: 5.176 ± 1.172
AlaSer: 5.823 ± 1.348
AlaThr: 3.559 ± 0.883
AlaVal: 2.265 ± 0.403
AlaTrp: 1.941 ± 0.609
AlaTyr: 2.265 ± 0.918
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.941 ± 0.448
CysCys: 0.647 ± 0.282
CysAsp: 0.647 ± 0.282
CysGlu: 0.324 ± 0.418
CysPhe: 0.647 ± 0.459
CysGly: 1.294 ± 0.588
CysHis: 0.971 ± 0.393
CysIle: 1.618 ± 0.562
CysLys: 1.618 ± 0.562
CysLeu: 2.265 ± 0.954
CysMet: 0.0 ± 0.0
CysAsn: 0.324 ± 0.418
CysPro: 1.294 ± 0.563
CysGln: 1.618 ± 0.562
CysArg: 0.971 ± 0.529
CysSer: 0.647 ± 0.459
CysThr: 0.647 ± 0.293
CysVal: 1.941 ± 0.492
CysTrp: 0.647 ± 0.459
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.912 ± 0.695
AspCys: 0.324 ± 0.23
AspAsp: 1.618 ± 0.279
AspGlu: 4.206 ± 0.991
AspPhe: 1.618 ± 0.76
AspGly: 4.529 ± 0.856
AspHis: 0.971 ± 0.696
AspIle: 1.618 ± 0.732
AspLys: 0.647 ± 0.464
AspLeu: 4.206 ± 0.863
AspMet: 0.971 ± 0.407
AspAsn: 0.971 ± 0.389
AspPro: 2.912 ± 0.959
AspGln: 1.294 ± 0.635
AspArg: 1.618 ± 1.1
AspSer: 2.912 ± 0.899
AspThr: 3.559 ± 0.625
AspVal: 1.618 ± 1.104
AspTrp: 1.294 ± 0.567
AspTyr: 1.294 ± 0.563
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.5 ± 0.522
GluCys: 0.647 ± 0.282
GluAsp: 3.235 ± 1.091
GluGlu: 5.176 ± 1.63
GluPhe: 4.853 ± 0.581
GluGly: 4.206 ± 1.17
GluHis: 2.588 ± 0.593
GluIle: 1.941 ± 0.418
GluLys: 1.294 ± 0.494
GluLeu: 3.882 ± 1.57
GluMet: 2.265 ± 0.708
GluAsn: 1.294 ± 0.298
GluPro: 2.265 ± 0.759
GluGln: 2.265 ± 0.48
GluArg: 1.294 ± 0.677
GluSer: 3.235 ± 0.869
GluThr: 1.618 ± 0.278
GluVal: 5.5 ± 1.033
GluTrp: 1.618 ± 0.446
GluTyr: 1.294 ± 0.909
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.941 ± 0.5
PheCys: 1.294 ± 0.588
PheAsp: 2.265 ± 0.642
PheGlu: 1.294 ± 0.588
PhePhe: 0.0 ± 0.0
PheGly: 2.912 ± 0.722
PheHis: 0.0 ± 0.0
PheIle: 0.647 ± 0.387
PheLys: 2.588 ± 0.626
PheLeu: 6.147 ± 1.497
PheMet: 0.324 ± 0.292
PheAsn: 1.294 ± 0.461
PhePro: 1.941 ± 0.576
PheGln: 0.971 ± 0.529
PheArg: 3.235 ± 0.742
PheSer: 5.176 ± 1.301
PheThr: 0.324 ± 0.292
PheVal: 3.235 ± 0.477
PheTrp: 0.971 ± 0.539
PheTyr: 0.971 ± 0.4
PheXaa: 0.324 ± 0.418

Gly
GlyAla: 4.206 ± 0.769
GlyCys: 1.941 ± 0.501
GlyAsp: 2.588 ± 1.037
GlyGlu: 6.147 ± 0.598
GlyPhe: 2.912 ± 1.383
GlyGly: 6.47 ± 2.749
GlyHis: 1.941 ± 1.277
GlyIle: 3.559 ± 0.729
GlyLys: 5.5 ± 1.435
GlyLeu: 5.823 ± 0.961
GlyMet: 0.971 ± 0.393
GlyAsn: 3.882 ± 0.695
GlyPro: 3.559 ± 1.043
GlyGln: 2.265 ± 0.403
GlyArg: 4.853 ± 0.975
GlySer: 7.441 ± 1.584
GlyThr: 3.235 ± 1.035
GlyVal: 4.529 ± 1.251
GlyTrp: 0.647 ± 0.459
GlyTyr: 2.588 ± 0.51
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.971 ± 0.246
HisCys: 0.324 ± 0.23
HisAsp: 0.971 ± 0.403
HisGlu: 0.647 ± 0.418
HisPhe: 0.971 ± 0.4
HisGly: 0.971 ± 0.4
HisHis: 0.0 ± 0.0
HisIle: 0.971 ± 0.4
HisLys: 1.941 ± 0.501
HisLeu: 1.941 ± 0.65
HisMet: 0.324 ± 0.39
HisAsn: 0.0 ± 0.0
HisPro: 0.971 ± 0.529
HisGln: 1.941 ± 0.403
HisArg: 1.941 ± 1.18
HisSer: 4.853 ± 0.637
HisThr: 1.941 ± 0.384
HisVal: 1.941 ± 0.384
HisTrp: 0.324 ± 0.418
HisTyr: 0.324 ± 0.418
HisXaa: 0.0 ± 0.0

Ile
IleAla: 1.941 ± 0.678
IleCys: 0.971 ± 0.804
IleAsp: 0.971 ± 0.657
IleGlu: 1.618 ± 0.5
IlePhe: 0.647 ± 0.293
IleGly: 2.588 ± 0.649
IleHis: 1.618 ± 0.279
IleIle: 0.647 ± 0.584
IleLys: 2.265 ± 0.341
IleLeu: 5.5 ± 0.798
IleMet: 1.618 ± 0.333
IleAsn: 0.647 ± 0.584
IlePro: 5.176 ± 0.78
IleGln: 1.294 ± 0.319
IleArg: 2.265 ± 0.769
IleSer: 3.559 ± 0.864
IleThr: 1.294 ± 0.584
IleVal: 2.912 ± 1.377
IleTrp: 0.647 ± 0.282
IleTyr: 0.971 ± 0.246
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.206 ± 1.091
LysCys: 1.294 ± 0.44
LysAsp: 2.912 ± 1.199
LysGlu: 2.265 ± 1.104
LysPhe: 2.265 ± 0.715
LysGly: 3.882 ± 0.476
LysHis: 0.647 ± 0.584
LysIle: 4.206 ± 0.877
LysLys: 1.941 ± 0.713
LysLeu: 2.912 ± 0.804
LysMet: 2.588 ± 0.617
LysAsn: 0.647 ± 0.293
LysPro: 3.882 ± 1.454
LysGln: 2.588 ± 0.518
LysArg: 2.588 ± 1.134
LysSer: 5.176 ± 0.862
LysThr: 3.235 ± 0.882
LysVal: 2.588 ± 0.436
LysTrp: 0.0 ± 0.0
LysTyr: 0.324 ± 0.292
LysXaa: 0.324 ± 0.292

Leu
LeuAla: 5.176 ± 2.757
LeuCys: 2.265 ± 0.915
LeuAsp: 5.176 ± 0.86
LeuGlu: 4.206 ± 2.043
LeuPhe: 5.823 ± 1.539
LeuGly: 5.5 ± 1.346
LeuHis: 3.235 ± 0.845
LeuIle: 2.912 ± 0.68
LeuLys: 3.559 ± 0.856
LeuLeu: 11.0 ± 1.852
LeuMet: 1.941 ± 0.384
LeuAsn: 2.912 ± 0.472
LeuPro: 8.088 ± 1.49
LeuGln: 2.588 ± 0.609
LeuArg: 6.47 ± 1.413
LeuSer: 7.764 ± 1.285
LeuThr: 6.47 ± 1.549
LeuVal: 5.176 ± 1.258
LeuTrp: 2.265 ± 0.922
LeuTyr: 5.176 ± 1.019
LeuXaa: 0.324 ± 0.418

Met
MetAla: 1.941 ± 0.631
MetCys: 0.324 ± 0.292
MetAsp: 0.324 ± 0.292
MetGlu: 0.971 ± 0.403
MetPhe: 0.0 ± 0.0
MetGly: 1.294 ± 0.423
MetHis: 0.971 ± 0.407
MetIle: 0.0 ± 0.0
MetLys: 0.0 ± 0.0
MetLeu: 1.941 ± 0.759
MetMet: 0.0 ± 0.0
MetAsn: 0.647 ± 0.387
MetPro: 0.324 ± 0.418
MetGln: 0.0 ± 0.0
MetArg: 0.971 ± 0.4
MetSer: 1.618 ± 0.763
MetThr: 2.265 ± 0.698
MetVal: 3.235 ± 1.055
MetTrp: 0.324 ± 0.23
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 1.294 ± 0.298
AsnCys: 0.647 ± 0.282
AsnAsp: 1.618 ± 0.567
AsnGlu: 1.294 ± 0.44
AsnPhe: 1.294 ± 0.54
AsnGly: 2.588 ± 0.827
AsnHis: 0.971 ± 0.4
AsnIle: 0.647 ± 0.584
AsnLys: 1.941 ± 0.713
AsnLeu: 4.206 ± 0.318
AsnMet: 0.324 ± 0.299
AsnAsn: 1.618 ± 0.76
AsnPro: 0.971 ± 0.393
AsnGln: 0.324 ± 0.23
AsnArg: 5.5 ± 1.347
AsnSer: 3.882 ± 0.784
AsnThr: 2.588 ± 0.607
AsnVal: 2.265 ± 1.115
AsnTrp: 0.647 ± 0.584
AsnTyr: 1.941 ± 0.501
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.912 ± 0.994
ProCys: 0.647 ± 0.282
ProAsp: 1.941 ± 0.811
ProGlu: 3.559 ± 0.54
ProPhe: 0.971 ± 0.246
ProGly: 4.529 ± 1.197
ProHis: 1.941 ± 0.609
ProIle: 3.559 ± 1.027
ProLys: 2.912 ± 0.731
ProLeu: 5.823 ± 1.37
ProMet: 0.647 ± 0.66
ProAsn: 1.618 ± 0.731
ProPro: 9.059 ± 3.406
ProGln: 3.882 ± 0.532
ProArg: 2.588 ± 1.858
ProSer: 7.117 ± 0.667
ProThr: 4.206 ± 1.287
ProVal: 4.206 ± 0.965
ProTrp: 0.647 ± 0.293
ProTyr: 0.324 ± 0.23
ProXaa: 0.324 ± 0.292

Gln
GlnAla: 1.941 ± 0.576
GlnCys: 1.294 ± 0.563
GlnAsp: 0.647 ± 0.836
GlnGlu: 0.971 ± 0.667
GlnPhe: 0.971 ± 0.597
GlnGly: 3.882 ± 0.883
GlnHis: 0.647 ± 0.459
GlnIle: 1.941 ± 0.454
GlnLys: 4.529 ± 0.759
GlnLeu: 3.882 ± 0.666
GlnMet: 0.324 ± 0.23
GlnAsn: 1.941 ± 0.647
GlnPro: 1.618 ± 0.441
GlnGln: 0.647 ± 0.282
GlnArg: 3.235 ± 0.953
GlnSer: 3.559 ± 1.09
GlnThr: 1.941 ± 0.783
GlnVal: 1.941 ± 0.56
GlnTrp: 0.971 ± 0.393
GlnTyr: 1.294 ± 0.563
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.5 ± 0.686
ArgCys: 0.647 ± 0.418
ArgAsp: 2.588 ± 0.892
ArgGlu: 5.823 ± 1.384
ArgPhe: 1.294 ± 0.54
ArgGly: 5.823 ± 1.533
ArgHis: 0.647 ± 0.293
ArgIle: 1.618 ± 0.279
ArgLys: 1.294 ± 0.563
ArgLeu: 6.47 ± 1.365
ArgMet: 0.647 ± 0.752
ArgAsn: 5.5 ± 1.755
ArgPro: 1.294 ± 0.465
ArgGln: 2.265 ± 0.931
ArgArg: 11.647 ± 4.906
ArgSer: 7.441 ± 2.157
ArgThr: 1.294 ± 0.474
ArgVal: 3.235 ± 0.935
ArgTrp: 0.647 ± 0.459
ArgTyr: 2.588 ± 1.104
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 7.764 ± 1.709
SerCys: 1.941 ± 0.501
SerAsp: 4.529 ± 0.65
SerGlu: 4.853 ± 0.849
SerPhe: 4.529 ± 0.856
SerGly: 6.794 ± 0.703
SerHis: 2.912 ± 0.654
SerIle: 3.559 ± 1.727
SerLys: 5.5 ± 1.041
SerLeu: 8.735 ± 0.795
SerMet: 0.324 ± 0.418
SerAsn: 4.206 ± 0.591
SerPro: 4.206 ± 0.991
SerGln: 4.206 ± 1.215
SerArg: 4.206 ± 1.541
SerSer: 11.0 ± 3.222
SerThr: 5.823 ± 0.752
SerVal: 3.882 ± 1.24
SerTrp: 2.588 ± 0.615
SerTyr: 3.882 ± 0.75
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.529 ± 0.856
ThrCys: 2.912 ± 0.731
ThrAsp: 2.265 ± 0.48
ThrGlu: 2.265 ± 1.209
ThrPhe: 1.618 ± 0.441
ThrGly: 4.529 ± 0.485
ThrHis: 0.324 ± 0.292
ThrIle: 2.912 ± 0.599
ThrLys: 2.588 ± 0.615
ThrLeu: 5.176 ± 1.176
ThrMet: 0.647 ± 0.387
ThrAsn: 0.647 ± 0.459
ThrPro: 3.559 ± 1.066
ThrGln: 0.647 ± 0.387
ThrArg: 2.912 ± 0.663
ThrSer: 6.47 ± 0.822
ThrThr: 4.853 ± 1.016
ThrVal: 2.912 ± 0.86
ThrTrp: 2.265 ± 0.494
ThrTyr: 1.294 ± 0.325
ThrXaa: 0.324 ± 0.39

Val
ValAla: 6.47 ± 0.647
ValCys: 1.294 ± 0.44
ValAsp: 3.235 ± 1.338
ValGlu: 3.559 ± 0.882
ValPhe: 1.941 ± 0.647
ValGly: 4.206 ± 0.325
ValHis: 1.294 ± 0.563
ValIle: 2.265 ± 0.496
ValLys: 4.529 ± 1.339
ValLeu: 8.088 ± 1.094
ValMet: 0.647 ± 0.285
ValAsn: 1.941 ± 0.568
ValPro: 3.235 ± 1.305
ValGln: 3.235 ± 0.557
ValArg: 2.265 ± 0.742
ValSer: 4.206 ± 0.596
ValThr: 3.559 ± 1.553
ValVal: 4.206 ± 1.388
ValTrp: 0.0 ± 0.0
ValTyr: 1.941 ± 0.402
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.618 ± 0.652
TrpCys: 0.0 ± 0.0
TrpAsp: 0.324 ± 0.292
TrpGlu: 0.647 ± 0.418
TrpPhe: 1.294 ± 0.461
TrpGly: 1.618 ± 0.435
TrpHis: 0.971 ± 0.539
TrpIle: 0.324 ± 0.292
TrpLys: 0.324 ± 0.292
TrpLeu: 1.618 ± 0.798
TrpMet: 0.324 ± 0.378
TrpAsn: 0.324 ± 0.292
TrpPro: 1.618 ± 0.572
TrpGln: 0.647 ± 0.282
TrpArg: 2.912 ± 0.731
TrpSer: 2.265 ± 0.65
TrpThr: 0.647 ± 0.282
TrpVal: 1.294 ± 0.474
TrpTrp: 0.324 ± 0.39
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.235 ± 0.843
TyrCys: 0.0 ± 0.0
TyrAsp: 0.324 ± 0.292
TyrGlu: 0.324 ± 0.292
TyrPhe: 0.324 ± 0.23
TyrGly: 1.294 ± 0.58
TyrHis: 0.971 ± 0.5
TyrIle: 0.971 ± 0.538
TyrLys: 2.588 ± 0.632
TyrLeu: 3.235 ± 1.213
TyrMet: 0.647 ± 0.464
TyrAsn: 2.265 ± 0.725
TyrPro: 1.294 ± 0.325
TyrGln: 1.618 ± 0.875
TyrArg: 1.294 ± 0.474
TyrSer: 1.294 ± 0.325
TyrThr: 2.588 ± 0.809
TyrVal: 2.912 ± 0.599
TyrTrp: 0.647 ± 0.464
TyrTyr: 0.971 ± 0.667
TyrXaa: 0.647 ± 0.282

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.971 ± 0.667
XaaGln: 0.647 ± 0.282
XaaArg: 0.324 ± 0.418
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.324 ± 0.292
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 6 proteins (3092 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
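A minimal sketch of what a protein-level bootstrap could look like is given below. The page only states that bootstrapping (×100) at the protein level was used; everything else here is an assumption made for illustration: the function names are hypothetical, whole proteins are resampled with replacement, and the spread of the resampled estimates (their standard deviation) is taken as the error.

```python
import random
import statistics


def bootstrap_error(proteins, statistic, n_rounds=100, seed=0):
    """Estimate the error of `statistic` by resampling whole proteins.

    Assumption: the error is the standard deviation of the statistic over
    `n_rounds` resampled proteomes, each drawn with replacement.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(statistic(sample))
    return statistics.pstdev(estimates)


def leu_leu_permille(sequences):
    """Per-mille frequency of the LeuLeu dipeptide (overlapping pairs)."""
    pairs = [s[i:i + 2] for s in sequences for i in range(len(s) - 1)]
    return 1000.0 * pairs.count("LL") / len(pairs) if pairs else 0.0


# Toy proteome (made-up sequences, not the six viral proteins).
toy = ["MALLAG", "GGSLLP", "PPRRSS", "ALLLLA"]
print(leu_leu_permille(toy), bootstrap_error(toy, leu_leu_permille))
```

With only a handful of proteins, as in this proteome, each resampled set differs strongly from the next, which is one plausible reason why some of the reported errors are large relative to the values themselves.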

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski