Software modes

Running as a script

When the software is run from a terminal or command prompt, the __init__ file defines two important locations for software-related files: _ROOT and PACKAGEDIR.

The _ROOT directory holds everything from input data and target information to results, so by default it is set to an easily accessible place: the current working directory. Within it, the software assumes there are three subdirectories:

INFDIR : ‘~/path/to/local/pysyd/directory/info’
INPDIR : ‘~/path/to/local/pysyd/directory/data’
OUTDIR : ‘~/path/to/local/pysyd/directory/results’

The second location, PACKAGEDIR, should never need to be touched unless the installation or setup was modified by the user and package data was intentionally left out (which is not recommended). For example, the pySYD matplotlib stylesheet is saved there, as are the relevant info dictionaries. Since these files are used frequently by the software and do not need to be modified by a user, they are typically installed within the installed pysyd directory (e.g., /usr/local/lib/python3.10/site-packages/pysyd/data/).
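The directory layout described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the helper names default_directories and package_directory are hypothetical, and the way pySYD actually computes these paths in its __init__ file may differ.

```python
import os


def default_directories(root=None):
    """Build the three assumed subdirectories under a root directory.

    If no root is given, the current working directory is used, which
    matches the default described in the text. (Hypothetical helper.)
    """
    root = os.path.abspath(root if root is not None else os.getcwd())
    return {
        "_ROOT": root,
        "INFDIR": os.path.join(root, "info"),     # target information
        "INPDIR": os.path.join(root, "data"),     # input data
        "OUTDIR": os.path.join(root, "results"),  # output results
    }


def package_directory(module_file):
    """Return the directory containing an installed module file.

    Passing a package's __file__ here gives a PACKAGEDIR-like location.
    (Hypothetical helper.)
    """
    return os.path.abspath(os.path.dirname(module_file))


# Example: resolve the layout relative to the current working directory.
directories = default_directories()
```

Keeping user files under the working directory and package data next to the installed code means results never end up inside site-packages.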

Importing as a module

Pipeline overview

The software generally operates in four main steps:

Loads in parameters and data
Gets initial values
Fits global parameters
Estimates uncertainties

Note

The software will run the pipeline on oversampled spectra for the first iterations but will always switch to critically-sampled spectra when estimating uncertainties. Calculating uncertainties with oversampled spectra can produce unreliable results and uncertainties!
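As a rough illustration of the switch described in this note, the sketch below reduces an oversampled spectrum to a critically-sampled one. It assumes the spectrum was oversampled by an integer factor, so that keeping every n-th frequency bin recovers a grid with spacing 1/T for a time series of length T. The function names are hypothetical and are not taken from the pySYD source.

```python
def critical_resolution_muhz(baseline_days):
    """Frequency spacing (in microhertz) of a critically-sampled spectrum
    for a time series spanning `baseline_days` days, i.e. 1/T."""
    return 1.0e6 / (baseline_days * 86400.0)


def to_critically_sampled(frequency, power, oversampling_factor):
    """Keep every n-th bin of an oversampled spectrum.

    Assumes `oversampling_factor` is a positive integer and that the
    first bin of the oversampled grid lies on the critical grid.
    """
    if oversampling_factor < 1:
        raise ValueError("oversampling_factor must be a positive integer")
    return frequency[::oversampling_factor], power[::oversampling_factor]


# Example: a toy spectrum oversampled by a factor of 5.
oversampled_frequency = [0.1 * i for i in range(10)]
oversampled_power = [float(i) for i in range(10)]
frequency, power = to_critically_sampled(
    oversampled_frequency, oversampled_power, 5
)
```

Neighbouring bins in an oversampled spectrum are correlated, which is why uncertainty estimates are better made on the critically-sampled grid.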

A pySYD pipeline Target class object has two main function calls:

The first module :
- Summary: a crude, quick way to identify the power excess due to solar-like oscillations
- This uses a heavy smoothing filter to divide out the background and then implements a frequency-resolved, collapsed autocorrelation function (ACF) using 3 different box sizes
- The main purpose for this first module is to provide a good starting point for the second module. The output from this routine provides a rough estimate for numax, which is translated into a frequency range in the power spectrum that is believed to exhibit characteristics of p-mode oscillations
The second module :
- Summary: performs a more rigorous analysis to determine both the stellar background contribution as well as the global asteroseismic parameters.
- Given the frequency range determined by the first module, this region is masked out to model the white- and red-noise contributions present in the power spectrum. The fitting procedure will test a series of models and select the best-fit stellar background model based on the Bayesian Information Criterion (BIC).
- The power spectrum is corrected by dividing out this contribution, which is also saved as an output text file.
- Now that the background has been removed, the global parameters can be more accurately estimated. Numax is estimated in two ways: the first takes the peak of the heavily smoothed, background-corrected power spectrum, and the second fits a Gaussian to this same power spectrum. The smoothed numax has typically been adopted as the default numax value reported in the literature since it makes no assumptions about the shape of the power excess.
- Using the masked power spectrum in the region centered around numax, an autocorrelation is computed to determine the large frequency spacing.
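The BIC-based model selection mentioned for the second module can be sketched as follows. This is a minimal example, assuming Gaussian residuals so that the BIC takes the common form n ln(RSS/n) + k ln(n), where n is the number of data points, k the number of free parameters and RSS the residual sum of squares. The function names and the toy data are hypothetical, and the likelihood pySYD actually uses may differ.

```python
import math


def bic(observed, model, n_parameters):
    """BIC under a Gaussian-residual assumption (lower is better)."""
    n = len(observed)
    rss = sum((o - m) ** 2 for o, m in zip(observed, model))
    return n * math.log(rss / n) + n_parameters * math.log(n)


def select_best_model(observed, candidates):
    """Pick the candidate with the lowest BIC.

    `candidates` maps a model name to (model_values, n_parameters).
    """
    scores = {
        name: bic(observed, values, k)
        for name, (values, k) in candidates.items()
    }
    best = min(scores, key=scores.get)
    return best, scores


# Toy data: compare a flat (1-parameter) and a linear (2-parameter) model.
observed = [1.0, 2.1, 2.9, 4.2, 5.0, 5.9]
mean = sum(observed) / len(observed)
candidates = {
    "flat": ([mean] * len(observed), 1),
    "linear": ([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], 2),
}
best_name, bic_scores = select_best_model(observed, candidates)
```

The k ln(n) term penalizes extra parameters, so a more complex background model is selected only when it improves the fit enough to justify them.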

Note

By default, both modules will run and this is the recommended procedure if no other information is provided.

If stellar parameters like the radius, effective temperature and/or surface gravity are provided in the info/star_info.csv file, pySYD can estimate a value for numax using a scaling relation. Therefore the first module can be bypassed, and the second module will use the estimated numax as an initial starting point.
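For illustration, a widely used form of the numax scaling relation is numax = numax_sun * (g / g_sun) * (Teff / Teff_sun)^(-1/2). Since g = GM/R^2, the radius enters only through the surface gravity. The sketch below assumes this form and commonly quoted solar reference values; the function name and the constants are assumptions and are not necessarily what pySYD uses internally.

```python
# Assumed solar reference values (commonly quoted, not taken from pySYD).
NUMAX_SUN_MUHZ = 3090.0  # microhertz
TEFF_SUN_K = 5777.0      # kelvin
LOGG_SUN_CGS = 4.438     # log10 of surface gravity in cm/s^2


def estimate_numax(logg, teff):
    """Estimate numax (microhertz) from log g (cgs) and Teff (K) using
    numax = numax_sun * (g / g_sun) * (Teff / Teff_sun) ** -0.5."""
    gravity_ratio = 10.0 ** (logg - LOGG_SUN_CGS)
    return NUMAX_SUN_MUHZ * gravity_ratio * (teff / TEFF_SUN_K) ** -0.5


# Example: a red giant with log g two dex below the solar value.
numax_giant = estimate_numax(logg=2.438, teff=4800.0)
```

An estimate like this only needs to be close enough to place the initial frequency range, since the second module refines numax afterwards.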

There is also an option to directly provide numax in the info/star_info.csv (or via command line, see advanced usage for more details), which will override the value found in the first module. This option is recommended if you think that the value found in the first module is inaccurate, or if you have a visual estimate of numax from the power spectrum.

Introduction

This was initially created to be used with the command-line tool but has been expanded to include functions compatible with Python notebooks as well. This is still a work in progress!

Imports

Pipeline modes

There are currently four operational pySYD modes (and two under development):

setup : Initializes pysyd.pipeline.setup for quick and easy setup of directories, files and examples. This mode only inherits higher level functionality and has limited CLI (see parent parser below). Using this feature will set up the paths and files consistent with what is recommended and discussed in more detail below.
load : Loads in data for a single target through pysyd.pipeline.load. Because this mode handles data, it has full access to both the parent and main parsers.
run : The main pySYD pipeline function is initialized through pysyd.pipeline.run and runs the two core modules (i.e. find_excess and fit_background) for each star consecutively. This mode operates using most CLI options, inheriting both the parent and main parser options.
parallel : Operates the same way as the previous mode, but processes stars simultaneously in parallel. Based on the number of threads available, stars are separated into groups (where the number of groups is exactly equal to the number of threads). This mode uses all CLI options, including the number of threads to use for parallelization.
display : will primarily be used for development and testing purposes as well, but
test : Currently under development but intended for developers.
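The grouping used by the parallel mode, where the number of groups equals the number of threads, can be sketched as below. The function name split_into_groups and the contiguous, near-equal split are assumptions made for illustration; pySYD may divide the stars differently.

```python
def split_into_groups(stars, n_threads):
    """Split a list of stars into exactly `n_threads` contiguous groups
    whose sizes differ by at most one."""
    if n_threads < 1:
        raise ValueError("n_threads must be at least 1")
    base, extra = divmod(len(stars), n_threads)
    groups = []
    start = 0
    for index in range(n_threads):
        # The first `extra` groups take one additional star.
        size = base + (1 if index < extra else 0)
        groups.append(stars[start:start + size])
        start += size
    return groups


# Example: seven stars spread over three threads.
groups = split_into_groups([1, 2, 3, 4, 5, 6, 7], 3)
```

If there are fewer stars than threads, this sketch returns some empty groups, so a caller would probably want to cap the thread count at the number of stars.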

Examples

`pysyd.pipeline` API

pysyd.pipeline.check(args)

Check target

This is intended as a way to check a target before running it by plotting the time series data and/or power spectrum. It works in the most basic way but has not been tested otherwise.

Parameters

args (argparse.Namespace): the command line arguments

Important

This has not been extensively tested. It is also currently identical to "load", so consider whether it is actually needed and decide which of the two is easier to understand.

pysyd.pipeline.fun(args)

Get logo output

Parameters

args (argparse.Namespace): the command line arguments

pysyd.pipeline.load(args)

Load target

Load in a given target to check data or figures

Note

This does not load in a target or target data; it only loads the information that is required to run any pySYD mode successfully (with the exception of pysyd.pipeline.setup).

Parameters

args (argparse.Namespace): the command line arguments

pysyd.pipeline.parallel(args)

Parallel execution

Run pySYD concurrently for a large number of stars

Parameters

args (argparse.Namespace): the command line arguments

Methods

pipe
