Search this site
Skip to main content
Skip to navigation
BEN CHEN's Homepage
Good Day
wiki
Big Data / Cloud
AWS
Setup SSH access on EC2
Start with S3
Lambda
Athena
Glue Studio
DynamoDB get started
DynamoDB connection Boto3 and Python
AWS RDS vs Redshift
AWS notes
Boto3 list AWS S3 bucket objectse
Azure
Data factory
Event Hubs
Databricks - send to & receive from event hubs
data lake notes
managed identity notes
DevOps deploy repo to databricks
Data factory self-hosted integration runtime
service principal
data factory - load to datalake and databricks
Data factory - incremental load - high watermark
Connect to Datalake from Python
Data factory - small things
AZ900 fundamentals from Adam
Data factory - partition files in dataflow
Azure App Service in Python
synapse mount data lake
synapse spark streaming
synapse quick notes
stream analytics job
Azure function
ARM template / Infrastructure as code
Microsoft Graph API
Microsoft Dataverse API via python
Kubernetes
k8s setup a cluster
k8s dashboard
k8s notes
hadoop
Install Hive on Hadoop
Install HBase
Install Hive with MySQL metadata store
install hadoop
firewall for hadoop on ec2
spark
learn pyspark
Streaming late arriving data
small things
Read data files from multiple sub-folders
Split string column in Spark
spark infer and update schema
Databricks
Schedule by CRON expression
REST API
SQL endpooint
Connect to cluster from PowerBI
Secrets and CLI
Query cluster or sql endpoint
Read files in repos
small things
Range query join hint
range join using pandas merge_asof
Databricks spark structure streaming
databricks database and table
delta table
databricks late arriving data
Delta Live Table
databricks notes
databricks functions mis
delta table merge / upsert
databricks parameters
Install jar file in Compute libraries
git on databricks
Download file from DBFS
Create Date Dimension
Mount Fabric Lakehouse / onelake / blob storage
Delta Lake
Delta lake vs standard parquet file
snowflake
snowflake get started
MongoDB
enconomy
2008 financial crisis
货币起源
ATO super contribution notes
李白
Finance Account Dimensions
Accounting 101
profit & loss, balanced sheet
linux mis
commands
docker img creation and run
docker intro
Dockerfile example
gunicorn py3
install docker rhel
install flask & gunicorn
Install Flask,Gunicorn & Nginx
install sklearn & xgboost
k8s notes
lvm resize logical volume
move docker default directory
nexus and docker
setup ms odbc
systemctl
Update Internet Interface in Ubuntu
Virtual Machine Network
Windows Subsystem for Linux WSL
install dash on WSL ubuntu
wsl - systemd service
linux commands
programming
codes
CNN dropout
generate gt for rbox
lstm
Neural network dropout
ocr mser
ocr pytesseract
ocr_cv2
text classification
Query Twitter API in Python
typescript
typescript environment setup
typescript datatypes
typescript function and object
typescript arrow function
typescript class
c#
C# Rest Service Call
C# Rest Service Host
C# Soap RDA
C# Thread
C# Thread Safe
Convert String to Bytes
EDIT EXCEL C#
Query AD without the 1000 limit
R
SAS
Data Step
DecisionTree
DTREE
gchart
gmap
Graph Style
Load data file
logit
model output
ODBC & SQL
Procedures
SGPLOT
react
react helloworld
react building components
Python
bytes image cv2
compile python to exe
Conda
Config
ctc loss
CUDA gpu
cv2 fillpoly
Decorator
Edge Recognition
Flask image post
flask render template
google search
GPS from photo
Heatmap
heatmap2 (meshgrid)
hello pytorch
Install Package
jupyter
jupyter dark theme
miscellaneous
miscellaneous plots
plot major minor range as ticks
matplotlib format timestamp axis
oracle conn
pandas
pandas update values by condition
pandas plot
pandas join by time range
pandas custom groupby
Open excel spreadsheet
plot img and polygon
Python web and WSGI
PyTorch
Scatter Plot
string similarity
virtual env
word cloud
selenium xpath
multi processing
subfigure / subplot
Reactive Extensions (RX)
reflection / reflective programming
Unix / Posix / Epoch time
matplotlib table
write data to ms sql through pyodbc
Get normal distribution from data
find peak and bottom values from a series
Python OLEDB query
dash
dash helloworld tutorial
dash show static plot figure
multi-page dash app
Dagster
Dagster - code structure for jobs
dagster job and op config
Bulk insert rows to SQL
interpolate grid data
SQL Server windows authentication linux python
Log log file to avoid re-running
Kernel density estimate KDE
geopandas
unzip tar file
SVM / SVC in sklearn
numpy
interpolate nan values
add numbers to different dimensions
airflow
airflow install
airflow dag and task
matrix rotation in python
regex
FastAPI
Call API with OAuth
flask website with authentication
flask api with authentication
miscellaneous
Cisco Data Virtulization
compress pdf
Git
Google Analytics API
Google Vision API
html css
html loading wheel on click
jenkins
Json schema
LightSwitch
mt4 basic
OBIEE RPD study notes
oracle regex
Oracle XML Query
Read multi-page TIFF
Silverpop
Tableau quick look
VBA query SQL Server
vlookup
oracle KEEP FIRST/LAST
jQuery
jQuery progress bar
C# async & await & Task
3rd normal form
conceptual logical & physical models
View wifi password on windows
Anaconda installation
read pdf text in python
OSI PI - query data
read write parquet in python
Json read, write and pretty print
VS Code for C# winform
NSSM
Azure Git Repo credential
move window from outside of screen
mysql
Task scheduler runs python script
osi pi asset framework quick overview
OSI PI AF SDK query
InfluxDB
dimensional modeling
asset maintenance
power bi
power bi training notes
Power BI python setup
power bi DAX
power bi direct query and performance analyzer
power bi cool visuals
parameterize powerbi data source
powerbi role and row-level security
powerbi semantic model
Enable service principal to access Fabric / onelake
ML and Math
Adam & Gradient Descent
ARIMA
auto encoder
bag of words
batch normalization
bayes and maximum likelihood
Canny Edge Detection
Chi-Square Test
connected components
convolution network design
convolution notes
Cross Validation and Grid Search
ctc loss
ctpn
Derivative table
Eigenvector Eigenvalue
embedding
encoder-decoder
Evaluation Metric
Generalized linear models
Goodness-of-fit Test
hinge loss
Hypothesis tests def
kNN
L1 & L2 Penalty
linear algebra
Linear Regression
Logistic Regression Python
logit logloss
Loss function and Eval metric
LSTM
MaximumWeightedMatching
Model measure
naive bayes
nan loss / gradient
negative log loss and cross entropy in pytorch
neural network
non-maximum supression
pack variable-length sequences
position info encoded in CNN
RBOX
Rotate vector
Sampling - Margin of Error
SMOTE
stacking model
SVM
TF IDF
training data different sizes
xgboost
特征工程
Generic Attention & Self Attention
Binomial, Multinomial, Gamma, Beta and Dirichlet
Intuition behind Beta Distribution
LDA
PCA
transformer
east
Fitting exponential decay curve
Why minimax >= maximin
Vector Auto regression VAR
Gelu
Generative Adversarial Network GAN
Generative AI
GPT introduction
SQL Server
Backup and Restore
build & publish sql project
CDC
Change server-level collation
Check DB Lock
Custom Aggregate
Database Encryption TDE
exec job from script
Geometry
partitioned table
Replication
Script index definition
split_string
SQL 2014 In-Memory Table
SQL2012_Exec_Package
SSIS 2014 Deployment
string agg
TDE encryption
TFS for VS2008
mining
open cut - truck cycle / load & haul cycle
BEN CHEN's Homepage
LDA
Notes for the LDA algorithm
Google Sites
Report abuse
Page details
Page updated
Google Sites
Report abuse